Thomas Henson

  • Data Engineering Courses
    • Installing and Configuring Splunk
    • Implementing Neural Networks with TFLearn
    • Hortonworks Getting Started
    • Analyzing Machine Data with Splunk
    • Pig Latin Getting Started Course
    • HDFS Getting Started Course
    • Enterprise Skills in Hortonworks Data Platform
  • Pig Eval Series
  • About
  • Big Data Big Questions

What’s New in Hadoop 3.0?

December 20, 2017 by Thomas Henson 1 Comment

New in Hadoop 3.0

Major Hadoop Release! Hadoop 3.0 is has dropped! There is a lot of excitement in the Hadoop community for a 3.0 release. Now is the time to find out what's new in Hadoop 3.0 so you can plan for an upgrade to your existing Hadoop clusters. In this … [Continue reading]

Filed Under: Hadoop Tagged With: Big Data Big Questions, Hadoop, HDFS

Learning Roadmap for Data Engineers?

December 19, 2017 by Thomas Henson Leave a Comment

Is there a learning Roadmap for Data Engineers? Data Engineers are highly sought after field for Developers and Administrators. One factor driving developers into that space is the average salary of 100K - 150Kwhich is well above average for IT … [Continue reading]

Filed Under: Data Engineers Tagged With: Big Data, Data Engineer Skills, Data Engineers

How to Find HDFS Path URL?

December 17, 2017 by Thomas Henson 1 Comment

Have you ever been running a script in from the HDFS command line gotten this error? Or running one of your favorite HDFS or Hadoop fs commands... Maybe you were trying to remember the HDFS URL and couldn't figure it out? Well it happens … [Continue reading]

Filed Under: Hadoop Tagged With: Hadoop, HDFS

Ultimate Hadoop Python Example

December 7, 2017 by Thomas Henson Leave a Comment

Python Hadoop Example

What are the options for using Python in Hadoop? Python developers are looking to transition their Python Skills in the Hadoop Ecosystem. In a recent episode of Big Data Big Questions I answered question about using Python on Hadoop. Let's take a … [Continue reading]

Filed Under: Hadoop Tagged With: Data Engineer, Hadoop, MapReduce, Python

Should Data Engineers Know Machine Learning Algorithms?

November 10, 2017 by Thomas Henson Leave a Comment

How involved should Data Engineers be in learning Machine Learning Algorithms?  For the past few years Data Scientist are one of the hottest jobs in IT. A huge part of what Data Scientist do is selecting Machine Learning Algorithms for projects … [Continue reading]

Filed Under: Data Engineers Tagged With: Data Engineers, Machine Learning, Mahout, Spark

Python vs. Scala Freelance Data Engineers

November 9, 2017 by Thomas Henson Leave a Comment

Which is better for Freelance Data Engineers Scala or Python? Picking up freelance gigs can be a challenge especially when just starting out. So which language is better for getting freelance gigs Scala or Python? In today's episode Big Data Big … [Continue reading]

Filed Under: Data Engineers Tagged With: Data Engineers, Freelance, Python, Scala

Book Review: Boyd the Fighter Pilot Who Changed the Art of War

November 8, 2017 by Thomas Henson Leave a Comment

Why read a fighter pilot book?  Ever heard of the OODA loop? It's the basis for agile development. Observe, Orient , Decide, and Act (OODA) is the feedback loop coined by John Boyd. The point of the loop is to go through these steps repeatedly … [Continue reading]

Filed Under: Book Review Tagged With: Book Review, Books, DevOps

Big Data Beard Podcast Announcement

November 7, 2017 by Thomas Henson Leave a Comment

How do you keep up with all the news going on in the Big Data community? Announcing the Big Data Beard Podcast, a Podcast devoted to Big Data news, architecture, and the software powering the big data ecosystem. Watch the video below to learn how I … [Continue reading]

Filed Under: Big Data Tagged With: Big Data, Big Data Beard Podcast, Data Engineers, Podcast

Isilon Quick Tips: Creating Snapshots with Isilon’s OneFS from Command Line

November 6, 2017 by Thomas Henson Leave a Comment

How do you manage OneFS snapshots from the CLI?  It's easy to use the isi snapshot snapshots commands. We have worked through setting up Isilon's OneFS Snapshots from the WebCLI in multiple Isilon Quick Tips. Let's turn our focus now to setting … [Continue reading]

Filed Under: Isilon Tagged With: Isilon, Isilon Quick Tips

Complete Pig Join Example

October 30, 2017 by Thomas Henson 2 Comments

Let's say you have two sets of structured or unstructured data. How to combine two sets of data (relations) in Pig Latin? Look at the example below. If you wanted to combine the cereal and price data sets what would you use? Pig Latin offers Joins … [Continue reading]

Filed Under: Hadoop Pig Tagged With: Apache Pig, Apache Pig Latin, Apache Pig Tutorial

Setting Up Passwordless SSH for Ambari Agent

October 26, 2017 by Thomas Henson Leave a Comment

Want to know one of the hardest part for me installing Hadoop with Ambari? Setting up Passwordless ssh for all nodes so that Ambari Agent could do the install. Looking back it might be a trivial thing to get right, but at that time my Linux skills … [Continue reading]

Filed Under: Ambari Tagged With: Ambari, Hadoop, Hadoop Distributed File System, Hortonworks, Learn Hadoop

Kappa Architecture Examples in Real-Time Processing

October 11, 2017 by Thomas Henson Leave a Comment

Kappa Architecture Examples

“Is it possible to build a prediction model based on real-time processing data frameworks such as the Kappa Architecture?” Yes we can build models based on the real-time processing and in fact there are some you use every day.... In today's … [Continue reading]

Filed Under: Big Data Tagged With: Big Data, Big Data Big Questions, IoT, Kappa

  • « Previous Page
  • 1
  • …
  • 5
  • 6
  • 7
  • 8
  • 9
  • …
  • 16
  • Next Page »

Subscribe to Newsletter

Archives

  • February 2021 (2)
  • January 2021 (5)
  • May 2020 (1)
  • January 2020 (1)
  • November 2019 (1)
  • October 2019 (9)
  • July 2019 (7)
  • June 2019 (8)
  • May 2019 (4)
  • April 2019 (1)
  • February 2019 (1)
  • January 2019 (2)
  • September 2018 (1)
  • August 2018 (1)
  • July 2018 (3)
  • June 2018 (6)
  • May 2018 (5)
  • April 2018 (2)
  • March 2018 (1)
  • February 2018 (4)
  • January 2018 (6)
  • December 2017 (5)
  • November 2017 (5)
  • October 2017 (3)
  • September 2017 (6)
  • August 2017 (2)
  • July 2017 (6)
  • June 2017 (5)
  • May 2017 (6)
  • April 2017 (1)
  • March 2017 (2)
  • February 2017 (1)
  • January 2017 (1)
  • December 2016 (6)
  • November 2016 (6)
  • October 2016 (1)
  • September 2016 (1)
  • August 2016 (1)
  • July 2016 (1)
  • June 2016 (2)
  • March 2016 (1)
  • February 2016 (1)
  • January 2016 (1)
  • December 2015 (1)
  • November 2015 (1)
  • September 2015 (1)
  • August 2015 (1)
  • July 2015 (2)
  • June 2015 (1)
  • May 2015 (4)
  • April 2015 (2)
  • March 2015 (1)
  • February 2015 (5)
  • January 2015 (7)
  • December 2014 (3)
  • November 2014 (4)
  • October 2014 (1)
  • May 2014 (1)
  • March 2014 (3)
  • February 2014 (3)
  • January 2014 (1)
  • September 2013 (3)
  • October 2012 (1)
  • August 2012 (2)
  • May 2012 (1)
  • April 2012 (1)
  • February 2012 (2)
  • December 2011 (1)
  • September 2011 (2)

Tags

Agile AI Apache Pig Apache Pig Latin Apache Pig Tutorial ASP.NET AWS Big Data Big Data Big Questions Book Review Books Data Analytics Data Engineer Data Engineers Data Science Deep Learning DynamoDB Hadoop Hadoop Distributed File System Hadoop Pig HBase HDFS IoT Isilon Isilon Quick Tips Learn Hadoop Machine Learning Machine Learning Engineer Management Motivation MVC NoSQL OneFS Pig Latin Pluralsight Project Management Python Quick Tip quick tips Scrum Splunk Streaming Analytics Tensorflow Tutorial Unstructured Data

Recent Posts

  • Tips & Tricks for Studying Machine Learning Projects
  • Getting Started as Big Data Product Marketing Manager
  • What is a Chief Data Officer?
  • What is an Industrial IoT Engineer with Derek Morgan
  • Ultimate List of Tensorflow Resources for Machine Learning Engineers

Copyright © 2025 · eleven40 Pro Theme on Genesis Framework · WordPress · Log in