Thomas Henson

Isilon Quick Tips: Setting Up Access Zones in OneFS

September 26, 2017 by Thomas Henson Leave a Comment

Is there any way to partition or containerize workloads in Isilon? 100% yes! Isilon's OneFS offers Access Zones to separate different workflows, users, AD servers, GroupNets, etc. within the same Isilon cluster. Learn to set up Access Zones in Isilon's … [Continue reading]

Filed Under: Isilon Tagged With: Isilon, Isilon Quick Tips

How to Execute HBase Script from the Command Line

September 25, 2017 by Thomas Henson Leave a Comment

In the past few posts we've been creating tables and interacting with data in HBase. What happens when we have bulk data to upload? Do we want to enter each row from the HBase Shell? Wow, that doesn't sound fun! Let's speed up … [Continue reading]

Filed Under: HBase Tagged With: HBase

Big Data Big Questions: Learning to Become a Data Engineer?

September 22, 2017 by Thomas Henson 2 Comments

For the past few years, Data Scientist has been named the sexiest job in IT. However, the Data Engineer is a huge part of the Big Data movement. Data Engineer is one of the top-paying jobs in IT. On average, a Data Engineer can make anywhere from 90K … [Continue reading]

Filed Under: Career Tagged With: Big Data, Big Data Big Questions, Data Engineer

Using HBase Scan From the HBase Shell

September 18, 2017 by Thomas Henson 1 Comment

This post continues the HBase series, picking up from the example created in the Creating a Table in HBase post. Now that we have our Asteroid Warning System table created in HBase, let's learn how to use the HBase Scan table to quickly … [Continue reading]

Filed Under: HBase Tagged With: HBase, NoSQL

HBase Error Solved – Error: Can’t get master address from ZooKeeper

September 11, 2017 by Thomas Henson 3 Comments

The inspiration for this post on the HBase Error "Error: Can't get master address from ZooKeeper" came from my work on the HBase Creating a Table blog post. Here is the quick and easy way to check for this error. Why Does This Always … [Continue reading]

Filed Under: HBase Tagged With: Errors, HBase

How to Create a Table in HBase

September 5, 2017 by Thomas Henson 4 Comments

HBase is one of the hottest non-relational databases on Hadoop right now! HBase is a NoSQL database built to work on top of the Hadoop Distributed File System (HDFS). HDFS is built on the concept of Schema-on-Read, where a schema is applied to the data on read. … [Continue reading]

Filed Under: HBase Tagged With: HBase, NoSQL

Bound vs. Unbound Data in Real Time Analytics

August 9, 2017 by Thomas Henson Leave a Comment

Breaking The World of Processing

Streaming and Real-Time analytics are pushing the boundaries of our analytic architecture patterns. In the big data community we now break down analytics processing into batch or streaming. If you glance at the top … [Continue reading]

Filed Under: Streaming Analytics Tagged With: Big Data, Real-Time Analytics, Streaming Analytics, Unstructured Data

Big Data Big Questions: Kappa Architecture for Real-Time

August 7, 2017 by Thomas Henson Leave a Comment

Should I Use Kappa Architecture For Real-Time Analytics?

Analytics architectures are challenging to design. If you follow the latest trends in Big Data, you'll see a lot of different architecture patterns to choose from. Architects have a … [Continue reading]

Filed Under: Big Data Tagged With: Big Data Big Questions, Kappa, Streaming Analytics

16 Hadoop fs Commands Every Data Engineer Must Know

July 31, 2017 by Thomas Henson 3 Comments

Commands in Hadoop

The Hadoop shell is the CLI for the Hadoop cluster. Most of the time Hadoop Administrators will find themselves using the Hadoop CLI just as much as the HDP, Ambari, or CDH management interface. Learning how to navigate and run … [Continue reading]

Filed Under: Hadoop Tagged With: Hadoop, HDFS, HDFS Commnads

13 Step By Step Apache Hive Data Types

July 24, 2017 by Thomas Henson 3 Comments

Hive is one of the leading SQL engines running on Hadoop. Hive has had a long relationship with Hadoop, supporting SQL-like syntax from the start. Even though Hive supports SQL-like syntax, there are some differences in the Hive data types vs. SQL … [Continue reading]

Filed Under: Hive Tagged With: Hadoop, Hive, SQL, SQL on Hadoop

What is a Data Lake

July 23, 2017 by Thomas Henson Leave a Comment

Explaining the Data Lake

The Enterprise space is notorious for throwing around jargon. Take the term Data Lake, for example. Does it mean there is a real lake in my data center? Because that sounds like a horrible idea. Or is a Data … [Continue reading]

Filed Under: Big Data Tagged With: Big Data, Big Data Big Questions, Data Lake, Enterprise

Python Options in Hadoop

July 14, 2017 by Thomas Henson 1 Comment

New developers in the Hadoop ecosystem often struggle to get involved because they think they need to learn Java. Where do Python and other non-Java developers turn when developing in the Hadoop ecosystem? What are the Python options in … [Continue reading]

Filed Under: Hadoop
