Thomas Henson

  • Data Engineering Courses
    • Installing and Configuring Splunk
    • Implementing Neural Networks with TFLearn
    • Hortonworks Getting Started
    • Analyzing Machine Data with Splunk
    • Pig Latin Getting Started Course
    • HDFS Getting Started Course
    • Enterprise Skills in Hortonworks Data Platform
  • Pig Eval Series
  • About
  • Big Data Big Questions

Pig String Functions

March 21, 2016 by Thomas Henson Leave a Comment

Pig String Function Series

Stuck trying to manipulate a string in Hadoop and don't want to use Java? No Problem use Pig's built in String Functions. Why Pig for ETL? Using Apache Pig in Hadoop is a must for ETL transactions. Pig allows for developer to quickly write a … [Continue reading]

Filed Under: Hadoop Pig Tagged With: Apache Pig, Apache Pig Latin, Hadoop Pig, Learn Hadoop, Pig String Series

HDFS Getting Started Course

February 22, 2016 by Thomas Henson 4 Comments

Are you ready to get some Hadoop knowledge dropped on you? Well here it is after eight long months since my last Pluralsight course. HDFS Getting Started has been launched. I couldn't be more excited to have this course released. HDFS … [Continue reading]

Filed Under: Hadoop Tagged With: Big Data, Hadoop, HDFS, Pluralsight

Top 10 Favorite Post of 2015

January 4, 2016 by Thomas Henson Leave a Comment

So long 2015 2015 is done. I love the New Year because it's always a good time to look back at what you have accomplished. 2015 presented me with new challenges and opportunities. As I was planning out my goals for next year, I wanted to look back … [Continue reading]

Filed Under: Article Tagged With: Agile, Apache Pig, Big Data, Pig Latin, Top 10

How Big Data Impacts Holiday Shopping

December 21, 2015 by Thomas Henson Leave a Comment

Christmas is a magical time of year. I still remember the Christmas when I was 7 years old. After all the gifts had been opened my parents made me take the trash out to the road. It had been a great Christmas I was very happy with all my gifts and so … [Continue reading]

Filed Under: Big Data Tagged With: Big Data

Comparing Data with Pig Latin MAX() Function

November 23, 2015 by Thomas Henson Leave a Comment

Last time we tackled how to use the Min() function in Pig and so this week we are going to learn to use the opposite function the MAX(). It's just like the MIN() function but instead of finding the lowest value in an array/column, it finds the … [Continue reading]

Filed Under: Hadoop Pig Tagged With: Apache Pig, Apache Pig Latin, Apache Pig Tutorial, Big Data

Pig Latin Eval Function MIN

September 1, 2015 by Thomas Henson Leave a Comment

If you were working in Excel could you easily find the minimum value in a column or row of data? Of course you could. Excel has a function built in to find the minimum value and many other functions. Well so does Apache Pig, you just have to learn … [Continue reading]

Filed Under: Hadoop Pig Tagged With: Apace Pig, Apache Pig Latin, Apache Pig Tutorial

Execute Pig Script from Command Line

August 3, 2015 by Thomas Henson Leave a Comment

Ready to run a Pig script with the Grunt Shell or Pig Editor?   The time has come to take the training wheels off and run a Pig script without using the Grunt Shell. At least that is how I felt when I ran my first Pig script from the … [Continue reading]

Filed Under: Hadoop Pig Tagged With: Apache Pig, Apache Pig Tutorial, Hadoop Pig

Apache Pig Eval Functions Series

July 27, 2015 by Thomas Henson 5 Comments

pig eval series

Ready to master the Apache Pig but not sure how to get started? How can I master Apache Pig? The process for mastering a programming language is that same as learning any other skills. Practice, Practice, Practice. The practice needs to be focused … [Continue reading]

Filed Under: Hadoop Pig Tagged With: Apache Pig, Apache Pig Latin, Hadoop, Pig Eval Series

Pig Eval Series: Tokenize

July 2, 2015 by Thomas Henson 1 Comment

In this Pig Eval tutorial we are going to use the Apache Pig Tokenize function. If you not familiar with the tokenize function you're probably thinking we are going to do something crazy like turn a field into a game token that can be used a Chucky … [Continue reading]

Filed Under: Hadoop Pig Tagged With: Apace Pig, Hadoop, Hadoop Pig, Pig Latin, TOKENIZE

Apache Pig Latin Tutorial

June 15, 2015 by Thomas Henson Leave a Comment

Hadoop development is  one of the top skills most desired in software development. One of the reasons is because Hadoop is early in the product life cycle. It's like getting involved with Relational Databases back in the early 80's. Huge … [Continue reading]

Filed Under: Hadoop Pig Tagged With: Apache Pig Latin, Apache Pig Tutorial, Hadoop Pig, Hadopo, Video

Learn to Process Data with Apache Pig

May 26, 2015 by Thomas Henson 2 Comments

learn to process data with Apache Pig

Apache Pig is one of the hottest languages in the Hadoop ecosystem. Right now the average salary for a Pig Developer is $124,563 according to a report released in Infoworld. A Pig developer can process both unstructured and semi-structured in … [Continue reading]

Filed Under: Hadoop Pig Tagged With: Apache Pig, Big Data, Hadoop, Hadoop Pig

Pig Latin Concatenation Function

May 18, 2015 by Thomas Henson 1 Comment

Pig Latin Concati

Today we are going to talk about how to concatenate fields using Pig Latin. For this week's example we are going to use a different data set than we have used in the Apache Pig Latin Eval Function series. Our new data set is a sample data set … [Continue reading]

Filed Under: Hadoop Pig Tagged With: Apache Pig, Big Data, Hadoop, Hadoop Pig

  • « Previous Page
  • 1
  • …
  • 10
  • 11
  • 12
  • 13
  • 14
  • …
  • 16
  • Next Page »

Subscribe to Newsletter

Archives

  • February 2021 (2)
  • January 2021 (5)
  • May 2020 (1)
  • January 2020 (1)
  • November 2019 (1)
  • October 2019 (9)
  • July 2019 (7)
  • June 2019 (8)
  • May 2019 (4)
  • April 2019 (1)
  • February 2019 (1)
  • January 2019 (2)
  • September 2018 (1)
  • August 2018 (1)
  • July 2018 (3)
  • June 2018 (6)
  • May 2018 (5)
  • April 2018 (2)
  • March 2018 (1)
  • February 2018 (4)
  • January 2018 (6)
  • December 2017 (5)
  • November 2017 (5)
  • October 2017 (3)
  • September 2017 (6)
  • August 2017 (2)
  • July 2017 (6)
  • June 2017 (5)
  • May 2017 (6)
  • April 2017 (1)
  • March 2017 (2)
  • February 2017 (1)
  • January 2017 (1)
  • December 2016 (6)
  • November 2016 (6)
  • October 2016 (1)
  • September 2016 (1)
  • August 2016 (1)
  • July 2016 (1)
  • June 2016 (2)
  • March 2016 (1)
  • February 2016 (1)
  • January 2016 (1)
  • December 2015 (1)
  • November 2015 (1)
  • September 2015 (1)
  • August 2015 (1)
  • July 2015 (2)
  • June 2015 (1)
  • May 2015 (4)
  • April 2015 (2)
  • March 2015 (1)
  • February 2015 (5)
  • January 2015 (7)
  • December 2014 (3)
  • November 2014 (4)
  • October 2014 (1)
  • May 2014 (1)
  • March 2014 (3)
  • February 2014 (3)
  • January 2014 (1)
  • September 2013 (3)
  • October 2012 (1)
  • August 2012 (2)
  • May 2012 (1)
  • April 2012 (1)
  • February 2012 (2)
  • December 2011 (1)
  • September 2011 (2)

Tags

Agile AI Apache Pig Apache Pig Latin Apache Pig Tutorial ASP.NET AWS Big Data Big Data Big Questions Book Review Books Data Analytics Data Engineer Data Engineers Data Science Deep Learning DynamoDB Hadoop Hadoop Distributed File System Hadoop Pig HBase HDFS IoT Isilon Isilon Quick Tips Learn Hadoop Machine Learning Machine Learning Engineer Management Motivation MVC NoSQL OneFS Pig Latin Pluralsight Project Management Python Quick Tip quick tips Scrum Splunk Streaming Analytics Tensorflow Tutorial Unstructured Data

Recent Posts

  • Tips & Tricks for Studying Machine Learning Projects
  • Getting Started as Big Data Product Marketing Manager
  • What is a Chief Data Officer?
  • What is an Industrial IoT Engineer with Derek Morgan
  • Ultimate List of Tensorflow Resources for Machine Learning Engineers

Copyright © 2025 · eleven40 Pro Theme on Genesis Framework · WordPress · Log in