Thomas Henson - Making Hadoop Simple

Execute Pig Script from Command Line

Ready to run a Pig script with the Grunt Shell or Pig Editor?   The time has come to take the training wheels off and run a Pig script without using the Grunt Shell. At least that is how I felt when I ran my…

Continue reading →

Apache Pig Eval Functions Series

Ready to master the Apache Pig but not sure how to get started? How can I master Apache Pig? The process for mastering a programming language is that same as learning any other skills. Practice, Practice, Practice. The practice needs to be focused and using…

Continue reading →

Pig Eval Series: Tokenize

In this Pig Eval tutorial we are going to use the tokenize function. If you not familiar with the tokenize function you’re probably thinking we are going to do something crazy like turn a field into a game token that can be used a Chucky Cheese…

Continue reading →

Apache Pig Latin Tutorial

Hadoop development is  one of the top skills most desired in software development. One of the reasons is because Hadoop is early in the product life cycle. It’s like getting involved with Relational Databases back in the early 80’s. Huge opportunity!! Do I need to…

Continue reading →

Learn to Process Data with Apache Pig

Apache Pig is one of the hottest languages in the Hadoop ecosystem. Right now the average salary for a Pig Developer is $124,563 according to a report released in Infoworld. A Pig developer can process both unstructured and semi-structured in Hadoop. If you like the sound of learning…

Continue reading →

Pig Latin Concatenation Function

Today we are going to talk about how to concatenate fields using Pig Latin. For this week’s example we are going to use a different data set than we have used in the Apache Pig Latin Eval Function series. Our new data set is a…

Continue reading →

Pig Latin Average Function

Is there a function built in Pig Latin that will average a particular field? The answer is yes, Pig Latin has an Eval function that averages fields. How does it compare to SQL? If you are familiar with SQL then you know about the AVG()…

Continue reading →

Pig Latin Getting Started Course

Guess whose first Pluralsight course was released today? This guy’s: It’s been an incredible journey shooting my first Pluralsight course. I’ve certainly learned a great deal throughout the process. For the last month I’ve been recording away and trying to get my first course ready (“Pig Latin: Getting…

Continue reading →

Pig Latin Data Types

How can you define data types in your Pig Latin scripts? What data types are allowed in Pig Latin? Is everything just a string? The good news is that everything is not a string, and if you have any Java experience the data types will…

Continue reading →

How to Sum a Field with Apache Pig

What is the quickest way to get a sum of a particular set of fields in Pig Latin? Answer: by using the sum operator. Let’s suppose we are working on at a table of population data that looks like the following: The column’s headers: Year…

Continue reading →

Page 1 of 6 1 2 3 4 5 ... Last →