Thomas Henson

  • Data Engineering Courses
    • Installing and Configuring Splunk
    • Implementing Neural Networks with TFLearn
    • Hortonworks Getting Started
    • Analyzing Machine Data with Splunk
    • Pig Latin Getting Started Course
    • HDFS Getting Started Course
    • Enterprise Skills in Hortonworks Data Platform
  • Pig Eval Series
  • About
  • Big Data Big Questions

Archives for February 2016

HDFS Getting Started Course

February 22, 2016 by Thomas Henson 4 Comments

Are you ready to get some Hadoop knowledge dropped on you?

Well here it is after eight long months since my last Pluralsight course.

HDFS Getting Started has been launched. I couldn’t be more excited to have this course released.

HDFS Getting Started

HDFS Getting Started is baseline course for anyone working with Hadoop. Starting development with Hadoop is easy when testing in your local sandbox but what happens when it’s time to go from testing to production?

Hadoop management and orchestration is hard. Most task are accomplished from the command line. Even something as simple as moving data from your local machine into HDFS can seem complicated.

What’s HDFS Getting Started about?

My new Pluralsight course, HDFS Getting Started, walks through real life examples of moving data around in the Hadoop Distributed File System (HDFS). Learning to use the hdfs dfs commands will ensure you have the baseline Hadoop skills needed to excel in the Hadoop ecosystem.

Structured data is all around us in the form of relational databases. In this course we will ingest data from MySQL database into HDFS using Sqoop. Walk through a quick tutorial of writing a Sqoop script to move structured stock market data in MySQL into HDFS.

Pig and Hive are great ways to structure data in HDFS for analysis, but moving that data around in HDFS can get tricky. In this course we walk through using both applications to analyze stock market data. All from the Hive and Pig command lines.

Hbase is another hot application in the Hadoop ecosystem. Do you know how to move data from HDFS into HBase? In HDFS Getting Started learn to take our stock market data index it and move it into HBase by writting a Pig script.

How is the Course Broken down?

HDFS Getting started is broken down into six modules. The modules cover different applications and how they use HDFS to query/ingest/manipulate/move data in Hadoop.

HDFS Getting Started Modules

  1. Understanding HDFS
  2. Creating, Manipulating and Retrieving HDFS Files
  3. Transferring Relational Data to HDFS using Sqoop
  4. Querying Data with Hive and Pig
  5. Processing Sparse Data with HBase
  6. Automating Basic HDFS Operations

Let me know if you have any questions about the course or a suggestion for a new course.

Filed Under: Hadoop Tagged With: Big Data, Hadoop, HDFS, Pluralsight

Subscribe to Newsletter

Archives

  • February 2021 (2)
  • January 2021 (5)
  • May 2020 (1)
  • January 2020 (1)
  • November 2019 (1)
  • October 2019 (9)
  • July 2019 (7)
  • June 2019 (8)
  • May 2019 (4)
  • April 2019 (1)
  • February 2019 (1)
  • January 2019 (2)
  • September 2018 (1)
  • August 2018 (1)
  • July 2018 (3)
  • June 2018 (6)
  • May 2018 (5)
  • April 2018 (2)
  • March 2018 (1)
  • February 2018 (4)
  • January 2018 (6)
  • December 2017 (5)
  • November 2017 (5)
  • October 2017 (3)
  • September 2017 (6)
  • August 2017 (2)
  • July 2017 (6)
  • June 2017 (5)
  • May 2017 (6)
  • April 2017 (1)
  • March 2017 (2)
  • February 2017 (1)
  • January 2017 (1)
  • December 2016 (6)
  • November 2016 (6)
  • October 2016 (1)
  • September 2016 (1)
  • August 2016 (1)
  • July 2016 (1)
  • June 2016 (2)
  • March 2016 (1)
  • February 2016 (1)
  • January 2016 (1)
  • December 2015 (1)
  • November 2015 (1)
  • September 2015 (1)
  • August 2015 (1)
  • July 2015 (2)
  • June 2015 (1)
  • May 2015 (4)
  • April 2015 (2)
  • March 2015 (1)
  • February 2015 (5)
  • January 2015 (7)
  • December 2014 (3)
  • November 2014 (4)
  • October 2014 (1)
  • May 2014 (1)
  • March 2014 (3)
  • February 2014 (3)
  • January 2014 (1)
  • September 2013 (3)
  • October 2012 (1)
  • August 2012 (2)
  • May 2012 (1)
  • April 2012 (1)
  • February 2012 (2)
  • December 2011 (1)
  • September 2011 (2)

Tags

Agile AI Apache Pig Apache Pig Latin Apache Pig Tutorial ASP.NET AWS Big Data Big Data Big Questions Book Review Books Data Analytics Data Engineer Data Engineers Data Science Deep Learning DynamoDB Hadoop Hadoop Distributed File System Hadoop Pig HBase HDFS IoT Isilon Isilon Quick Tips Learn Hadoop Machine Learning Machine Learning Engineer Management Motivation MVC NoSQL OneFS Pig Latin Pluralsight Project Management Python Quick Tip quick tips Scrum Splunk Streaming Analytics Tensorflow Tutorial Unstructured Data

Follow me on Twitter

My Tweets

Recent Posts

  • Tips & Tricks for Studying Machine Learning Projects
  • Getting Started as Big Data Product Marketing Manager
  • What is a Chief Data Officer?
  • What is an Industrial IoT Engineer with Derek Morgan
  • Ultimate List of Tensorflow Resources for Machine Learning Engineers

Copyright © 2023 · eleven40 Pro Theme on Genesis Framework · WordPress · Log in

 

Loading Comments...