Today there are so many applications and frameworks in the Hadoop ecosystem, most of which are written in Java. So does this mean anyone wanting to become a Hadoop developer or Big Data Developer must learn Java? Should you go through hours and weeks of training to learn Java to become an awesome Hadoop Ninja or Big Data Developer? Will not knowing Java hinder your Big Data career? Watch this video and find out.
Transcript Of The Video
(forgive any errors it was transcribe by a machine)
Hi I’m Thomas Henson with ThomasHenson.com and today we’re starting a new series called big data big questions. This is a series where I’m going to answer questions all from the community all about big data. So feel free to submit your questions and at the end of this episode I’ll show you how.So today the first question I have is a very common question a lot of people ask do you need to know Java in order to be a big data developer find out the answer right after this.
Do you need to know Java in order to be a big data developer the simple answer is no. Maybe that was the case in early Hadoop 1.0 but even then there were a lot of tools that were being created like Pig, Hive and HBase that are all using different syntax so that you can extrapolate and kind of abstract away Java. Because the key is I mean if you’re a data analyst or a Hadoop administrator most of those people aren’t going to have Java skills. And so for the community to really move forward with this big data and to do we needed to be able to say that it was a tool that not only Java developers were going to be able to use. And so that’s where pig and hive and a lot of those other tools came in.
Now as we start to look into the new 2.0 and a new 3 Hadoop it’s really not the case. Now Java is not going to hinder you right so it’s going to be beneficial if you do know it. But I don’t think it’s something that you would want to run out and have to learn just to be able to become a big data developer.
Then the question is – when you say Big Data developer what are we really talking about? So are we talking about somebody that’s you know writing MapReduce jobs or writing Spark jobs that’s you know where we look at as a big data developer. Or are we talking about maybe a data scientist. Where a data scientist is probably using more like R and Python and some of those skills to pull their insights back. And then of course your Hadoop administrators they don’t need to know Java it’s beneficial if they know Linux and some of the other pieces but Java’s not really necessary no we’ll stay in a lot of the technology.
So if you look at getting out of the Hadoop world but start looking at spark spark has Java so you can write you know you can write your spark jobs in Java but you can also do it in Python and Scala. It’s not a requirement for people to have Java and I would say that there’s a lot of developers out there that are big data developers that don’t have any Java skills. And that’s quite ok so don’t let that hinder you jump in join an open-source community project do something to expand your big data knowledge and become a big data developer.
Well that’s all we have today make sure to submit your question so I’ve got a space on my blog where you can submit the questions or just submit them here in the comment section and I’ll answer your big data big questions see you again.