Hadoop: The Definitive Guide by Tom White - Comprehensive Big Data Solutions for Enterprise Analytics, Cloud Computing & Distributed Storage | Perfect for IT Professionals, Data Engineers & Developers
Hadoop: The Definitive Guide by Tom White - Comprehensive Big Data Solutions for Enterprise Analytics, Cloud Computing & Distributed Storage | Perfect for IT Professionals, Data Engineers & Developers

Hadoop: The Definitive Guide by Tom White - Comprehensive Big Data Solutions for Enterprise Analytics, Cloud Computing & Distributed Storage | Perfect for IT Professionals, Data Engineers & Developers

$48.24 $64.32 -25% OFF

Free shipping on all orders over $50

7-15 days international

12 people viewing this product right now!

30-day free returns

Secure checkout

38982140

Guranteed safe checkout
amex
paypal
discover
mastercard
visa
apple pay

Reviews

******
- Verified Buyer
This book is the single best source to begin your career in Big Data Development. However this book should not be the first entry point, which will frustrate you. This review hopes to help the juniors and newbies, who want to enter the big data world.Cloudera CCD-410 certification ranges between tough to very tough. Period.TRAINING : You are not mandated to take a training. I took a relatively inexpensive training ($300) from edureka dot in, an online training website in India. They give a good overview at 10,000 feet are very good for the price,but no where close enough to get certified. Check out their first session available for free at Youtube. They do have steps to install your own VM, simple project , HIVE,PIG etc. If time and money permits, I strongly suggest going to official cloudera training. It costs about $3000 and includes a free test voucher , so effectively about $2700. Saves you months in preparation time and distinct advantage over your peers that should pay for itself.Install VM, try few commands, PIG, hive commands, Also try Amazon elastic mapreduce which reduces lot of manual typing and allows you to focus on the coding itself.LEARNING FROM THIS BOOK: After a training, start with this book. The first Eight chapters are critical (Approximately 300 out of 550 pages). If you are smart,sharp and young , expect to read these eight chapters about three times, more is just fine. Add some time to read rest of chapters once Or twice before the test and all the external links. If you are a busy professional, give a six month window to take the test. Knowing Java is a definitive plus. Buy the Cloudera mock examination after getting comfortable and familiar with Mapreduce($125). It is a nice resource. Explains every answer, links to where you can get more information . Just as an FYI, the real test was far more complex and difficult.SCENARIOS BASED ON A MAPREDUCE CODE:You will need to go through the example code, understand what each line does, why it is there, what happens if you comment out a line of the code. As an example, job.setMapOutputKeyClass(Text.class); job.setMapOutputValueClass(Text.class); return job.waitForCompletion(false) ? 0 : -1;> What does waitForCompletion mean?,> Is Reduce Job Must Or Optional ?> How Many Files will running a Map job produce?> Will the code compile or will it error at run time based on datatypes.?> What will happen if you run the same job twice ?> What happens to the map data after the job?> How does Hadoop handle huge files that cross block boundaries ?> What happens if you do not explicitly set a mapper or reducer ?> Will a combiner help , based on a scenario ?> Which daemon decides the number of Map job to run ?> How does hadoop handle the blocks when a node crashes?SCENARIOS BASED ON HIVEQL:This is an extension of previous scenarios. A small table, a simple SQL query ( example : select stationid,max(temp) from tableX. Answer choice are four set of mapreduce code and you have to chose the right one. Expect to read and understand the mapreduce that emulates how you create a distinct, how you do a sum, average, max, min etc. According to Cloudera website, these are the percentage of questions.CHAPTER 3 : 17 PercentCHAPTER 4 : 6 PercentCHAPTER 5 : 7 PercentCHAPTER 6 : 18 PercentCHAPTER 7 : 6 PercentCHAPTER 8 : 7 PercentPIG /HIVE/SQOOP/Zookeeper : 8 percent combined (no Hbase)Chapter no 2 has no reference but is very important. Expect several questions from that chapter since it gives a good overview. Remaining is all the links that cloudera suggests to read and get familier. SQOOP import syntax, creating a hive table via sqoop , creating and populating hive table via sqoop are must knows.WHY GETTING CERTIFIED:I have heard the tiring argument that certification is purely academic. Tell that to your doctor or your Dentist. Sound fundamentals are the foundations behind real world experience. Big Data is no different. Understanding the basics will give the confidence; experience will follow while you keep your client happy.WHY BIG DATA :My interest on Big Data was spooked by the Harvard Business Review Article claiming that "Data Scientist" was the hottest job of the 21st century. Follow that by googling for "Rayid Ghani", claimed as the data scientist behind Obama's second term victory.hbr dot org forwardslash 2012 forwardslash 10 forwardslash data-scientist-the-sexiest-job-of-the-21st-century forwardslash ar forwardslash1OTHER CHOICES :> Coursera provides a free course "Introduction To Data Science". I signed up for their first batch but could not finish with office commitments.> Youtube for "Stanford University Hadoop" by Amr AwadallahI was impressed with these books; You also might like them.> Big Data: A Revolution That Will Transform How We Live, Work and Think> Big Data at Work: Dispelling the Myths, Uncovering the Opportunities> Data Science for Business: What you need to know about data mining and data-analytic thinkingSUMMARY:Some day Big Data will become a commodity skillset,but not now. I did a search in glassdoor to see the demand for Hadoop vs some other hot ones. Hadoop is head and shoulders above the rest.Hadoop - 30,011 postings on Apr 2014Oracle DBA - 9227 postings ( A Perpetual hot skillset)Salesforce - 9968 postingsPlease post any questions in the comment section and I will certainly try to answer them.
We use cookies to enhance your browsing experience, serve personalized ads or content, and analyze our traffic. By clicking "Allow cookies", you consent to our use of cookies. More Information see our Privacy Policy.
Top