If you’re a data scientist ready to tackle statistical and machine learning techniques across large data sets, this practical guide provides a solid introduction to the world of clustered computing and analytics with Hadoop.
Instead of deployment, operations, or software development, this book focuses on particular analyses you can build, the data warehousing techniques that Hadoop provides, and the higher order data workflows it can produce.
In the first two chapters of Data Analytics with Hadoop, you will learn how data is transforming business and society and Hadoop as an operating system for big data.
Get the first two chapters free:
- Chapter 1: The Age of the Data Product
- Chapter 2: An Operating System for Big Data