eBooks

Data Analytics with Hadoop, Zaloni Preview Edition

Issue link: https://resources.zaloni.com/i/790569

Contents of this Issue


Table of Contents

Preface  v

1. The Age of the Data Product  11
  What is a Data Product?  12
  Building Data Products at Scale with Hadoop  13
  Leveraging Large Datasets  14
  Hadoop for Data Products  15
  The Data Science Pipeline and the Hadoop Ecosystem  16
  Big Data Workflows  18
  Building Data Products with Hadoop  19

2. An Operating System for Big Data  21
  Basic Concepts  22
  Hadoop Architecture  23
  A Hadoop Cluster  25
  Hadoop Distributed File System (HDFS)  28
  Yet Another Resource Negotiator (YARN)  30
  Working with a Distributed File System  30
  Basic File System Operations  31
  File Permissions in HDFS  34
  Other HDFS Interfaces  34
  Working with Distributed Computation  35
  MapReduce: A Functional Programming Model  36
  MapReduce: Implemented on a Cluster  38
  Beyond a Map and Reduce: Job Chaining  45
  Submitting a MapReduce Job to YARN  46
  Conclusion  48
