Hadoop Application Architectures Chapter Preview

February 23, 2017

Get expert guidance on architecting end-to-end data management solutions with Apache Hadoop. While many sources explain how to use various components in the Hadoop ecosystem, this practical book takes you through architectural considerations necessary to tie those components together into a complete tailored application, based on your particular use case.

This book covers:

  • Factors to consider when using Hadoop to store and model data
  • Best practices for moving data in and out of the system
  • Data processing frameworks, including MapReduce, Spark, and Hive
  • Common Hadoop processing patterns
  • Giraph, GraphX, and other tools for large graph processing on Hadoop
  • Using workflow orchestration and scheduling tools such as Apache Oozie
  • Near-real-time stream processing
  • Architecture examples for clickstream analysis, fraud detection, and data warehousing
Previous Flipbook
Creating a Data-Driven Organization Chapter Preview
Creating a Data-Driven Organization Chapter Preview

Practical Advice From the Trenches

No More Flipbooks

Looking to accelerate your data?

Contact Us