Greg Wood

  • Elasticsearch as a Hive Datastore: Is it a Stretch?

    Elasticsearch as a Hive Datastore: Is it a Stretch?

    Elasticsearch is a fully capable data store with many of the resiliency features of HDFS underlying its robust search functionality, but it should only store some of your data.

    Learn More
  • Hive Basics - Elasticsearch Integration

    Hive Basics - Elasticsearch Integration

    The concerns and benefits of using the Elasticsearch-Hadoop connector for extending the existing external table structures of Hive. Greg Wood explains how to set up your own ES-Hadoop connector.

    Learn More
  • How to Ingest XML into Hive for Easy SQL Queries

    How to Ingest XML into Hive for Easy SQL Queries

    How can you use standard Hadoop components to ingest common business-facing data sources as quickly as easily as possible? Start by ingesting XML into Hive for easy SQL queries.

    Learn More
  • 5 Guidelines for Building a Successful Data Catalog

    5 Guidelines for Building a Successful Data Catalog

    Navigate the muddy waters of building an actionable data catalog, especially in the cloud, and increase your chances of success by following these guidelines.

    Learn More
  • Top Streaming Technologies for Data Lakes and Real-Time Data

    Top Streaming Technologies for Data Lakes and Real-Time Data

    More than ever, streaming technologies are at the forefront of the Hadoop ecosystem. This post is meant to provide a basic overview of the various ways Hadoop technologies fit into the data lake.

    Learn More
  • loading
    Loading More...