Greg Wood

  • 4 Top Data Governance Observations from DGIQ 2017

    4 Top Data Governance Observations from DGIQ 2017

    Data governance is a huge, important, complex topic, but absolutely one worth taking the effort to consider. Here are several key points from DGIQ17 to begin this thought exercise.

    Learn More
  • Elasticsearch as a Hive Datastore: Is it a Stretch?

    Elasticsearch as a Hive Datastore: Is it a Stretch?

    Elasticsearch is a fully capable data store with many of the resiliency features of HDFS underlying its robust search functionality, but it should only store some of your data.

    Learn More
  • Hive Basics - Elasticsearch Integration

    Hive Basics - Elasticsearch Integration

    The concerns and benefits of using the Elasticsearch-Hadoop connector for extending the existing external table structures of Hive. Greg Wood explains how to set up your own ES-Hadoop connector.

    Learn More
  • Applied Data Lakes: Building a 360° View of Your Customer

    Applied Data Lakes: Building a 360° View of Your Customer

    Data lakes are the perfect solution to managing, governing and preparing data to build a 360-degree view of your customer.

    Learn More
  • How to Ingest XML into Hive for Easy SQL Queries

    How to Ingest XML into Hive for Easy SQL Queries

    How can you use standard Hadoop components to ingest common business-facing data sources as quickly as easily as possible? Start by ingesting XML into Hive for easy SQL queries.

    Learn More
  • 5 Guidelines for Building a Successful Data Catalog

    5 Guidelines for Building a Successful Data Catalog

    Navigate the muddy waters of building an actionable data catalog, especially in the cloud, and increase your chances of success by following these guidelines.

    Learn More
  • Train Your (Hadoop) Elephant with Fewer Data Lake Management and Governance Tools

    Train Your (Hadoop) Elephant with Fewer Data Lake Management and Governance Tools

    In the past year, the focus of big data has expanded from creating new streaming and computing frameworks into creating ways to manage and control these frameworks. Unfortunately, none of the...

    Learn More
  • Top Streaming Technologies for Data Lakes and Real-Time Data

    Top Streaming Technologies for Data Lakes and Real-Time Data

    More than ever, streaming technologies are at the forefront of the Hadoop ecosystem. This post is meant to provide a basic overview of the various ways Hadoop technologies fit into the data lake.

    Learn More
  • Data Fracking: Going Deep into the Data Lake Using Drill

    Data Fracking: Going Deep into the Data Lake Using Drill

    Your data lake is finally live. After months and months of planning, designing, tinkering, configuring and reconfiguring, your company is ready to see the fruits of your labor. There’s just one...

    Learn More
  • So, You Want to Be a Tech Visionary? An Executive Guide to Data Lakes

    So, You Want to Be a Tech Visionary? An Executive Guide to Data Lakes

    You’ve heard it time and time again: cloud is the future; those who don’t adopt modern big data practices will fall behind the pack; the next wave of IT disruption is right around the corner. And...

    Learn More
  • loading
    Loading More...