Adam Diaz

Big data & Hadoop thought-leader

  • Up Your Game: How to Rock Data Quality Checks in the Data Lake

    Up Your Game: How to Rock Data Quality Checks in the Data Lake

    Common sense tells us one can’t use data unless its quality is understood. Data quality checks are critical for the data lake – but it’s not unusual for companies to initially gloss over this...

    Learn More
  • Integrating Big Data Platforms with Zaloni: REST API

    Integrating Big Data Platforms with Zaloni: REST API

    In conversations about the Zaloni Data Lake Management Platform, the most common question we hear is “Can it integrate with product X?” The answer is yes!

    Learn More
  • Managing Memory is Easier Using YARN

    Managing Memory is Easier Using YARN

    There is a long list of items that can be tuned in Hadoop, but understanding how each daemon uses memory in Hadoop is fundamental to effective tuning. Daemons launch JVMs (Java Virtual Machines)...

    Learn More
  • Big Data Maturity Stages: Is Your Data Ready to Be a Product?

    Big Data Maturity Stages: Is Your Data Ready to Be a Product?

    The idea of turning your business data into a product, also termed “data as a product,” is a known concept that I didn’t invent. It has been documented well by many groups with various well formed...

    Learn More
  • Tez and LLAP Improvements to Make Hive Faster

    Tez and LLAP Improvements to Make Hive Faster

    Before the days of Spark, there was a huge Cloudera vs Hortonworks fight over what was to be the SQL/RDBMS based solution on Hadoop. Hortonworks having a choke hold on the Hive project espoused...

    Learn More
  • The Best Ways to Get Started with HCatalog

    The Best Ways to Get Started with HCatalog

    HCatalog, also called HCat, is an interesting Apache project. It has the unique distinction of being one of the few Apache projects that were once a part of another project, became its own...

    Learn More
  • loading
    Loading More...