With our latest data lake management platform updates, we’re pushing the boundaries of what has up until now typically defined a data lake: Hadoop. Why are we moving in this direction? Because it makes sense for our clients, who need a solution to centralize management of data from siloed data systems, legacy databases and hybrid architectures. Our solutions support the concept of a data lake beyond Hadoop to encompass a more holistic, enterprise-wide approach. By constructing a “logical” data lake architecture versus a physical one, we can give companies transparency into all of their data regardless of its location, enable application of enterprise-wide governance capabilities, and allow for expanded, controlled access for self-serve business users across the organization.
Centralized command center
Governance, siloed systems, security, privacy and enabling broader yet controlled access to data continue to be significant challenges for most enterprises when it comes to data. With this new release, our platform continues to enable enterprises to design pipelines for complex data formats from ingestion to analytics, and simplify and automate data preparation processes so that analysts can spend more time finding insights. This new version builds on these capabilities and provides more advanced and customizable dashboards, making it the “command center” for governing data within the data lake. The update to our self-service data capability, provides a better, more intuitive user experience, enabling users to identify the data they want, faster.
Data Lake Management Platform: More visibility and governance
There are two key updates for our platform that are most exciting. First, we provide more ways to manage and leverage all the data in an organization. It is now easier to ingest data, work with data and manage metadata for nearly any data source and type, such as complex data formats and natively ingested formats, including Avro, JSON and VSAM. More support for relational database management system (RDBMS) connections allows engineers to easily sync thousands of tables at once while still providing fine-grained control and governance for processing change data, thresholds for parallelism, etc. And, it provides increased flexibility for the types of data source and amount of data that can be synced with the self-service metadata catalog.
Second, we’ve designed a new Metadata Exchange Framework that enables the platform to sync metadata with third-party metadata and governance tools and security frameworks to help organizations ensure their data lake complies with corporate standards defined at the enterprise level.
Self-service capability: Metadata display your way
The new platform release gives users more customization options so they can get to the data they want, faster. Features include a more customizable UI that enables more granular and flexible discovery, creation and prioritization of custom metadata attributes, more information about data lineage, a more targeted display of data catalog results, and integration with data visualization tools.
For more details about new features, read the press release.
About the AuthorMore Content by Kelly Hopkins Schupp