본문 바로가기

분류 전체보기377

OpenMetadata OpenMetadata: Data Discovery, Profiling, Collaboration, Lineage. (open-metadata.org) OpenMetadata: Data Discovery, Profiling, Collaboration, Lineage. An end-to-end metadata management solution that includes data catalog, data discovery, governance, data quality, observability, and people collaboration. open-metadata.org All Data in One Place A central store to integrate metadata from different s.. 2022. 10. 20.
Project Nessie Project Nessie: Transactional Catalog for Data Lakes with Git-like semantics Transactional Catalog for Data Lakes Git-inspired data version control Cross-table transactions and visibility Open data lake approach, supporting Hive, Spark, Dremio, AWS Athena, etc. Works with Apache Iceberg and Delta Lake tables Run as a docker image, AWS Lambda or fork it on GitHub Get in touch via our Google Group.. 2022. 10. 20.
Trino DB Trino is a distributed SQL query engine designed to query large data sets distributed over one or more heterogeneous data sources. Trino | Distributed SQL query engine for big data Distributed SQL query engine for big data Trino is a high performance, distributed SQL query engine for big data. trino.io Overview# To understand Trino, you must first understand the terms and concepts used throughou.. 2022. 10. 20.
Presto https://prestodb.io/ Presto | Distributed SQL Query Engine for Big Data Distributed SQL Query Engine for Big Data prestodb.io Presto: Fast and reliable SQL query engine for data analytics and the open lakehouse For data engineers who struggle with managing multiple query languages and interfaces to siloed databases and storage, Presto is the fast and reliable engine that provides one simple ANSI.. 2022. 10. 19.