DuckLake
DuckLake is an open table format designed for data lakes, created by the team behind DuckDB, that aims to simplify lakehouse architecture by using a standard SQL database to manage all metadata instead of complex file hierarchies.
SQLMesh
SQLMesh is a data transformation framework that helps manage SQL-based data pipelines with version control, testing, and deployment capabilities.
ClickHouse
ClickHouse is a high-performance, columnar OLAP (Online Analytical Processing) database designed for fast analytics on large datasets. It operates on a client-server model and is optimized for low-latency queries over billions of rows.
PeerDB
PeerDB is a data synchronization and replication tool designed to move data efficiently between databases and data warehouses. It enables real-time or batch ETL/ELT workflows by capturing changes (via CDC - Change Data Capture) from source databases (e.g., PostgreSQL) and syncing them to destinations like DuckDB, Snowflake, or data lakes.
Metabase
Metabase is an open-source business intelligence (BI) and data visualization tool that allows users to ask questions about their data using a simple interface—no SQL required for basic queries.
Label Studio
Label Studio is an open-source data labeling and annotation tool designed to simplify the creation of high-quality training datasets for machine learning and artificial intelligence models.

