* [lakeFS v1.81.0](https://github.com/treeverse/lakeFS) – Tool that transforms object storage into a Git-like repository for managing data lakes. * [DataOps Data Quality TestGen 5.48.0](https://github.com/DataKitchen/dataops-testgen) – Tool for generating and executing data quality tests through profiling, hygiene reviews, and ongoing anomaly monitoring. * [ktx v0.8.0](https://github.com/Kaelio/ktx-ai-data-agents-context) – Self-improving context layer that maps warehouses, ingests company knowledge, builds semantic metrics, and serves agents for accurate data queries. * [ktx v0.8.0](https://github.com/Kaelio/ktx) – Self-improving context layer that builds semantic models, maps warehouse schema, and serves agents with approved metric-driven SQL. * [altimate-code v0.8.0](https://github.com/AltimateAI/altimate-code) – Deterministic data engineering harness offering SQL analysis, column-level lineage, dbt integration, FinOps, and multi-cloud warehouse connectivity. * [ktx v0.7.0](https://github.com/Kaelio/ktx-ai-data-agents-mcp-context-skills) – Self-improving context layer that builds semantic warehouse knowledge, maps datasets, and enables agents to query metrics accurately. * [Duckle v0.1.0](https://github.com/SouravRoy-ETL/duckle) – Local-first ETL/ELT studio with a drag-and-drop visual pipeline designer, on-device AI assistant, and DuckDB execution. * [Dataform Core 3.0.59](https://github.com/dataform-co/dataform) – Meta-language extending SQL for creating tables and workflows with dependency management and data quality testing in BigQuery. * [Qwery v0.2.3](https://github.com/Guepard-Corp/qwery-core) – Terminal-based AI data analyst converting natural-language queries into SQL, executing them locally, and returning answers. * [DataKitchen Data Observability Installer latest](https://github.com/DataKitchen/data-observability-installer) – Installer and quickstart setup for DataKitchen's data quality and observability product suite. * [Conduit v0.15.0-nightly.2026...](https://github.com/ConduitIO/conduit) – Data streaming tool for building and running real-time data pipelines with connectors and processors.