data-diff
Codemonkey
2022-06-23
11753 次浏览 · 117 次点赞
Data Diff is an open-source package that can be run in a CLI or wrapped into any data orchestrator such as Airflow, Dagster, etc. Compare datasets quickly (seconds/minutes) at a large (millions/billions of rows) scale across different databases.