These prebuilt wheel files can be used to install our Python packages as of a specific commit.
Built at 2026-06-30T14:23:09.173111+00:00.
{
"timestamp": "2026-06-30T14:23:09.173111+00:00",
"branch": "fix/oversized-query-properties-truncation",
"commit": {
"hash": "a75210a84b5e8e2676c1e4e5f8c50bd358523ad0",
"message": "fix(ingestion): prevent oversized query metadata from failing runs\n\nTwo compounding bugs caused Snowflake (and other SQL connectors) ingestion\nruns to fail with repeated GMS \"400 Cannot parse request entity\" errors on\nqueryProperties aspects:\n\n1. The SQL parsing aggregator merges temp-table query chains into one\n synthetic \"composite\" query and concatenated every constituent statement\n with no size cap, letting the merged text balloon to hundreds of MB.\n\n2. EnsureAspectSizeProcessor.ensure_query_properties_size, the guard that\n should truncate oversized statements, compared a raw character count\n against a JSON-byte reduction target. JSON escaping (\\n, \\\", control\n chars) inflates the serialized size past the raw length, so the guard\n gave up and emitted the oversized aspect anyway.\n\nFixes:\n- Cap composite query statement text (default 5MB, overridable via\n DATAHUB_MAX_COMPOSITE_QUERY_STATEMENT_CHARS) and record truncations in the\n aggregator report.\n- Rewrite the size guard to measure serialized size correctly via binary\n search over the statement prefix, fall back to dropping name/description\n when the non-statement overhead alone is over budget, and always emit a\n structured warning + record the truncation.\n\nAdds unit tests for both fixes.\n\nCo-authored-by: Cursor "
},
"base": {
"hash": "96dd0886d3971c05b3265a1fce958c7541ac226b",
"message": "fix(smoke-test): restore CDC consumer auth env and enforce data product rename auth (#18082)\n\nCo-authored-by: Cursor "
},
"pr": {
"number": 18102,
"title": "fix(ingestion): prevent oversized query metadata from failing runs",
"url": "https://github.com/datahub-project/datahub/pull/18102"
}
}
Current base URL: unknown
| Package | Size | Install command |
|---|---|---|
acryl-datahub |
4.295 MB | uv pip install 'acryl-datahub @ <base-url>/artifacts/wheels/acryl_datahub-0.0.0.dev1-py3-none-any.whl' |
acryl-datahub-actions |
0.116 MB | uv pip install 'acryl-datahub-actions @ <base-url>/artifacts/wheels/acryl_datahub_actions-0.0.0.dev1-py3-none-any.whl' |
acryl-datahub-airflow-plugin |
0.072 MB | uv pip install 'acryl-datahub-airflow-plugin @ <base-url>/artifacts/wheels/acryl_datahub_airflow_plugin-0.0.0.dev1-py3-none-any.whl' |
acryl-datahub-dagster-plugin |
0.021 MB | uv pip install 'acryl-datahub-dagster-plugin @ <base-url>/artifacts/wheels/acryl_datahub_dagster_plugin-0.0.0.dev1-py3-none-any.whl' |
acryl-datahub-gx-plugin |
0.011 MB | uv pip install 'acryl-datahub-gx-plugin @ <base-url>/artifacts/wheels/acryl_datahub_gx_plugin-0.0.0.dev1-py3-none-any.whl' |
prefect-datahub |
0.011 MB | uv pip install 'prefect-datahub @ <base-url>/artifacts/wheels/prefect_datahub-0.0.0.dev1-py3-none-any.whl' |