DataHub Python Builds

These prebuilt wheel files can be used to install our Python packages as of a specific commit.

Build context

Built at 2026-06-30T14:23:09.173111+00:00.

{
  "timestamp": "2026-06-30T14:23:09.173111+00:00",
  "branch": "fix/oversized-query-properties-truncation",
  "commit": {
    "hash": "a75210a84b5e8e2676c1e4e5f8c50bd358523ad0",
    "message": "fix(ingestion): prevent oversized query metadata from failing runs\n\nTwo compounding bugs caused Snowflake (and other SQL connectors) ingestion\nruns to fail with repeated GMS \"400 Cannot parse request entity\" errors on\nqueryProperties aspects:\n\n1. The SQL parsing aggregator merges temp-table query chains into one\n   synthetic \"composite\" query and concatenated every constituent statement\n   with no size cap, letting the merged text balloon to hundreds of MB.\n\n2. EnsureAspectSizeProcessor.ensure_query_properties_size, the guard that\n   should truncate oversized statements, compared a raw character count\n   against a JSON-byte reduction target. JSON escaping (\\n, \\\", control\n   chars) inflates the serialized size past the raw length, so the guard\n   gave up and emitted the oversized aspect anyway.\n\nFixes:\n- Cap composite query statement text (default 5MB, overridable via\n  DATAHUB_MAX_COMPOSITE_QUERY_STATEMENT_CHARS) and record truncations in the\n  aggregator report.\n- Rewrite the size guard to measure serialized size correctly via binary\n  search over the statement prefix, fall back to dropping name/description\n  when the non-statement overhead alone is over budget, and always emit a\n  structured warning + record the truncation.\n\nAdds unit tests for both fixes.\n\nCo-authored-by: Cursor "
  },
  "base": {
    "hash": "96dd0886d3971c05b3265a1fce958c7541ac226b",
    "message": "fix(smoke-test): restore CDC consumer auth env and enforce data product rename auth (#18082)\n\nCo-authored-by: Cursor "
  },
  "pr": {
    "number": 18102,
    "title": "fix(ingestion): prevent oversized query metadata from failing runs",
    "url": "https://github.com/datahub-project/datahub/pull/18102"
  }
}

Usage

Current base URL: unknown

Package Size Install command
acryl-datahub 4.295 MB uv pip install 'acryl-datahub @ <base-url>/artifacts/wheels/acryl_datahub-0.0.0.dev1-py3-none-any.whl'
acryl-datahub-actions 0.116 MB uv pip install 'acryl-datahub-actions @ <base-url>/artifacts/wheels/acryl_datahub_actions-0.0.0.dev1-py3-none-any.whl'
acryl-datahub-airflow-plugin 0.072 MB uv pip install 'acryl-datahub-airflow-plugin @ <base-url>/artifacts/wheels/acryl_datahub_airflow_plugin-0.0.0.dev1-py3-none-any.whl'
acryl-datahub-dagster-plugin 0.021 MB uv pip install 'acryl-datahub-dagster-plugin @ <base-url>/artifacts/wheels/acryl_datahub_dagster_plugin-0.0.0.dev1-py3-none-any.whl'
acryl-datahub-gx-plugin 0.011 MB uv pip install 'acryl-datahub-gx-plugin @ <base-url>/artifacts/wheels/acryl_datahub_gx_plugin-0.0.0.dev1-py3-none-any.whl'
prefect-datahub 0.011 MB uv pip install 'prefect-datahub @ <base-url>/artifacts/wheels/prefect_datahub-0.0.0.dev1-py3-none-any.whl'