Skip to content

Raw

The raw layer is the untyped landing zone for data crawled by the Scrapy spider. Tables sit in the <env>__raw ClickHouse schema and are declared as dbt sources, not models — there is no SQL transformation here, just the JSON payloads as scraped.

SourceTableLoaded at
bet365 / ended_eventsbet365__ended_eventsextracted_at
bet365 / oddsbet365__oddsextracted_at
bet365 / lineupsbet365__lineupsextracted_at
bet365 / dbt_test_resultsdbt_test_results_rawexecuted_at
bet365 / dbt_test_failuresdbt_test_failures_rawrecorded_at

Source freshness is checked with a 26-hour warn / 50-hour error threshold across all five tables (configured in models/raw/betsapi/properties.yml).