bluedoor data·SF Superior Court API·bluedoor.sh ↗

jamiequint/sf_criminal_court Hugging Face Dataset

Source ID hf_sf_criminal_court. Use source caveats and join keys before treating context records as court facts.

Source overview

Source ID: hf_sf_criminal_court
Name: jamiequint/sf_criminal_court Hugging Face Dataset
Owner: Independent public dataset compiler using SFSC and public agency sources
Layer: reference_enrichment
Coverage: Public Hugging Face parquet snapshot last modified 2026-05-04 with 13 tables: 77,406 cases, 776,728 ROA rows, 72,289 attorney rows, 318,993 calendar rows, 318,993 calendar+judge-assignment rows, 44,029 SFSC charge-disposition rows, 13,790 inferred SFSC case matches, DA arrest/prosecuted tables, and judicial assignment dimensions. The prototype now loads a bounded 120-case reference extract into the API: 120 cases, 4,079 docket rows, 452 attorney rows, 1,844 judge-enriched hearing rows, 340 charge-disposition rows, and 292 court-charge outcome rows.
Formats: Parquet
Join keys: case_id, case_number, court_number, filed_date, charge_multiset, department, event_date
Caveats:
- Hugging Face card declares CC-BY-NC-4.0; production/commercial use needs legal review or official-source re-derivation.
- Cases, ROA, attorneys, and calendar rows are scraped/derived reference data and should not outrank a fresh official SFSC scrape.
- Rule 10.500 charge-disposition rows are court-owned facts, but public case-number joins are deterministic-inferred because the court spreadsheet used anonymized IDs.
- Judge names on calendar rows are published department assignments, not proof of the actual sitting judge for a specific hearing.
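
The caveat about deterministic-inferred joins can be illustrated with a minimal sketch. It assumes that anonymized charge-disposition rows are matched to public cases on `filed_date` plus an order-independent multiset of charges (the `filed_date` and `charge_multiset` join keys), and that a match is accepted only when exactly one public case fits. The dataset's actual matching procedure is not described on this page; the field names `anon_id` and `charges` and the helper names are hypothetical.

```python
from collections import Counter, defaultdict


def charge_key(charges):
    """Build an order-independent key that still preserves duplicate charges."""
    counts = Counter(c.strip().upper() for c in charges)
    return tuple(sorted(counts.items()))


def match_cases(anonymized_rows, public_cases):
    """Return {anon_id: case_number}, keeping only unambiguous matches.

    Assumption: both inputs are lists of dicts carrying a "filed_date"
    string and a "charges" list. These field names are illustrative only.
    """
    index = defaultdict(list)
    for case in public_cases:
        key = (case["filed_date"], charge_key(case["charges"]))
        index[key].append(case["case_number"])

    matches = {}
    for row in anonymized_rows:
        key = (row["filed_date"], charge_key(row["charges"]))
        candidates = index.get(key, [])
        # Zero or several candidates means the join cannot be inferred safely.
        if len(candidates) == 1:
            matches[row["anon_id"]] = candidates[0]
    return matches
```

Dropping ambiguous candidates keeps an inferred link from being mistaken for a court fact, which is the concern the caveat raises; a consumer would likely also want to carry an "inferred" flag alongside each matched case number.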

Linked cases

0 matching cases for this source filter.

Source artifacts

Artifact ID | Source ID | Artifact Type | Path | URL | Captured At
- | hf_sf_criminal_court | derived_public_dataset | - | - | -

Full source record

Access Mode: parquet_download
Cadence: snapshot; last observed scrape in research notes was 2026-03-20
Coverage: Public Hugging Face parquet snapshot last modified 2026-05-04 with 13 tables: 77,406 cases, 776,728 ROA rows, 72,289 attorney rows, 318,993 calendar rows, 318,993 calendar+judge-assignment rows, 44,029 SFSC charge-disposition rows, 13,790 inferred SFSC case matches, DA arrest/prosecuted tables, and judicial assignment dimensions. The prototype now loads a bounded 120-case reference extract into the API: 120 cases, 4,079 docket rows, 452 attorney rows, 1,844 judge-enriched hearing rows, 340 charge-disposition rows, and 292 court-charge outcome rows.
Government Level: mixed_public_derived
ID: hf_sf_criminal_court
Layer: reference_enrichment
Name: jamiequint/sf_criminal_court Hugging Face Dataset
Owner: Independent public dataset compiler using SFSC and public agency sources
Profile Status: promoted_bounded_reference_extract
Canonical Records: court_case, court_charge, charge_disposition, prosecution_event, arrest_event, judge_assignment, source_record
Caveats:
- Hugging Face card declares CC-BY-NC-4.0; production/commercial use needs legal review or official-source re-derivation.
- Cases, ROA, attorneys, and calendar rows are scraped/derived reference data and should not outrank a fresh official SFSC scrape.
- Rule 10.500 charge-disposition rows are court-owned facts, but public case-number joins are deterministic-inferred because the court spreadsheet used anonymized IDs.
- Judge names on calendar rows are published department assignments, not proof of the actual sitting judge for a specific hearing.
Evidence:
- docs/research/master-findings.md
- docs/research/enrichment-findings.md
- artifacts/source-discovery/hf-sf-criminal-court.dataset.json
- artifacts/source-discovery/hf-sf-criminal-court.README.md
- artifacts/source-discovery/hf-sf-criminal-court.file-heads.json
- artifacts/source-discovery/hf-sf-criminal-court.remote-parquet-profile.json
- data/hf_sf_criminal_court_raw/hf_sf_criminal_court.json
- data/hf_sf_criminal_court_raw/manifest.json
- scripts/profile_hf_sf_criminal_court.py
Formats: Parquet
Join Keys: case_id, case_number, court_number, filed_date, charge_multiset, department, event_date
Known Endpoints: -
Rate Limit Notes:
- No live court traffic is required for this reference extract.
- HEAD probes showed all 13 parquet files total roughly 37 MB; DuckDB can profile them remotely or a batch job can cache them offline.
- Do not query Hugging Face per API request; use scheduled/offline refresh if license policy permits.
Source URLs: https://huggingface.co/datasets/jamiequint/sf_criminal_court
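
The rate-limit notes recommend a scheduled or offline refresh rather than a Hugging Face request per API call. One way a batch job might decide whether its local copy is stale is sketched below. It assumes freshness is judged by the modification time of a locally cached file such as `data/hf_sf_criminal_court_raw/manifest.json`; the seven-day default and the function name are hypothetical, and the check says nothing about whether license policy permits the refresh.

```python
import time
from pathlib import Path


def needs_refresh(manifest_path, max_age_days=7.0, now=None):
    """Return True when the cached manifest is missing or older than max_age_days.

    Assumption: the file's modification time is a good enough proxy for
    when the local parquet cache was last refreshed.
    """
    path = Path(manifest_path)
    if not path.exists():
        return True
    current = time.time() if now is None else now
    age_seconds = current - path.stat().st_mtime
    return age_seconds > max_age_days * 86400
```

A scheduler could call `needs_refresh("data/hf_sf_criminal_court_raw/manifest.json")` once per run and download the parquet files only when it returns `True`, so API requests are always served from the local extract.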

Get this page with API

Rendered from the bluedoor SF Superior Court API. Reproduce it:

GET https://api.bluedoor.sh/sf-superior-court/v1/sources/hf_sf_criminal_court (JSON)
GET https://api.bluedoor.sh/sf-superior-court/v1/case-search?source_id=hf_sf_criminal_court&division=criminal&limit=25&include_facets=true (JSON)
GET https://api.bluedoor.sh/sf-superior-court/v1/source-artifacts (JSON)
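
The requests above could be issued from Python roughly as follows. This is a sketch using only the standard library; it assumes the endpoints need no authentication and return JSON bodies, neither of which this page confirms beyond the JSON label. The helper names are illustrative and not part of the API.

```python
import json
import urllib.parse
import urllib.request

BASE_URL = "https://api.bluedoor.sh/sf-superior-court/v1"


def source_url(source_id):
    """URL of the source record, e.g. for hf_sf_criminal_court."""
    return f"{BASE_URL}/sources/{urllib.parse.quote(source_id, safe='')}"


def case_search_url(source_id, division="criminal", limit=25, include_facets=True):
    """URL of the case search filtered to one source, mirroring the query above."""
    query = urllib.parse.urlencode({
        "source_id": source_id,
        "division": division,
        "limit": limit,
        "include_facets": "true" if include_facets else "false",
    })
    return f"{BASE_URL}/case-search?{query}"


def fetch_json(url, timeout=10.0):
    """GET a URL and decode the body as JSON (assumes no auth is required)."""
    request = urllib.request.Request(url, headers={"Accept": "application/json"})
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

For example, `fetch_json(source_url("hf_sf_criminal_court"))` would request the source record rendered on this page, and `fetch_json(f"{BASE_URL}/source-artifacts")` the artifact list.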