public_data/not_livable_delivery/LOADER_README.txt

13 lines
709 B
Plaintext
Raw Permalink Normal View History

2026-06-02 15:27:40 +00:00
not_livable_permits -- loading notes
====================================
Rows: 135,159. Files: not_livable_permits.csv / .parquet / data_dictionary.csv
PREFERRED -- parquet (dtypes embedded, no re-contamination):
import pandas as pd; df = pd.read_parquet('not_livable_permits.parquet')
CSV: read code columns as str (else '06037'->int 6037, leading zero lost):
STR_COLS = ['permit_id', 'city', 'state', 'source_dataset', 'tract_geoid', 'county_fips']
df = pd.read_csv('not_livable_permits.csv', dtype={c:str for c in STR_COLS})
FIPS canonical: state=2, county_fips=5, tract_geoid=11 digits.
Tight headline = uninhabitable_on_census_day & residential_flag & ~excluded_any (=32,199).