foxtrot/generic_loader
David Peterson 0c5e6e31f0 Enhance memory management in load_folder.py and load_sas.py for improved performance
Added memory management optimizations in the _worker_load_append_file function to release unused memory from pyarrow's pool and trigger Python's garbage collection. Implemented explicit memory trimming using glibc's malloc_trim to ensure efficient memory usage during long-running processes. Updated the copy_dataframes function in load_sas.py to release pyarrow's memory pool between chunks, preventing high memory usage in long-lived workers. These changes aim to reduce memory footprint and improve overall performance during large dataset processing.
2026-04-21 10:46:54 -05:00
..
samples adding explorer 2026-04-20 16:27:54 -05:00
.env.example Adding generic loader 2026-04-18 09:34:48 -05:00
generate_sample_folder.py Add generate_sample_folder.py and load_folder.py for clustered SAS file generation and loading 2026-04-18 11:25:04 -05:00
generate_sample_sas.py Adding generic loader 2026-04-18 09:34:48 -05:00
load_folder.py Enhance memory management in load_folder.py and load_sas.py for improved performance 2026-04-21 10:46:54 -05:00
load_sas.py Enhance memory management in load_folder.py and load_sas.py for improved performance 2026-04-21 10:46:54 -05:00
sample_config.yaml adding index field 2026-04-20 10:18:09 -05:00
sample_folder_config.yaml Enhance file clustering by implementing numeric sorting for last digit groups in stems and updating documentation for embedded-digit handling in auto-detection. 2026-04-20 11:48:22 -05:00