Refactored the load_cluster function in load_folder.py to support parallel file loading using ProcessPoolExecutor, improving performance during the append phase. Added workers parameter for controlling parallelism and integrated a progress_queue for real-time progress updates. Introduced read_sas_metadata function in load_sas.py to efficiently read metadata from SAS files, optimizing the pre-scan process for global progress tracking. |
||
|---|---|---|
| .. | ||
| samples | ||
| .env.example | ||
| generate_sample_folder.py | ||
| generate_sample_sas.py | ||
| load_folder.py | ||
| load_sas.py | ||
| sample_config.yaml | ||
| sample_folder_config.yaml | ||