Introduced a new command-line argument, --chunk-rows, allowing users to specify the number of rows per chunk for pyreadstat streaming and COPY operations. This option overrides the GENERIC_LOADER_CHUNK_ROWS environment variable and auto-scaling behavior when using multiple workers. Enhanced memory management by providing detailed information on peak memory usage based on the specified chunk size, improving performance and usability during large dataset processing. |
||
|---|---|---|
| .. | ||
| samples | ||
| .env.example | ||
| generate_sample_folder.py | ||
| generate_sample_sas.py | ||
| load_folder.py | ||
| load_sas.py | ||
| sample_config.yaml | ||
| sample_folder_config.yaml | ||