Is your feature request related to a problem? Please describe.
We were historically doing pd.read_parquet(list_of_files) but now have moved to pd.concat((pd.read_parquet(f) for f in files)).
https://github.com/NVIDIA-NeMo/Curator/pull/1249/files#r2546234214
Describe the solution you'd like
A clear and concise description of what you want to happen.
Describe alternatives you've considered
A clear and concise description of any alternative solutions or features you've considered.
Additional context
Add any other context or screenshots about the feature request here.