Search code examples
pythonparquet

How can I load a parquet file line by line


To save space on my laptop I saved a fairly large dataset as parquet files via dask. And was wondering if there is a way to read in the parquet files line by line.

Thanks for the help.


Solution

  • You can use ParquetFile.iter_row_groups from the fastparquet library to iterate over the lines of a parquet file.