Borrar filtros
Borrar filtros

Import large .par file

12 visualizaciones (últimos 30 días)
Carlo Zarattini
Carlo Zarattini el 24 de Mayo de 2023
Respondida: Saffan el 29 de Mayo de 2023
I need to import a .par file with 2.5gb of dimension. The .par file is a numerical table with 10 columns and few millions of rows.
I tried to used the parquetread function but i got an error message ("Unable to read Parquet"). A reduced version of the parquet file, 500mb, is uploaded successfully. My conclusion is that the original .par file is too large.
Any suggestion on how to import large .par file?

Respuestas (1)

Saffan
Saffan el 29 de Mayo de 2023
Hi Carlo,
One way to solve this issue could be by splitting the original parquet file into multiple smaller files and create “parquetDatastore” of them. The data can then be obtained from the datastore using the “read” function. A datastore is a repository of files where each individual file can fit in the memory but the entire collection of files may not fit. They are used to work with the data incrementally and hence reduces the memory usage.
You can refer to the following link for more information on datastores: https://www.mathworks.com/help/releases/R2022b/matlab/ref/datastore.html

Categorías

Más información sobre Datastore en Help Center y File Exchange.

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by