How to read/open a CSV file with millions of rows and hundreds of columns to compare, delete, and save

Hi,
I have CSV files with millions of rows and hundreds of columns that I want to open/read in order to compare the files, remove duplicates, save the result as a new CSV file, and make many other modifications.
When I used csvread, the PC froze! Any help would be appreciated.
  9 comments
Shayma on 21 Sep 2016
Edited: Shayma on 21 Sep 2016
OK, it has been a long time now, but I got R2014b. I tried datastore and it works, at least it opens the first chunk. Thank you :)
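A minimal sketch of the chunked datastore approach mentioned above, for removing duplicate rows and saving the result. The file names, column names, and chunk size are hypothetical, and a tiny sample file is written first so the sketch runs on its own. It assumes the de-duplicated rows fit in memory, which may not hold for multi-gigabyte files. The chunk-size property is called ReadSize in recent releases; I believe older releases such as R2014b used a different name (RowsPerRead), so check the documentation for your release.

```matlab
% Write a small sample CSV so the sketch is self-contained.
% File name and column names are hypothetical.
fname = fullfile(tempdir, 'sample_data.csv');
fid = fopen(fname, 'w');
fprintf(fid, 'id,name,value\n');
fprintf(fid, '1,a,0.5\n2,b,1.5\n1,a,0.5\n3,c,2.5\n2,b,1.5\n');
fclose(fid);

ds = datastore(fname);   % tabular text datastore for the CSV file
ds.ReadSize = 2;         % rows per chunk; tiny here, e.g. 100000 for real data
                         % (assumption: older releases may name this RowsPerRead)

% Read the first chunk, then fold in the remaining chunks one at a time.
seen = unique(read(ds));                 % unique rows of the first chunk
while hasdata(ds)
    chunk = read(ds);                    % next chunk, returned as a table
    seen = unique([seen; chunk]);        % keep only rows not seen before
end

% Save the de-duplicated rows as a new CSV file.
outname = fullfile(tempdir, 'sample_dedup.csv');
writetable(seen, outname);
```

Note that unique sorts the rows, so the output order differs from the input order. Comparing two files could follow the same pattern, reading one chunk from each datastore per iteration.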
Shayma on 22 Sep 2016
How long is it supposed to take to read 100000 lines per chunk from 13.5 GB files?


Answers (1)

George on 12 Aug 2016
You can do this with textscan, but your formatSpec is going to be pretty gnarly.
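fid = fopen('data.csv', 'r');
if fid == -1, error('Could not open data.csv'); end
% your formatSpec will be very long because of the number of fields
formatSpec = '%s %s %f %d'; % reads a string, a string, a float, an integer
% skip the header row and split fields on commas; A is a cell array with one cell per column
A = textscan(fid, formatSpec, 'HeaderLines', 1, 'Delimiter', ',');
fclose(fid);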
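Since reading the whole file in one textscan call may exhaust memory, here is a hedged sketch of the same idea reading a fixed number of rows per call. The sample file, its columns, and the chunk size are made up for illustration. The header is skipped once with fgetl rather than with 'HeaderLines', on the assumption that 'HeaderLines' would otherwise skip a line on every call.

```matlab
% Write a small sample CSV so the sketch is self-contained.
% File name and columns are hypothetical.
fname = fullfile(tempdir, 'sample_chunks.csv');
fid = fopen(fname, 'w');
fprintf(fid, 'name,city,score,count\n');
fprintf(fid, 'ann,oslo,1.5,10\nbob,rome,2.5,20\ncat,lima,3.5,30\n');
fclose(fid);

fid = fopen(fname, 'r');
if fid == -1, error('Could not open %s', fname); end
fgetl(fid);                      % skip the header line once

formatSpec = '%s %s %f %d';      % string, string, float, integer
chunkRows  = 2;                  % rows per call; tiny here, larger for real data
totalRows  = 0;
while ~feof(fid)
    % Apply formatSpec at most chunkRows times, resuming where the last call stopped.
    C = textscan(fid, formatSpec, chunkRows, 'Delimiter', ',');
    nRead = numel(C{1});
    if nRead == 0, break; end    % nothing left to read
    totalRows = totalRows + nRead;
    % ... process this chunk here (compare, filter, append to an output file) ...
end
fclose(fid);
```

Each pass holds only chunkRows rows in memory, so the chunk size can be tuned to the available RAM.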
