Read many data blocks from text file
4 visualizaciones (últimos 30 días)
Mostrar comentarios más antiguos
Timo
el 13 de Jun. de 2017
Comentada: Timo
el 7 de Jul. de 2017
Hello everybody,
I tried to read certain data blocks of different text files (as well as .lila files). The data within the files is more or less structured in the same way and the files consist of 2 columns (; delimited) and contain a header of 17-lines and subsequently in column 1 a time series (hourly time steps) with a corresponding value in column 2 is following. This scheme is repeating many times within each fil, but the time series of each data block is different, whereby the amount of lines between 2 different headers differs.
Here an example of the data of a file:
- Station;X1
- Stationsnummer;1
- Stationskennung;M
- Betreiber;D1
- Datenart;TTAU
- Datentyp;P
- Datenursprung;mes
- Datenbezug;Station
- Dimension;Grad C
- Zeitintervall;01:00
- Zeitzone;UTC+1
- X-Koordinate;4385224
- Y-Koordinate;5604764
- Koordinatensystem;31468
- Hoehe;450
- Hohensystem;m ue NN
- Kommentar;jweuiwe 01.01.1997 00:00;-100
- 01.01.1997 01:00;-1
- 01.01.1997 02:00;-1
- 01.01.1997 03:00;-1
- 01.01.1997 04:00;-1
- 01.01.1997 05:00;-1
- 01.01.1997 06:00;-3
- 01.01.1997 07:00;-1
- 01.01.1997 08:00;-1
- 01.01.1997 09:00;-1
- 01.01.1997 10:00;-1
- 01.01.1997 11:00;-4
- Station;X2
- Stationsnummer;2
- Stationskennung;SMUE
- Betreiber;DWD
- Datenart;TTAU
- Datentyp;P
- Datenursprung;mes
- Datenbezug;Station
- Dimension;Grad C
- Ztintervall;01:00
- Zeitzone;UTC+1
- X-Koordinate;4412996
- Y-Koordinate;5613131
- Koordinatensystem;31468
- Hoehe;937
- Hohensystem;m ue NN
- Kommentar;wewerreewr
- 01.01.1997 00:00;-1
- 01.01.1997 01:00;-1
- 01.01.1997 02:00;-12...
In reality the time series is not restricted to a low number of measurements as depicted above, but can contain thousands of lines and each gauge has a different time series. My goal is to seperate the time series of each station and therefore to delete the header of each station and read the dates and corresponding values in a distinct array/cell-array. I read a lot how to do it and tried different ways but I am new and have huge problems to get a solution. My approach until now:
FID=fopen('teststation.lila','r')
C=textscan(FID,'%s %s','delimiter',';')
C2=[C{:}]
g=0
for i=1:length(C2)
if strcmp(C2{i,1},'Station')
g=g+1
Cneu{g,1}=i
end
end
lCneu=length(Cneu)
Data={}
for t=1:lCneu
for i=Cneu{t,1}+17:Cneu{t+1,1}-1
Data{i-17,1}=C2{i,1}
end
end
In the beginning it reads the time series between the stations but in the end the error Index exceeds matrix dimensions arises (for the line with i=Cneu{t,1}+17:Cneu{t+1,1}-1 ). Then I tried it likes this:
for t=1:lCneu
if t>lCneu
Cneu{t+1,1}=90
for i=Cneu{t,1}+17:Cneu{t+1,1}-1 Data{i-17,1}=C2{i,1}
end
end
end
But then I only get an empty cell-array.
During my search I often found some recommendations relating to ~feof(FID) and fgetl(FID), but I don't figured out how to use these commands or if they are really helpful for my problem. An other option I was thinking about was to use fprintf due to the fact that I have to compare each time series for missing dates (or hours) with a reference time series.
Hopefully someone could help me. Thanks in advance :) Greetings
Timo
0 comentarios
Respuesta aceptada
Jayaram Theegala
el 26 de Jun. de 2017
Since the date that you are dealing with has the same number of columns, with the same delimiter, you can use the "readtable" function. For more information about this function, you can click on the following URL:
Once the data is in a table (with all character vector/string type), you can do further processing based on its contents, and further allows you to remove the header contents.
You may find the following MATLAB script helpful to get started:
data = readtable('test.txt'); % I copied the data you provided into this this text file and using the readtable function
dates = []; %the variable which will store all the date and times
counter = 1;
for i = 1:height(data)
try
dates = [dates;datetime(data{i,1}, 'InputFormat', 'dd.MM.yyyy hh:mm')];
counter = counter+1;
catch ME
%You can display that header is omitted if you want
end
end
Más respuestas (0)
Ver también
Categorías
Más información sobre Data Import and Export en Help Center y File Exchange.
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!