Random sample, I want the 5% of the data per each hour
2 visualizaciones (últimos 30 días)
Mostrar comentarios más antiguos
Rachele Franceschini
el 4 de Jun. de 2021
Editada: Scott MacKenzie
el 4 de Jun. de 2021
I have a database with 19 columns. One column has date, month, year and hour. I would like to get, per each hour, the 5% of the data. Naturaly, I would like to see all the other data, along with the column of time.
Can you help me?
I saw the comand resample, but at the moment, I am in difficulty.
2 comentarios
Scott MacKenzie
el 4 de Jun. de 2021
It would help if you post the data -- or, better yet, a subset of the data -- and any code you have written so far.
Respuesta aceptada
Scott MacKenzie
el 4 de Jun. de 2021
Editada: Scott MacKenzie
el 4 de Jun. de 2021
There might be a way to simplify this, but I believe the script below achieves what you are after...
% read all the data into a table
T = readtable('https://www.mathworks.com/matlabcentral/answers/uploaded_files/642145/Cartel1.xlsx');
% build a vector of 0s and 1s --> each 1 occurs where the hour changes
dt = datetime(T{:,3});
hr = hour(dt);
z = diff(hr);
% build a vector of the indices where the time changes
idx = find(z); % indices of 1s in z
idx = [0; idx];
% build a vector of new indices, selecting at random 5% of the rows for each hour
idxNew = [];
for i=2:length(idx)
n = round(0.05 * (idx(i) - idx(i-1)+1));
idxNew = [idxNew, randi([idx(i-1)+1, idx(i)], 1, n)];
end
% create new table with 5% of the rows for each hour
Tnew = T(idxNew,:);
With this script, your data set is now much smaller. See below. That's the general idea, right?
2 comentarios
Scott MacKenzie
el 4 de Jun. de 2021
@Rachele Franceschini You're welcome.
BTW, I just fixed a small bug in the answer script. The second index in each range included the first row of the following hour. It's fixed now. Good luck.
Más respuestas (1)
KSSV
el 4 de Jun. de 2021
Let A be your data matrix.
[m,n] =size(A) ;
p = round(5/100*m) ;
idx = randsample(m,n) ;
iwant = A(idx,:)
0 comentarios
Ver también
Categorías
Más información sobre Startup and Shutdown en Help Center y File Exchange.
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!