Requested 48000x48000 (17.2GB) array exceeds maximum array size preference. Creation of arrays greater than this limit may take a long time and cause MATLAB to become unresponsive. Error in dtw (line 16) d=(repmat(​t(:),1,M)-​repmat(r(:​)',N,1)).^​2; %t

360 visualizaciones (últimos 30 días)
clc,clear, close all
warning off;
[audio,fs]=audioread('11.wav');
[speech,fs]=audioread('13.wav');
[dist,ix,iy] = dtw(audio,speech)
  3 comentarios
manoj sain
manoj sain el 1 de Jun. de 2021
array size is the problem.
because when i apply dtw on audio files.
array size error will pop up
Jan
Jan el 1 de Jun. de 2021
Editada: Jan el 5 de Abr. de 2022
Correct. For the dtw() command, a matrix is created, which has the sizes of the inputs as side lengths. This means, that 2 huge signals create a huge^2 matrix. If this exhausts your memory, you need more memory or smaller inputs.There is no magic third way.

Iniciar sesión para comentar.

Respuesta aceptada

Chidvi Modala
Chidvi Modala el 3 de Jun. de 2021
This issue could be due to RAM limitations. You may try the following suggestions:
1.Go to MATLAB > Preferences > Workspace and ensure the Maximum array size limit is set to 100%.
Then execute 'memory' command in the Command Window and send the output. Ensure that the Maximum possible array size is larger than the memory required by the data.
2. Also, check that the Java Heap Memory is not set to a very large value because that might restrict the amount of memory available to perform computations.
  4 comentarios
Walter Roberson
Walter Roberson el 15 de Jul. de 2023
You simply cannot do what you want to do, not unless you have a lot of RAM.
dtw() creates an array that is numel() of the first parameter by numel() of the second parameter. When you are passing in entire audio selections, numel() is often fairly large -- for example, 3 minutes of mono at 44100 samples per second is
format long g
3*60*44100
ans =
7938000
close to 8 mega-samples, and if you were to try to dynamic time warp against something the same size you would need
ans^2
ans =
63011844000000
array entries each of which were 8 bytes long.
For this reason, Don't Do That . Instead, divide the signals up into overlapping segments, apply a windowing function to each segment to reduce edge effects, and dtw one segment of the first against one signal of the second. Or if you need to figure out where one starts with respect to another, divide one into segments and xcorr each segment against the whole of the other until you get a match, and then trim appropriately before doing segment-by-segment dtw.
Sathish
Sathish el 7 de Nov. de 2023
I followed @Chidvi Modala suggestion and along with that I unchecked the Matlab array size limit. Now I dont see any memory issue in my case.

Iniciar sesión para comentar.

Más respuestas (1)

Walter Roberson
Walter Roberson el 3 de Jun. de 2021
This is the kind of reason why spectrograms and similar processes take windows into the data.
To prevent artifacts where the windows join, typically overlapping windows are used.
For some kinds of processes, 50% overlap is used -- so for example for [1 2 3 4 5 6], one window would be [1 2 3 4], then the second window would be [3 4 5 6].
For other kinds of processes, a 10% overlap is common.
For audio, instead of a fixed size of overlap, it sometimes make sense to calculate the overlap based upon a particular time. To make up a number, there might be certain cases where some kinds of distortions tend to become perceptible around 5 milliseconds, so the overlap might be chosen in terms of the number of samples that fit 5 milliseconds.
The size of the window, together with the sampling frequency, will determine the frequency resolution.
If you have 48000 samples, then you just might be working with one second of sound at 48000 samples per second. 48000 is one of the "magic numbers" in audio: 48000 samples per second gets used for some kinds of professional audio, such as DVDs, but 24000 samples per second is not nearly as likely to be used. Instead, near that range, 22050 samples per second is more likely, as that is CD quality.
At 48000 Hz, that is about 21 microseconds per sample. https://www.sfu.ca/sonic-studio-webdav/handbook/Binaural_Hearing.html says that humans can detect timing differences of 30 microseconds. You just might be studying timing perception, so potentially you might not be able to window much at all.

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by