regexp: either 'once' or 'all'

10 visualizaciones (últimos 30 días)
Anthony
Anthony el 24 de Nov. de 2013
Editada: Cedric el 25 de Nov. de 2013
Hi, I want to do a search only for the first n times. But it seems the 'options' in regexp can only do 'once' or 'all'. no intermediate choice, say '5', huh? thanks a lot!
  2 comentarios
Azzi Abdelmalek
Azzi Abdelmalek el 24 de Nov. de 2013
Give an example
Anthony
Anthony el 24 de Nov. de 2013
for example, a sentence like A='I have a dream have a dream have a dream bla bla bla...'. this sentence might be pretty long, so i wanna control for how many times it will search to make running time not too much. regexp(A,'dream','once') cannot accomplish this presumably, i can strsplit this into words using a cell, and do the regexp only for the first 40 words. something like B=strsplit(A,' '); regexp(B(1:40),'dream'); But, here comes the second reason: more important, this sentence belongs to a 10000*1 cell, each of which includes such a sentence. and i prefer to working with the entire cell array, not using any for loop. thanks a lot

Iniciar sesión para comentar.

Respuestas (2)

Walter Roberson
Walter Roberson el 24 de Nov. de 2013
(pattern){5}
In some cases what you might be after more might be
N = 5; %times to match
Rexpat = [ repmat( ['(' pattern '[).*?]', 1, N-1), ')' ];
regexp(string, Rexpat)
  1 comentario
Cedric
Cedric el 25 de Nov. de 2013
Editada: Cedric el 25 de Nov. de 2013
There should be a 3rd param. 'once' in the call to REGEXP, to avoid getting multiple matches if there are multiple times N occurrences of the initial pattern.
@Anthony: I would profile both the Rexmat pattern and the original pattern, because reducing the number of matches by increasing the complexity of the pattern doesn't always lead to an increase in performance. It's quite the opposite in fact.

Iniciar sesión para comentar.


Image Analyst
Image Analyst el 25 de Nov. de 2013
Perhaps you want strfind():
A='I have a dream have a dream have a dream bla bla bla...';
indexes = strfind(A, 'dream') % Find all locations.
indexes = indexes(1:5); % Take first 5 only.
In the command window:
indexes =
10 23 36
  2 comentarios
Image Analyst
Image Analyst el 25 de Nov. de 2013
There's an old saying on how to write and speak: "Never say "Blah, blah, blah" when "Blah" will do. ;-)
Cedric
Cedric el 25 de Nov. de 2013
+1 for the saying-driven solution ;-)

Iniciar sesión para comentar.

Categorías

Más información sobre Characters and Strings en Help Center y File Exchange.

Etiquetas

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by