Speaker recognition
1 visualización (últimos 30 días)
Mostrar comentarios más antiguos
I am going to do a project based on speaker recognition (not speech recognition). I saw many files in the internet and came across many methods.
First part of the program is that they find MFCC and after that we have to do the pattern recognition.
Here are some of the pattern recognition algorithms that I came across 1)VQ algorithm followed by LBG algorithm for clustering. 2)K means algorithm.
I have another idea. I know neural networks for pattern recognition in image processing. Will that work with speaker recognition?
Is there any code in matlab central for speaker recognition? If you have done this project before please tell me the method that you followed.
Thanks in advance.
Much awaiting for your response.
2 comentarios
Image Analyst
el 15 de Oct. de 2011
From the subject line I thought he was talking about speaker identification (recognizing a particular speaker and extracting his speech), like the "cocktail party problem" http://research.ics.tkk.fi/ica/cocktail/cocktail_en.cgi but I've heard of that being solved with ICA, not the acronyms he listed.
Respuestas (8)
William
el 14 de Oct. de 2011
Many use a Gausian Mixture Model (GMM) after using the MFCC. There is a really good toolbox for these operations called "voicebox.m" it is a collection of functions that all you to extract and classify data from speech via wavread()
William
el 14 de Oct. de 2011
Look over this website. I had to do this a year ago for a class and this is exactly what I followed
Here is the link for voicebox
2 comentarios
William
el 14 de Oct. de 2011
It isn't terrible. if you collect a lot of data with the MFCC than your model might be more accurate but the time to process will slow way down.
William
el 14 de Oct. de 2011
There are numerous GMM algorithms that could be used to do this. find one that you understand so that if it ever stops working you can figure out why.
2 comentarios
i Venky
el 14 de Oct. de 2011
2 comentarios
Greg Heath
el 15 de Oct. de 2011
Both the MLP and RBF with a single hidden layer are universal approximators and can be used for both regression and pattern
recognition. If you are familiar with the NN Toolbox, it
shouldn't take long to try both.
Greg
William
el 14 de Oct. de 2011
I don't have a lot of experience with Neural Networks. If you understand them try it and see what happens. Worst case you lose sa few hours of time.
Brian Hemmat
el 20 de Mzo. de 2020
Audio Toolbox provides several examples for speaker recognition (both identification and verification):
0 comentarios
Ver también
Categorías
Más información sobre Sequence and Numeric Feature Data Workflows en Help Center y File Exchange.
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!