Gaussian Mixture Model for speech recognition
Hi all! I'm implementing a command-based speech recognition tool.
My training data consist of 21 utterances (7 different commands, 3 utterances each). So far I have done:
- the pre-processing phase (silence removal and end-point detection)
- the feature extraction phase (MFCC calculation).
So, for every utterance in my training set, I have an MFCC matrix with 12 columns (12 = number of MFCCs) and as many rows as the number of frames I divided the signal into.
For the recognition phase, I was thinking of using the gmdistribution tool.
I read this article:
http://www.mathworks.it/company/newsletters/digest/2010/jan/word-recognition-system-matlab.html
but I didn't understand this line of code:
% model = gmdistribution.fit(MFCCtraindata,M);
What is the MFCCtraindata parameter?
Is it the MFCC matrix associated with every utterance?
For each command I have 3 utterances, so I have 3 different MFCC matrices.
How can I create a single GMM per command if, fitting each utterance separately, I get 3 different GMMs for every command?
Any kind of help will be appreciated.
Thank you!!
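For reference, one common way to handle this is to stack the MFCC matrices of all utterances of a command vertically, so that each row is still one frame, and fit a single GMM per command on the stacked matrix. The sketch below is only an illustration under stated assumptions, not necessarily what the linked article does: the MFCC data are random stand-ins, the variable names (`mfcc`, `models`, `testMfcc`) are hypothetical, `M = 4` is an arbitrary choice, and `fitgmdist` is used as the newer equivalent of `gmdistribution.fit` (Statistics and Machine Learning Toolbox). Recognition then picks the command whose model gives the lowest negative log-likelihood for the test utterance.

```matlab
% Hypothetical stand-in data: mfcc{c}{u} is the (frames x 12) MFCC matrix of
% utterance u of command c. Real MFCC matrices would replace the random ones.
rng(0);                                     % reproducible synthetic data
numCommands = 7; numUtterances = 3; numCoeffs = 12;
mfcc = cell(numCommands, 1);
for c = 1:numCommands
    mfcc{c} = cell(numUtterances, 1);
    for u = 1:numUtterances
        numFrames = 80 + randi(40);         % utterances may differ in length
        mfcc{c}{u} = randn(numFrames, numCoeffs) + 3*c;  % class-dependent offset
    end
end

% Training: one GMM per command, fitted on the frames of all its utterances.
M = 4;                                      % number of components (assumed value)
models = cell(numCommands, 1);
for c = 1:numCommands
    % Stack the 3 matrices vertically: rows are frames, columns are MFCCs.
    MFCCtraindata = vertcat(mfcc{c}{:});
    models{c} = fitgmdist(MFCCtraindata, M, ...
        'CovarianceType', 'diagonal', ...   % fewer parameters for small data sets
        'RegularizationValue', 1e-3, ...    % keeps covariances well conditioned
        'Options', statset('MaxIter', 500));
end

% Recognition: score an unknown utterance against every command model.
testMfcc = randn(100, numCoeffs) + 3*5;     % synthetic utterance resembling command 5
negLogL = zeros(numCommands, 1);
for c = 1:numCommands
    % Second output of posterior is the negative log-likelihood of the data.
    [~, negLogL(c)] = posterior(models{c}, testMfcc);
end
[~, recognized] = min(negLogL);             % lowest negative log-likelihood wins
fprintf('Recognized command: %d\n', recognized);
```

With only 3 utterances per command, the number of components and the covariance type have to stay small, otherwise the fit can become ill-conditioned. Diagonal covariances and a small regularization value are a cautious starting point, not a tuned setting.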
Answers (5)
Rania Ziedan
22 October 2015
I really need help with the same issue. If you solved it, could you help me? Thanks in advance.
MUZITIANXINJIE
26 June 2016
Yes, I want this too, but no one has helped me! I really need to use deep learning to classify voices for recognition. Thanks for your help.
hanieh rafiee
19 February 2017
Hi. Did you receive an answer to your question? Will you help me, please?