現在この提出コンテンツをフォロー中です。
- フォローしているコンテンツ フィードに更新が表示されます。
- コミュニケーション基本設定に応じて電子メールを受け取ることができます
Computes mel frequency cepstral coefficient (MFCC) features from a given speech signal. The speech signal is first preemphasised using a first order FIR filter with preemphasis coefficient. The preemphasised speech signal is subjected to the short-time Fourier transform analysis with a specified frame duration, frame shift and analysis window function. This is followed by magnitude spectrum computation, followed by filterbank design with M triangular filters uniformly spaced on the mel scale between lower and upper frequency limits. The filterbank is applied to the magnitude spectrum values to produce filterbank energies (FBEs). Log-compressed FBEs are then decorrelated using the discrete cosine transform to produce cepstral coefficients. Final step applies sinusoidal lifter to produce liftered MFCCs that closely match those produced by HTK. Demo scripts are included.
引用
Kamil Wojcicki (2026). HTK MFCC MATLAB (https://jp.mathworks.com/matlabcentral/fileexchange/32849-htk-mfcc-matlab), MATLAB Central File Exchange. に取得済み.
謝辞
ヒントを得たファイル: Triangular Filterbank, File I/O for Cell Arrays, Framing Routines
ヒントを与えたファイル: Classification of musical genres using HMM.
