Plotting voice segments in a sound file

My project is to take a .wav file containing speech segments and write a script that labels the voiced portions on a plot of the waveform, based on the signal's spectrum. Voice frequencies range from roughly 85 to 400 Hz. I've taken the FFT of the sample and the frequency distribution is strange: very high at the low end and at the high end, with almost nothing in the voice range. There's not much other noise in the sample. Any advice would be appreciated. What I would like to do is measure frequency content across time and label the parts that fall within the speech frequencies as the speech portions.
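A minimal sketch of the "measure frequency across time" idea, using only base MATLAB: split the signal into short overlapping frames, take the FFT of each frame, sum the power in the 85 to 400 Hz band, and label frames where that band energy is high. The synthetic test signal, the frame length, the hop size, and the 10% threshold are all assumptions for illustration and would need tuning on a real recording.

```matlab
% Stand-in for the recording (assumed): quiet noise, 1 s of a 150 Hz tone, quiet noise
fs     = 10000;                             % sample rate in Hz (assumed)
t      = (0:fs-1)'/fs;                      % one second of time stamps
quiet  = 0.01*randn(fs,1);                  % low-level background noise
voiced = sin(2*pi*150*t) + 0.01*randn(fs,1);
x      = [quiet; voiced; quiet];

frameLen = 512;                             % about 51 ms per frame (assumed)
hop      = 256;                             % 50% overlap (assumed)
band     = [85 400];                        % voice band quoted in the question, in Hz
nFrames  = floor((numel(x) - frameLen)/hop) + 1;
win      = 0.5 - 0.5*cos(2*pi*(0:frameLen-1)'/frameLen);  % Hann window, no toolbox needed
f        = (0:frameLen/2)'*fs/frameLen;     % frequency of each one-sided bin in Hz
inBand   = f >= band(1) & f <= band(2);

bandEnergy = zeros(nFrames,1);
frameTime  = zeros(nFrames,1);
for k = 1:nFrames
    idx = (k-1)*hop + (1:frameLen);         % sample indices of frame k
    X   = fft(x(idx).*win);
    P   = abs(X(1:frameLen/2+1)).^2;        % one-sided power spectrum of the frame
    bandEnergy(k) = sum(P(inBand));         % power inside the voice band
    frameTime(k)  = (idx(1) - 1 + frameLen/2)/fs;  % frame centre in seconds
end

% Label frames whose in-band energy exceeds 10% of the maximum (assumed threshold)
isSpeech = bandEnergy > 0.1*max(bandEnergy);

% Overlay the labels on the waveform
tx = (0:numel(x)-1)'/fs;
plot(tx, x); hold on
stairs(frameTime, isSpeech*max(abs(x)), 'r', 'LineWidth', 1.5);
hold off
xlabel('Time (s)'); ylabel('Amplitude');
legend('signal', 'labelled as speech');
```

If the Signal Processing Toolbox is available, its spectrogram function could replace the explicit framing loop; the loop is used here only to keep the sketch free of toolbox dependencies.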

4 Comments

Jan on 11 May 2014
Please show us the code you use.
Joseph on 11 May 2014
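To start:
fs = 10000; % overwritten by the sample rate that wavread returns on the next line
[sp, fs] = wavread('soundfile.wav'); % this particular sound file is 10 seconds long; in newer MATLAB releases audioread replaces wavread
spDFT = fft(sp);
stem(abs(spDFT)) % the x-axis here is the FFT bin index, not frequency in Hz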
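A possible explanation for the "high at the low range and high range" shape: fft returns bins indexed from 0 up to just below fs, and for a real signal the second half mirrors the first, so plotting every bin against its index shows energy at both ends. A DC offset in the recording also produces a large value in the first bin. The sketch below plots a one-sided spectrum against frequency in Hz instead. The synthetic signal standing in for sp and the 10 kHz sample rate are assumptions.

```matlab
fs = 10000;                          % sample rate in Hz (assumed)
t  = (0:fs-1)'/fs;                   % one second of samples
sp = 0.5 + sin(2*pi*150*t);          % stand-in signal: DC offset plus a 150 Hz tone

sp    = sp - mean(sp);               % remove the DC offset so bin 1 does not dominate
N     = numel(sp);                   % N is even in this example
spDFT = fft(sp);

half = floor(N/2) + 1;               % bins 1..half cover 0 Hz up to fs/2
f    = (0:half-1)'*fs/N;             % frequency of each kept bin in Hz
mag  = abs(spDFT(1:half))/N;         % magnitude scaled by the signal length
mag(2:end-1) = 2*mag(2:end-1);       % fold in the mirrored half (valid for even N)

plot(f, mag);
xlim([0 1000]);                      % zoom in on the range around the voice band
xlabel('Frequency (Hz)'); ylabel('Magnitude');

[~, iPeak] = max(mag);
peakHz = f(iPeak);                   % frequency of the strongest component
```

With the real file, sp and fs would come from the file reader instead of the synthetic lines at the top.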
Anveshkumar Kolluri on 28 Jun 2016
You can take the Fourier transform, zero out the components outside the voice band, and transform back, so that you are left with only the voiced part.
Now you can plot the result to see only the voiced part.
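One way to read this suggestion is as frequency-domain masking: take the FFT, set every bin outside the voice band to zero (keeping the mirrored bins so the result stays real), and transform back. This is a rough sketch under assumptions: the two-tone test signal and the 85 to 400 Hz band edges are illustrative, and an abrupt mask like this can cause ringing on real recordings.

```matlab
fs = 10000;                              % sample rate in Hz (assumed)
N  = fs;                                 % one second of samples
t  = (0:N-1)'/fs;
x  = sin(2*pi*150*t) + sin(2*pi*2000*t); % in-band tone plus an out-of-band tone

X = fft(x);
f = (0:N-1)'*fs/N;                       % nominal frequency of every bin
f(f > fs/2) = fs - f(f > fs/2);          % map the mirrored upper half back to 0..fs/2

keep = f >= 85 & f <= 400;               % symmetric mask: voice band and its mirror
y    = real(ifft(X.*keep));              % band-limited signal; real() drops rounding residue

plot(t, x, t, y);
xlim([0 0.05]);                          % show the first 50 ms
xlabel('Time (s)'); ylabel('Amplitude');
legend('original', 'voice band only');
```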
Image Analyst on 29 Jun 2016
You forgot to attach 'soundfile.wav'. Why not just threshold the signal? Are there other noises just as loud as the voice but in a different frequency range?
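A rough sketch of what thresholding the signal could look like: compute a short-time RMS envelope with a moving average, then mark the samples where the envelope exceeds a fraction of its maximum. The synthetic signal, the 20 ms window, and the 20% threshold are assumptions and only work when the voice is clearly louder than everything else.

```matlab
fs = 10000;                                  % sample rate in Hz (assumed)
t  = (0:fs-1)'/fs;
x  = [0.01*randn(fs,1); sin(2*pi*150*t); 0.01*randn(fs,1)];  % quiet, loud, quiet

winLen = round(0.02*fs);                     % 20 ms smoothing window (assumed)
env    = sqrt(conv(x.^2, ones(winLen,1)/winLen, 'same'));    % moving RMS envelope
loud   = env > 0.2*max(env);                 % assumed threshold: 20% of the peak envelope

% Convert the logical mask into start/stop times of each loud segment
d        = diff([0; loud; 0]);
startIdx = find(d == 1);                     % first sample of each segment
stopIdx  = find(d == -1) - 1;                % last sample of each segment
segments = [startIdx stopIdx]/fs;            % one row per segment, in seconds

tx = (0:numel(x)-1)'/fs;
plot(tx, x, tx, loud*max(abs(x)), 'r');
xlabel('Time (s)'); ylabel('Amplitude');
legend('signal', 'above threshold');
```

If other sounds are as loud as the voice, an amplitude threshold alone cannot separate them, and a band-limited energy measure would be needed instead.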

Answers (0)

Asked: 11 May 2014

Commented: 29 Jun 2016
