Classifying environmental sounds using AlexNet

I am trying to extract Spectrograms and MFCC from audio files then calssify them using AlexNet or GooglNet. Any tips ?

 採用された回答

Utkarsh
Utkarsh 2020 年 6 月 19 日

0 投票

Hi Ibrahim,
For classification of spectrogram with alexnet, you might wish to train the last layers of alexnet according to your dataset (spectrograms of audio). For more details about transfer learning with alexnet refer here.
You may also refer here for an approach without alexnet.

3 件のコメント

Ibrahim A
Ibrahim A 2020 年 6 月 19 日
Hi Utkarsh,
Thank you for your response. But I am not sure how to convert my audio dataset from .wav to images so I can classify them with AlexNet. Check out the attached paper to see what I exactly mean.
Thanks
Utkarsh
Utkarsh 2020 年 6 月 19 日
編集済み: Utkarsh 2020 年 6 月 19 日
You may refer here for documentation on how to create a spectrogram for an audio
Have you tried this for saving the spectrogram? (It will save the spectrogram without any border,label,etc)
s = spectrogram(x); %x contains the loaded audio in an array
spectrogram(x,'yaxis')
set(gca, 'Visible', 'off');
colorbar('off');
saveas(gcf,'file_name.png')
Ibrahim A
Ibrahim A 2020 年 6 月 23 日
Thank you Utkarsh

サインインしてコメントする。

その他の回答 (0 件)

カテゴリ

ヘルプ センター および File ExchangeDeep Learning Toolbox についてさらに検索

製品

リリース

R2020a

質問済み:

2020 年 6 月 10 日

コメント済み:

2020 年 6 月 23 日

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by