
Brian Hemmat
Content Feed
Answered
input shape to the LSTM net when doing inference for VAD tasks
I did not look at the dimensions you're discussing, but I can say that you are correct that the "streaming" code in the example ...
18 days ago | 0
| Accepted
Answered
Input data must be a formatted dlarray.
As Matt J said, the mfcc function, like all feature extraction functions provided by Audio Toolbox, doesn't support dlarrays as o...
30 days ago | 0
Answered
code not working in MATLAB 2020b
I can't reproduce this from my end (the download works fine). Can you print the results of ver?
4 months ago | 0
Answered
Free-to-use speech-to-text conversion using MATLAB interface to .NET
Hi Steven, You can try out wav2vec 2.0. You can find a MATLAB implementation here: https://github.com/matlab-deep-learning/wav...
12 months ago | 0
Answered
How do I adapt the "Denoise Speech Using Deep Learning Networks" example to the TIMIT dataset?
The suggestions below are with respect to the R2021b release. If that's not the release you're working from/these changes don't ...
About 1 year ago | 0
| Accepted
Answered
How to remove 3rd person voice from an audio signal?
These examples might be helpful: Cocktail Party Source Separation Using Deep Learning End-to-End Deep Speech Separation They ...
About 1 year ago | 0
Submitted
wav2vec-2.0
This repo provides the pretrained baseline 960 hours wav2vec 2.0 model in MATLAB.
More than 1 year ago | 13 downloads
Answered
Undefined function error in Audio Toolbox and Deep Learning Toolbox
Hi Yifan, To run the example, you'll need to open the example using this command: openExample('deeplearning_shared/DeepLearnin...
More than 1 year ago | 0
| Accepted
Answered
Convolution Reverb audio plugin
Hi Lorenzo, My understanding is that you are developing a function but eventually want to convert it to an audioPlugin object. ...
Almost 2 years ago | 0
| Accepted
Answered
Why am I getting an "out of memory on device" error when trying to run the Speaker Recognition Using X Vectors Example?
Hi Joseph, Are you sharing that GPU with other programs (for example, is it also being used for your graphics?)? That can cause...
Almost 2 years ago | 0
Submitted
Baseline-MATLAB-DCASE
This repo contains an unofficial MATLAB implementation of DCASE2021 Task 1A baseline code, which is part of the DCASE challenge....
Almost 2 years ago | 2 downloads
Answered
Smoothing output in delay plugin
The example plugin, audiopluginexample.DampedVolumeController, might be helpful for this issue.
Almost 2 years ago | 0
| Accepted
Answered
How can I detect audio device changes?
Hi Mattias, Starting in R2020b, you can use audiodevreset to refresh the list of audio devices MATLAB can interact with: https...
About 2 years ago | 0
| Accepted
Answered
Separate voiced and unvoiced frame of speech signal
This example: https://www.mathworks.com/help/audio/ug/speaker-identification-using-pitch-and-mfcc.html#SpeakerIdentificationUsi...
About 2 years ago | 0
| Accepted
Answered
Not sure if I set up this neural network correctly
Hi Saketh, I believe the example you're following is more of a 'hello-world' type example--your current code is trying to accom...
About 2 years ago | 0
| Accepted
Answered
Adding confusion code for LSTM classification on audio files in MATLAB
You can use confusionchart: https://www.mathworks.com/help/deeplearning/ref/confusionchart.html Your output from classify will ...
More than 2 years ago | 0
| Accepted
Answered
How to get magnitude values for a Parametric EQ?
Hi Nathan, The syntax of freqz that you were using in your code above expects w to be normalized frequency. I ran your code an...
More than 2 years ago | 1
| Accepted
Answered
Getting NaN and Inf values after extracting features from Audio files
The features you are extracting (basically statistics about a spectrum) are either not defined or poorly defined for an all-zero...
More than 2 years ago | 0
| Accepted
Answered
FreeverbReverberator example Plugin error
Hi Pablo, Audio plugin API requires plugins to support variable-sized input. The audioexample.FreeverbReverberator does not sup...
More than 2 years ago | 0
| Accepted
Answered
Parameter settings for gtcc feature extraction in the aFE and extract() function
The audioFeatureExtractor object does not support time-domain filtering for GTCC. I am curious, what is your use-case? Are you ...
More than 2 years ago | 0
| Accepted
Answered
MFCC into feature vector
The mfcc function returns mel frequency cepstral coefficients (MFCC) over time. That is, it separates the audio into short wind...
Almost 3 years ago | 0
| Accepted
Answered
Why is the Reverberator output a two-column matrix?
The two columns represent the left and right channels of a stereo recording. The reverberator object is intended to mimic an aco...
Almost 3 years ago | 0
| Accepted
Answered
Unrecognized function or variable 'helperExtractAuditoryFeatures'
Hi Divya, Please click the "open this example" button on the top right corner of the example. The "helperExtractAuditoryFeature...
Almost 3 years ago | 1
Answered
HRTF plugin Filter error
Hi Pablo, The following code compiles for me (and should for you). Initialize the Numerators with the correct size at constr...
Almost 3 years ago | 0
| Accepted
Answered
Averaging Values in a Matrix with Nans
I think this is what you want to do: mean(Strain,2,'omitnan')
Almost 3 years ago | 1
Answered
I want to identify male or female voice in a speech signal
Audio Toolbox provides an example: Classify Gender Using LSTM Networks
Almost 3 years ago | 0
Answered
Speaker recognition
Audio Toolbox provides several examples for speaker recognition (both identification and verification): Speaker Identification ...
About 3 years ago | 0
Answered
Voice recognition / Speaker identification
Audio Toolbox provides examples for speaker verification: A simple one using Gaussian mixture models and a more complicated one ...
About 3 years ago | 0
Answered
Speaker Recognition using MFCC and GMM
Audio Toolbox provides several examples for speaker recognition (both identification and verification): Speaker Identification ...
About 3 years ago | 0