SBAA490A December 2021 – April 2022 PCM6120-Q1, TLV320ADC5120, TLV320ADC6120
This section discusses the VAD results. The algorithm performance is given by a receiver operating characteristic (ROC) curve, which describes the detection performance across different operating thresholds (–12 dB to –3 dB). ROC plots are included for the noise scenarios from the Aurora Noise database (Figure 3-1 Car, Figure 3-2 Restaurant, and Figure 3-3 Subway), combined with speech signals from the NOIZEUS Speech database. Test vectors are generated by mixing noise and speech signals at the desired SNR of 12, 18, and 24 dB, where SNR is the separation between the power levels of the speech and noise signals (for example, a 12-dB SNR means the noise power level is 12 dB below the speech power level). The operating point is at the extreme top left for the –12-dB threshold and moves towards the right as the threshold is increased, indicating better performance at the –7-dB threshold for both speech hit rate and non-speech hit rate (Figure 3-4).
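The SNR mixing described above can be sketched in a few lines of Python. This is only an illustration of the stated definition (noise power set a given number of dB below speech power), not the procedure used to produce the test vectors in this report. The function names `mean_power`, `mix_at_snr`, and `measured_snr_db` are hypothetical, and power is measured over the whole signal. Whether the original test vectors used whole-file power or active-speech power is not stated here.

```python
import math


def mean_power(signal):
    """Average power (mean of squared samples) of a sequence of samples."""
    return sum(x * x for x in signal) / len(signal)


def mix_at_snr(speech, noise, snr_db):
    """Scale `noise` so its power sits `snr_db` below the speech power, then add.

    Returns (mixed, scaled_noise). Assumption: both inputs have equal length
    and power is measured over the full signal.
    """
    if len(speech) != len(noise):
        raise ValueError("speech and noise must have the same length")
    p_speech = mean_power(speech)
    p_noise = mean_power(noise)
    if p_speech == 0 or p_noise == 0:
        raise ValueError("signals must have nonzero power")
    # Target noise power is snr_db below the speech power.
    target_noise_power = p_speech / (10 ** (snr_db / 10))
    # Amplitude gain is the square root of the power ratio.
    gain = math.sqrt(target_noise_power / p_noise)
    scaled_noise = [gain * n for n in noise]
    mixed = [s + n for s, n in zip(speech, scaled_noise)]
    return mixed, scaled_noise


def measured_snr_db(speech, noise):
    """SNR in dB: 10*log10(speech power / noise power)."""
    return 10 * math.log10(mean_power(speech) / mean_power(noise))


if __name__ == "__main__":
    # Synthetic stand-ins for real recordings (illustration only).
    speech = [math.sin(0.1 * i) for i in range(1000)]
    noise = [((i * 7919) % 13 - 6) / 6 for i in range(1000)]
    for target in (12, 18, 24):
        _, scaled = mix_at_snr(speech, noise, target)
        print(target, round(measured_snr_db(speech, scaled), 6))
```

Scaling the noise rather than the speech keeps the speech level constant across the 12-, 18-, and 24-dB conditions, which is one reasonable design choice. Scaling the speech instead would work equally well under the same definition.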
After analyzing the collected data, the –7-dB threshold was chosen to give the best speech hit rate and non-speech hit rate across the different noise types. The ROC curve at the –7-dB threshold for the different noise types is shown in Figure 3-4.
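To make the two metrics concrete, the sketch below computes a speech hit rate and a non-speech hit rate for each threshold in the –12 dB to –3 dB range. Several things here are assumptions, not details from this report: the helper names `hit_rates` and `sweep`, the idea that the VAD exposes a per-frame score in dB, and the rule that a frame is flagged as speech when its score is at or above the threshold. The scores and labels are made-up toy values, not measured data.

```python
def hit_rates(scores_db, is_speech, threshold_db):
    """Return (speech_hit_rate, non_speech_hit_rate) for one threshold.

    Assumption: a frame is flagged as speech when score >= threshold_db.
    """
    speech_total = sum(1 for s in is_speech if s)
    non_speech_total = len(is_speech) - speech_total
    if speech_total == 0 or non_speech_total == 0:
        raise ValueError("need at least one speech and one non-speech frame")
    speech_hits = 0
    non_speech_hits = 0
    for score, truth in zip(scores_db, is_speech):
        detected = score >= threshold_db
        if truth and detected:
            speech_hits += 1          # speech frame correctly flagged
        elif not truth and not detected:
            non_speech_hits += 1      # non-speech frame correctly rejected
    return speech_hits / speech_total, non_speech_hits / non_speech_total


def sweep(scores_db, is_speech, thresholds_db):
    """One (threshold, speech_hit_rate, non_speech_hit_rate) tuple per threshold."""
    return [(t, *hit_rates(scores_db, is_speech, t)) for t in thresholds_db]


if __name__ == "__main__":
    # Toy per-frame scores (dB) and ground-truth labels, invented for illustration.
    scores = [-14, -10, -8, -6, -5, -2, -1, 0]
    labels = [False, False, False, True, False, True, True, True]
    for point in sweep(scores, labels, range(-12, -2)):  # -12 dB .. -3 dB
        print(point)
```

Under these assumptions, raising the threshold trades speech hit rate for non-speech hit rate. Selecting a single threshold across noise types then requires a criterion for weighing the two rates, which this section does not spell out.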