SBAA490A December   2021  – April 2022 PCM6120-Q1 , TAA5212 , TAC5111 , TAC5112 , TAC5211 , TAC5212 , TLV320ADC5120 , TLV320ADC6120

 

  1.   Trademarks
  2. 1Introduction
  3. 2Voice Activity Detector
    1. 2.1 VAD Configurations
    2. 2.2 VAD Parameters
  4. 3VAD Results
  5. 4Examples
  6. 5Related Documentation
  7. 6Revision History

Voice Activity Detector

The VAD block monitors the signal from a microphone channel for voice-like patterns and on detection of a matching pattern that triggers an interrupt. The VAD monitors for both an onset of voice-activity as well as the end of voice-activity. Both events can be mapped to interrupts.

The ADCx120 device also has the capability to automatically power-on and power-off based on the VAD interrupts. As an example, the ADCx120 system can be set up to monitor VAD activity on a digital microphone channel and then power on the analog microphone channels based on the VAD trigger.

There are two modes of VAD operation:

  1. Auto Mode: The VAD triggers automatically to power on and power off the system.
  2. User Mode: The VAD interrupts have to be monitored by the host and the system has to be powered on or off through I2C commands.

Note that in both Auto and User modes, the device also generates an interrupt on the GPIO or GPO pin which can be sent to an external DSP or SOC.

The salient features of VAD are as follows:

  • No need for external clocks - The system reuses an internal oscillator tuned to generate the ADC and DSP clocks of the required frequency.
  • Automatic switching between VAD mode and Record mode - The system switches from VAD mode to Record mode upon voice activity and switches back to VAD mode upon no voice activity in Auto VAD mode.
  • Reconfigurable decision tree from RAM - This feature allows the default voice-activity decision tree to be updated with a different decision tree that has been trained for other acoustic event detections.

The VAD algorithm uses a decision tree classification-based algorithm for voice activity detection. The decision-tree parameters can be updated through coefficient writes so the VAD block can be reconfigured for other applications that need to use a decision-tree for detection. A 16-band non-uniformly-spaced IIR filter-bank is used for feature extraction. Feature selection parameters and the decision tree can be computed offline and updated through coefficient-memory writes. Figure 2-1 shows the signal processing chain for VAD.

GUID-20211207-SS0I-LRJJ-BMHV-P4RJMS597R3P-low.pngFigure 2-1 VAD Block Diagram