Wav2vec2 - Search News

GitHub - leeroopedia/workflow-speechbrain-speechbrain-ctc-asr-training: Train CTC-based ASR systems using wav2vec2/WavLM feature extraction with the SpeechBrain toolkit

What is this? This workflow trains a speech recognition system that converts spoken audio into text. It uses modern self-supervised speech encoders (wav2vec2, WavLM) as powerful feature extractors, ...

IEEE

Speech Emotion Recognition from Bone-Conducted Speech Using Wav2Vec2 Transformer Model

Abstract: Speech emotion recognition (SER) is an essential technology for enhancing human-computer interactions (HCI). While most SER research uses air-conducted (AC) speech, bone-conducted (BC) ...

IEEE

Speech Emotion Recognition Using HuBERT and Wav2Vec2

Abstract: A robust Speech Emotion Recognition (SER) system is designed to improve human-computer interaction, particularly in healthcare, by accurately classifying seven emotions: happiness, sadness, ...

GitHub

Soul-AILab/SoulX-Singer-Eval

We incorporate two MOS (Mean Opinion Score) prediction models to evaluate the subjective appeal of synthesized singing. SingMOS-Pro: A specialized MOS predictor for singing voice, focusing on ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results