Speech Recognition Python Tutorial

Hybrid Contrastive Learning Decoupling Speech Emotion Recognition

Abstract: Speech signals contain rich information, such as textual content, emotion, and speaker identity. To extract these features more efficiently, researchers are investigating joint training ...

OSTechNix

Pocket TTS: High-Quality Local Voice Cloning Without GPU

Pocket TTS delivers high-quality text-to-speech on standard CPUs. No GPU, no cloud APIs. It is the first local TTS with voice ...

GitHub

offline-transcription

An offline video & audio transcription tool powered by OpenAI Whisper. Convert your tutorials, lectures, and podcasts into accurate text transcripts and use AI to generate summaries, notes, and mind ...

GitHub

offline-transcription

An advanced study tool that transforms raw audio recordings and PDF slides into structured, professional LaTeX university notes. Powered by fast local transcription (Whisper) and Google Gemini AI for ...

IEEE

A Lightweight Forward–Backward Independent Temporal-Aware Causal Network for Speech Emotion Recognition

Abstract: Speech Emotion Recognition (SER) technology analyzes speech characteristics in human-computer interactions to understand user intent and improve interaction experience. It is widely used in ...
