Starred repositories
animatediff prompt travel
so-vits-svc fork with realtime support, improved interface and more features.
Realtime Voice Changer
Unified-Modal Speech-Text Pre-Training for Spoken Language Processing
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
Robust Speech Recognition via Large-Scale Weak Supervision
This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
A GPT-4 AI Tutor Prompt for customizable personalized learning experiences.
🚀 AI voice cloning: clone a voice in 5 seconds to generate arbitrary speech in real-time
C0untFloyd / bark-gui
Forked from suno-ai/bark
🔊 Text-Prompted Generative Audio Model with Gradio
TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS)
🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
ImageBind: One Embedding Space to Bind Them All
Clone a voice in 5 seconds to generate arbitrary speech in real-time
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Chinese Mandarin text-to-speech (TTS) by FastSpeech 2, implemented in PyTorch, using WaveGlow as the vocoder, with the Biaobei and AISHELL-3 datasets
PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model
🚀🧠💬 Supercharged Custom Instructions for ChatGPT (non-coding) and ChatGPT Advanced Data Analysis (coding).
An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents
Collection of notebook guides created by the Brev.dev team!
This is the repository for EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation
Fast inference engine for Transformer models
Faster Whisper transcription with CTranslate2
Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!
A high-throughput and memory-efficient inference and serving engine for LLMs
Declarative Animations Library for React and React Native