A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 11,575 2,425 Updated Sep 24, 2024

JushBJJ / Mr.-Ranedeer-AI-Tutor

A GPT-4 AI Tutor Prompt for customizable personalized learning experiences.

28,559 3,281 Updated Mar 25, 2024

fatchord / WaveRNN

WaveRNN Vocoder + TTS

Python 2,127 697 Updated Jul 2, 2022

babysor / MockingBird

🚀 AI voice cloning: clone your voice in 5 seconds and generate arbitrary speech content in real time

Python 34,982 5,194 Updated Aug 29, 2024

C0untFloyd / bark-gui

Forked from suno-ai/bark

🔊 Text-Prompted Generative Audio Model with Gradio

Python 659 60 Updated Nov 23, 2023

rsxdalv / tts-generation-webui

TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS)

TypeScript 1,639 178 Updated Sep 23, 2024

audiojs / sample-rate

List of common sample rates

JavaScript 47 2 Updated May 9, 2018

mozilla / TTS

🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

Jupyter Notebook 9,247 1,241 Updated Nov 9, 2023

facebookresearch / ImageBind

ImageBind: One Embedding Space to Bind Them All

Python 8,237 757 Updated Jul 31, 2024

CorentinJ / Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 52,210 8,731 Updated Aug 14, 2024

coqui-ai / TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 33,651 4,096 Updated Aug 16, 2024

ranchlai / mandarin-tts

Chinese Mandarin text-to-speech (TTS) based on FastSpeech 2, implemented in PyTorch, using WaveGlow as the vocoder, with the Biaobei and AISHELL-3 datasets

Python 461 109 Updated May 28, 2022

Text-to-Audio / Make-An-Audio

PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model

Python 738 108 Updated May 22, 2024

spdustin / ChatGPT-AutoExpert

🚀🧠💬 Supercharged Custom Instructions for ChatGPT (non-coding) and ChatGPT Advanced Data Analysis (coding).

JavaScript 6,596 456 Updated Jan 17, 2024

aiwaves-cn / agents

An Open-source Framework for Data-centric, Self-evolving Autonomous Language Agents

Python 5,174 408 Updated Sep 10, 2024

brevdev / notebooks

Collection of notebook guides created by the Brev.dev team!

Jupyter Notebook 1,628 276 Updated Sep 17, 2024

ZiqiaoPeng / EmoTalk

This is the repository for EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation

116 4 Updated Jul 29, 2023

OpenNMT / CTranslate2

Fast inference engine for Transformer models

C++ 3,237 286 Updated Sep 20, 2024

SYSTRAN / faster-whisper

Faster Whisper transcription with CTranslate2

Python 11,511 957 Updated Aug 21, 2024

pluja / whishper

Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!

Svelte 1,409 82 Updated Sep 17, 2024

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 27,262 3,999 Updated Sep 24, 2024

animatedjs / animated

Declarative Animations Library for React and React Native

JavaScript 1,853 102 Updated Dec 6, 2022

wangbochao

Starred repositories

s9roll7 / animatediff-cli-prompt-travel

Linaqruf / sd-notebook-collection

voicepaw / so-vits-svc-fork

w-okada / voice-changer

microsoft / SpeechT5

lucidrains / audiolm-pytorch

openai / whisper

Plachtaa / VITS-fast-fine-tuning

NVIDIA / NeMo