This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…

Jupyter Notebook 6,533 635 Updated Sep 21, 2024

forthespada / CampusShame

互联网仍有记忆！那些曾经在校招过程中毁过口头offer、意向书、三方的公司！纵然人微言轻，也想尽绵薄之力！

3,127 153 Updated Feb 29, 2024

sh-lee-prml / PeriodWave

The official Implementation of PeriodWave and PeriodWave-Turbo

111 7 Updated Aug 19, 2024

ytsrt66589 / pyneuralfx

Jupyter Notebook 41 3 Updated Aug 13, 2024

X-LANCE / VoiceFlow-TTS

[ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"

Python 299 20 Updated Sep 3, 2024

FunAudioLLM / CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 5,012 509 Updated Sep 23, 2024

lucidrains / vector-quantize-pytorch

Vector (and Scalar) Quantization, in Pytorch

Python 2,419 196 Updated Sep 16, 2024

neonbjb / tortoise-tts

A multi-voice TTS system trained with an emphasis on quality

Jupyter Notebook 12,947 1,785 Updated Aug 19, 2024

HappyColor / SpeechFormer

Official implement of SpeechFormer written in Python (PyTorch).

Python 72 7 Updated Apr 1, 2023

Dao-AILab / flash-attention

Fast and memory-efficient exact attention

Python 13,512 1,241 Updated Sep 24, 2024

archinetai / audio-diffusion-pytorch

Audio generation using diffusion models, in PyTorch.

Python 1,923 167 Updated Jun 12, 2023

scutcsq / Neural-Transducers-for-Two-Stage-Text-to-Speech-via-Semantic-Token-Prediction

Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (arXiv:2401.01498)

Python 58 4 Updated Apr 4, 2024

facebookresearch / demucs

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Python 8,167 1,029 Updated Apr 24, 2024

adelacvg / ttts

Train the next generation of TTS systems.

Python 159 16 Updated Sep 13, 2024

RVC-Boss / GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 32,971 3,794 Updated Sep 17, 2024

YWolfeee / lapjax

A JAX based package designed for efficient second order operators (e.g., laplacian) computation.

Python 69 6 Updated Mar 15, 2024

adelacvg / NS2VC

Unofficial implementation of NaturalSpeech2 for Voice Conversion and Text to Speech

Python 226 12 Updated Feb 29, 2024

HappyColor / SpeechFormer2

SpeechFormer++ in PyTorch

Python 38 8 Updated Jul 21, 2023

HappyColor / Vesper

A Compact and Effective Pretrained Model for Speech Emotion Recognition

Python 25 1 Updated Jun 29, 2024

teticio / audio-diffusion

Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead of images.

Jupyter Notebook 703 70 Updated Jul 16, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Zhipeng Li Leezp99

Highlights

Block or report Leezp99

Stars

yangdongchao / Open-Training-Moshi

supertone-inc / super-monotonic-align

mengchaoheng / SCUT_thesis

Plachtaa / seed-vc

Zeyi-Lin / HivisionIDPhotos

neosapience / SpeechSlicer

3loi / NaturalVoices

NirDiamant / RAG_Techniques