Stars
A timeline of the latest AI models for audio generation, starting in 2023!
Gender recognition by voice and speech analysis
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech). Reproduced demo: https://lifeiteng.github.io/valle/index.html
Avocodo: Generative Adversarial Network for Artifact-free Vocoder
🚀 AI voice cloning: clone a voice in 5 seconds to generate arbitrary speech in real time
Monitor your Docker containers with this web interface.
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with PyTorch
Deep Speaker: an End-to-End Neural Speaker Embedding System.
💬 SpeechPy - A Library for Speech Processing and Recognition: http://speechpy.readthedocs.io/en/latest/