Chengbin-Liang

Follow

Chengbin-Liang

Follow

0 followers · 8 following

Lists (5)

Sort

Automatic Speech Recognition

Denoise

12 repositories

Encodec

Speech-to-Speech Translation

text-to-speech

Beta Lists are currently in beta. Share feedback and report bugs.

Stars

NVIDIA / tacotron2

Tacotron 2 - PyTorch implementation with faster-than-realtime inference

Jupyter Notebook 5,048 1,377 Updated Jun 12, 2024

openai / whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Python 67,834 8,007 Updated Sep 10, 2024

zw76859420 / ASR_Theory

语音识别理论、论文和PPT

581 183 Updated Aug 7, 2024

nl8590687 / ASRT_SpeechRecognition

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

Python 7,753 1,891 Updated Sep 16, 2024

facebookresearch / seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 10,769 1,049 Updated Aug 15, 2024

kahne / SpeechTransProgress

Tracking the progress in end-to-end speech translation

251 27 Updated Oct 25, 2023

Rongjiehuang / awesome-speech-to-speech-translation

List of direct speech-to-speech translation papers.

29 2 Updated Jan 31, 2023

fishaudio / Bert-VITS2

vits2 backbone with multilingual-bert

Python 7,820 1,103 Updated Sep 18, 2024

RVC-Boss / GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 32,875 3,780 Updated Sep 17, 2024

mjpost / sacrebleu

Reference BLEU implementation that auto-downloads test sets and reports a version string to facilitate cross-lab comparisons

Python 1,043 162 Updated Aug 17, 2024

Rongjiehuang / TranSpeech

PyTorch Implementation of TranSpeech (ICLR'23): Textless NAR Speech-to-Speech Translation with Bilateral Perturbation

Python 169 23 Updated Jun 20, 2024

ictnlp / StreamSpeech

StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.

Python 881 66 Updated Aug 24, 2024

charlesliucn / summer-review

🌏 Review notes for Postgraduate Interview of Tsinghua EE. (Sept. 2017)

192 30 Updated Apr 17, 2018

facebookresearch / encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,428 306 Updated Jan 4, 2024

alibabasglab / FRCRN

127 12 Updated Nov 25, 2022

xiph / LPCNet

Efficient neural speech synthesis

C 1,125 295 Updated Sep 21, 2024

haiciyang / SANAC

Jupyter Notebook 4 1 Updated Nov 18, 2021

xiph / rnnoise

Recurrent neural network for audio noise reduction

C 4,000 889 Updated Aug 24, 2024

ruizhecao96 / CMGAN

Conformer-based Metric GAN for speech enhancement

Python 299 58 Updated May 3, 2024

Le-Xiaohuai-speech / DPCRN_DNS3

Implementation of paper "DPCRN: Dual-Path Convolution Recurrent Network for Single Channel Speech Enhancement"

Python 181 40 Updated Apr 22, 2024

huyanxin / DeepComplexCRN

HTML 394 97 Updated Oct 12, 2023

Audio-WestlakeU / FullSubNet

PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."

Python 537 153 Updated Aug 19, 2023

facebookresearch / denoiser

Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a ca…

Python 1,643 301 Updated Mar 14, 2023

Rikorose / DeepFilterNet

Noise supression using deep filtering

Python 2,371 219 Updated Jul 31, 2024

WenzheLiu-Speech / awesome-speech-enhancement

speech enhancement\speech seperation\sound source localization

1,000 220 Updated Nov 14, 2023

vbelz / Speech-enhancement

Deep learning for audio denoising

Python 645 124 Updated Oct 15, 2023

google / lyra

A Very Low-Bitrate Codec for Speech Compression

C++ 3,818 354 Updated Aug 20, 2024

coqui-ai / open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

1,265 137 Updated Jun 6, 2024

fighting41love / zhvoice

Chinese voice corpus. 中文语音语料，语音更加清晰自然，包含8个开源数据集，3200个说话人，900小时语音，1300万字。

578 114 Updated Jun 12, 2020

nanahou / Awesome-Speech-Enhancement

A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.

MATLAB 706 149 Updated Dec 1, 2020