Skip to content
View Chengbin-Liang's full-sized avatar

Block or report Chengbin-Liang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

Tacotron 2 - PyTorch implementation with faster-than-realtime inference

Jupyter Notebook 5,048 1,377 Updated Jun 12, 2024

Robust Speech Recognition via Large-Scale Weak Supervision

Python 67,834 8,007 Updated Sep 10, 2024

语音识别理论、论文和PPT

581 183 Updated Aug 7, 2024

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

Python 7,753 1,891 Updated Sep 16, 2024

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 10,769 1,049 Updated Aug 15, 2024

Tracking the progress in end-to-end speech translation

251 27 Updated Oct 25, 2023

List of direct speech-to-speech translation papers.

29 2 Updated Jan 31, 2023

vits2 backbone with multilingual-bert

Python 7,820 1,103 Updated Sep 18, 2024

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 32,875 3,780 Updated Sep 17, 2024

Reference BLEU implementation that auto-downloads test sets and reports a version string to facilitate cross-lab comparisons

Python 1,043 162 Updated Aug 17, 2024

PyTorch Implementation of TranSpeech (ICLR'23): Textless NAR Speech-to-Speech Translation with Bilateral Perturbation

Python 169 23 Updated Jun 20, 2024

StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.

Python 881 66 Updated Aug 24, 2024

🌏 Review notes for Postgraduate Interview of Tsinghua EE. (Sept. 2017)

192 30 Updated Apr 17, 2018

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,428 306 Updated Jan 4, 2024

Efficient neural speech synthesis

C 1,125 295 Updated Sep 21, 2024
Jupyter Notebook 4 1 Updated Nov 18, 2021

Recurrent neural network for audio noise reduction

C 4,000 889 Updated Aug 24, 2024

Conformer-based Metric GAN for speech enhancement

Python 299 58 Updated May 3, 2024

Implementation of paper "DPCRN: Dual-Path Convolution Recurrent Network for Single Channel Speech Enhancement"

Python 181 40 Updated Apr 22, 2024
HTML 394 97 Updated Oct 12, 2023

PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."

Python 537 153 Updated Aug 19, 2023

Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a ca…

Python 1,643 301 Updated Mar 14, 2023

Noise supression using deep filtering

Python 2,371 219 Updated Jul 31, 2024

speech enhancement\speech seperation\sound source localization

1,000 220 Updated Nov 14, 2023

Deep learning for audio denoising

Python 645 124 Updated Oct 15, 2023

A Very Low-Bitrate Codec for Speech Compression

C++ 3,818 354 Updated Aug 20, 2024

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

1,265 137 Updated Jun 6, 2024

Chinese voice corpus. 中文语音语料,语音更加清晰自然,包含8个开源数据集,3200个说话人,900小时语音,1300万字。

578 114 Updated Jun 12, 2020

A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.

MATLAB 706 149 Updated Dec 1, 2020
Next