Skip to content
View Leezp99's full-sized avatar
  • SCUT
  • Guangzhou
  • 17:22 (UTC +08:00)

Highlights

  • Pro

Block or report Leezp99

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The reproduce training process for Moshi

Python 63 5 Updated Sep 20, 2024

华南理工大学硕博士学位论文模板(LaTeX)。Latex templates for the thesis of South China University of Technology

TeX 288 56 Updated May 27, 2024

zero-shot voice conversion & singing voice conversion with in context learning

Python 243 23 Updated Sep 23, 2024

⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。

Python 10,089 968 Updated Sep 24, 2024
Python 4 Updated Sep 2, 2024
Jupyter Notebook 41 3 Updated Sep 4, 2024

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…

Jupyter Notebook 6,533 635 Updated Sep 21, 2024

互联网仍有记忆!那些曾经在校招过程中毁过口头offer、意向书、三方的公司!纵然人微言轻,也想尽绵薄之力!

3,127 153 Updated Feb 29, 2024

The official Implementation of PeriodWave and PeriodWave-Turbo

111 7 Updated Aug 19, 2024
Jupyter Notebook 41 3 Updated Aug 13, 2024

[ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"

Python 299 20 Updated Sep 3, 2024

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 5,012 509 Updated Sep 23, 2024

Vector (and Scalar) Quantization, in Pytorch

Python 2,419 196 Updated Sep 16, 2024

A multi-voice TTS system trained with an emphasis on quality

Jupyter Notebook 12,947 1,785 Updated Aug 19, 2024

Official implement of SpeechFormer written in Python (PyTorch).

Python 72 7 Updated Apr 1, 2023

Fast and memory-efficient exact attention

Python 13,512 1,241 Updated Sep 24, 2024

Audio generation using diffusion models, in PyTorch.

Python 1,923 167 Updated Jun 12, 2023

Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (arXiv:2401.01498)

Python 58 4 Updated Apr 4, 2024

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Python 8,167 1,029 Updated Apr 24, 2024

Train the next generation of TTS systems.

Python 159 16 Updated Sep 13, 2024

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 32,971 3,794 Updated Sep 17, 2024

A JAX based package designed for efficient second order operators (e.g., laplacian) computation.

Python 69 6 Updated Mar 15, 2024

Unofficial implementation of NaturalSpeech2 for Voice Conversion and Text to Speech

Python 226 12 Updated Feb 29, 2024

SpeechFormer++ in PyTorch

Python 38 8 Updated Jul 21, 2023

A Compact and Effective Pretrained Model for Speech Emotion Recognition

Python 25 1 Updated Jun 29, 2024

Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead of images.

Jupyter Notebook 703 70 Updated Jul 16, 2024