Skip to content
View chenht2021's full-sized avatar
  • Chengdu
  • 13:24 (UTC +08:00)

Block or report chenht2021

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
44 stars written in Jupyter Notebook
Clear filter

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 35,409 4,163 Updated Aug 19, 2024

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 27,470 3,088 Updated Sep 23, 2024

Kolmogorov Arnold Networks

Jupyter Notebook 14,654 1,345 Updated Sep 15, 2024

A multi-voice TTS system trained with an emphasis on quality

Jupyter Notebook 12,945 1,785 Updated Aug 19, 2024

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supportin…

Jupyter Notebook 11,655 1,658 Updated Sep 11, 2024

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 10,775 1,050 Updated Aug 15, 2024

PyTorch code and models for the DINOv2 self-supervised learning method.

Jupyter Notebook 8,843 771 Updated Aug 7, 2024

A series of large language models trained from scratch by developers @01-ai

Jupyter Notebook 7,609 470 Updated Sep 23, 2024

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Jupyter Notebook 7,493 735 Updated Jun 24, 2024
Jupyter Notebook 7,399 524 Updated Jun 16, 2024

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

Jupyter Notebook 6,917 439 Updated Sep 24, 2024

Official Code for Stable Cascade

Jupyter Notebook 6,516 531 Updated Jul 25, 2024

An Open Source text-to-speech system built by inverting Whisper.

Jupyter Notebook 3,787 204 Updated Jun 18, 2024

CoTracker is a model for tracking any point (pixel) on a video.

Jupyter Notebook 2,689 189 Updated Aug 15, 2024

MARS5 speech model (TTS) from CAMB.AI

Jupyter Notebook 2,460 199 Updated Aug 1, 2024

[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching

Jupyter Notebook 625 81 Updated Sep 23, 2024
Jupyter Notebook 601 106 Updated Sep 17, 2023

Code for the paper "LLark: A Multimodal Instruction-Following Language Model for Music" by Josh Gardner, Simon Durand, Daniel Stoller, and Rachel Bittner.

Jupyter Notebook 291 22 Updated May 30, 2024

Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset

Jupyter Notebook 231 15 Updated Mar 14, 2024

EMNLP 23 - Integrating Whisper Encoder to LLaMA Decoder for Generative ASR Error Correction

Jupyter Notebook 224 15 Updated May 19, 2024

CV-VAE: A Compatible Video VAE for Latent Generative Video Models

Jupyter Notebook 210 6 Updated Sep 2, 2024

HF's ML for Audio study group

Jupyter Notebook 182 29 Updated Feb 27, 2023

In this repository, you will learn how code works in VITS(Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech) in Jupyter Notebooks, including normalizing da…

Jupyter Notebook 156 24 Updated Jun 5, 2023

Official codes and models of the paper "Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation"

Jupyter Notebook 142 12 Updated Mar 25, 2024
Jupyter Notebook 139 15 Updated Jan 7, 2024

Predicts the level of noise and reverberation on your audiofiles

Jupyter Notebook 136 24 Updated May 22, 2024

An official implementation of "UnitSpeech: Speaker-adaptive Speech Synthesis with Untranscribed Data"

Jupyter Notebook 131 12 Updated Aug 17, 2023

Pretraining transformer based Thai language models

Jupyter Notebook 116 22 Updated Nov 6, 2023

Dataset and baseline code for the VocalSound dataset (ICASSP2022).

Jupyter Notebook 114 10 Updated Nov 12, 2022

BotSIM - a data-efficient end-to-end Bot SIMulation toolkit for evaluation, diagnosis, and improvement of commercial chatbots

Jupyter Notebook 113 8 Updated Jun 12, 2023
Next