chenht2021

Haitao chenht2021

21 followers · 230 following

Chengdu
13:24 (UTC +08:00)

Achievements

Stars

44 stars written in Jupyter Notebook

Clear filter

suno-ai / bark

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 35,409 4,163 Updated Aug 19, 2024

rasbt / LLMs-from-scratch

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 27,470 3,088 Updated Sep 23, 2024

KindXiaoming / pykan

Kolmogorov Arnold Networks

Jupyter Notebook 14,654 1,345 Updated Sep 15, 2024

neonbjb / tortoise-tts

A multi-voice TTS system trained with an emphasis on quality

Jupyter Notebook 12,945 1,785 Updated Aug 19, 2024

meta-llama / llama-recipes

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supportin…

Jupyter Notebook 11,655 1,658 Updated Sep 11, 2024

facebookresearch / seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 10,775 1,050 Updated Aug 15, 2024

facebookresearch / dinov2

PyTorch code and models for the DINOv2 self-supervised learning method.

Jupyter Notebook 8,843 771 Updated Aug 7, 2024

01-ai / Yi

A series of large language models trained from scratch by developers @01-ai

Jupyter Notebook 7,609 470 Updated Sep 23, 2024

jasonppy / VoiceCraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Jupyter Notebook 7,493 735 Updated Jun 24, 2024

Vaibhavs10 / insanely-fast-whisper

Jupyter Notebook 7,399 524 Updated Jun 16, 2024

OpenBMB / MiniCPM

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

Jupyter Notebook 6,917 439 Updated Sep 24, 2024

Stability-AI / StableCascade

Official Code for Stable Cascade

Jupyter Notebook 6,516 531 Updated Jul 25, 2024

collabora / WhisperSpeech

An Open Source text-to-speech system built by inverting Whisper.

Jupyter Notebook 3,787 204 Updated Jun 18, 2024

facebookresearch / co-tracker

CoTracker is a model for tracking any point (pixel) on a video.

Jupyter Notebook 2,689 189 Updated Aug 15, 2024

Camb-ai / MARS5-TTS

MARS5 speech model (TTS) from CAMB.AI

Jupyter Notebook 2,460 199 Updated Aug 1, 2024

shivammehta25 / Matcha-TTS

[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching

Jupyter Notebook 625 81 Updated Sep 23, 2024

rasbt / scipy2023-deeplearning

Jupyter Notebook 601 106 Updated Sep 17, 2023

spotify-research / llark

Code for the paper "LLark: A Multimodal Instruction-Following Language Model for Music" by Josh Gardner, Simon Durand, Daniel Stoller, and Rachel Bittner.

Jupyter Notebook 291 22 Updated May 30, 2024

TXH-mercury / VAST

Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset

Jupyter Notebook 231 15 Updated Mar 14, 2024

Srijith-rkr / Whispering-LLaMA

EMNLP 23 - Integrating Whisper Encoder to LLaMA Decoder for Generative ASR Error Correction

Jupyter Notebook 224 15 Updated May 19, 2024

AILab-CVC / CV-VAE

CV-VAE: A Compatible Video VAE for Latent Generative Video Models

Jupyter Notebook 210 6 Updated Sep 2, 2024

Vaibhavs10 / ml-with-audio

HF's ML for Audio study group

Jupyter Notebook 182 29 Updated Feb 27, 2023

JunityZhan / Understanding-VITS

In this repository, you will learn how code works in VITS(Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech) in Jupyter Notebooks, including normalizing da…

Jupyter Notebook 156 24 Updated Jun 5, 2023

happylittlecat2333 / Auffusion

Official codes and models of the paper "Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation"

Jupyter Notebook 142 12 Updated Mar 25, 2024

CODEJIN / NaturalSpeech2

Jupyter Notebook 139 15 Updated Jan 7, 2024

marianne-m / brouhaha-vad

Predicts the level of noise and reverberation on your audiofiles

Jupyter Notebook 136 24 Updated May 22, 2024

gmltmd789 / UnitSpeech

An official implementation of "UnitSpeech: Speaker-adaptive Speech Synthesis with Untranscribed Data"

Jupyter Notebook 131 12 Updated Aug 17, 2023

vistec-AI / thai2transformers

Pretraining transformer based Thai language models

Jupyter Notebook 116 22 Updated Nov 6, 2023

YuanGongND / vocalsound

Dataset and baseline code for the VocalSound dataset (ICASSP2022).

Jupyter Notebook 114 10 Updated Nov 12, 2022

salesforce / botsim

BotSIM - a data-efficient end-to-end Bot SIMulation toolkit for evaluation, diagnosis, and improvement of commercial chatbots

Jupyter Notebook 113 8 Updated Jun 12, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Haitao chenht2021

Achievements

Achievements

Block or report chenht2021

Stars

suno-ai / bark

rasbt / LLMs-from-scratch

KindXiaoming / pykan

neonbjb / tortoise-tts

meta-llama / llama-recipes

facebookresearch / seamless_communication

facebookresearch / dinov2

01-ai / Yi

jasonppy / VoiceCraft

Vaibhavs10 / insanely-fast-whisper

OpenBMB / MiniCPM

Stability-AI / StableCascade

collabora / WhisperSpeech

facebookresearch / co-tracker

Camb-ai / MARS5-TTS

shivammehta25 / Matcha-TTS

rasbt / scipy2023-deeplearning

spotify-research / llark

TXH-mercury / VAST

Srijith-rkr / Whispering-LLaMA

AILab-CVC / CV-VAE

Vaibhavs10 / ml-with-audio

JunityZhan / Understanding-VITS

happylittlecat2333 / Auffusion

CODEJIN / NaturalSpeech2

marianne-m / brouhaha-vad

gmltmd789 / UnitSpeech

vistec-AI / thai2transformers

YuanGongND / vocalsound

salesforce / botsim