amanteur

Multitrack music mixing style transfer given a reference song, using a differentiable mixing console.

Jupyter Notebook 33 1 Updated Sep 4, 2024
Python 5,125 384 Updated Sep 24, 2024

S2cap ♥: Constructing a Singing Style Caption Dataset

6 Updated Sep 19, 2024

TS-BSmamba2: A Two-Stage Band-Split Mamba-2 Network for Music Separation

Python 31 Updated Sep 16, 2024

SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.

Python 31 3 Updated Sep 21, 2024

Implementation of the paper "Self-supervised Learning with Random-projection Quantizer for Speech Recognition" in PyTorch.

Python 57 2 Updated May 25, 2023

Pitch-shift audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.

Python 131 10 Updated Apr 13, 2023

Official PyTorch implementation of BigVGAN (ICLR 2023)

Python 848 96 Updated Sep 5, 2024

MuChoMusic is a benchmark for evaluating music understanding in multimodal audio-language models.

Jupyter Notebook 22 1 Updated Aug 9, 2024

A repository of implementations of neural speech editing algorithms.

Python 186 19 Updated Jan 9, 2024

Text-to-Music Generation with Rectified Flow Transformers

Python 1,462 110 Updated Sep 6, 2024

This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024)

Python 120 11 Updated Sep 9, 2024

🎚️ Open Source Audio Matching and Mastering

Python 1,314 154 Updated Jul 24, 2024

PyTorch implementation of Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities.

Python 170 9 Updated Aug 20, 2024

An extremely fast Python linter and code formatter, written in Rust.

Rust 31,258 1,042 Updated Sep 24, 2024

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 11,579 2,425 Updated Sep 24, 2024

VISinger 2: High-Fidelity End-to-End Singing Voice Synthesis Enhanced by Digital Signal Processing Synthesizer

Python 309 42 Updated Jul 17, 2024

Zero-shot voice conversion and singing voice conversion with in-context learning

Python 242 23 Updated Sep 24, 2024

Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction

Python 25 Updated Sep 14, 2024

Open-source multimodal large language model that can hear and talk while thinking, featuring real-time end-to-end speech input and streaming audio output conversational capabilities.

Python 2,609 245 Updated Sep 14, 2024

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Python 4,745 388 Updated Aug 10, 2024

High-quality multi-lingual text-to-speech library by MyShell.ai. Supports English, Spanish, French, Chinese, Japanese and Korean.

Python 4,443 556 Updated Aug 9, 2024

HiFTNet wav/audio super-resolution 16/24 kHz to 48 kHz

Python 22 2 Updated Jan 2, 2024

Official Implementation of Interspeech 2024 Paper "Hold Me Tight: Stable Encoder-Decoder Design for Speech Enhancement"

Python 27 Updated Sep 20, 2024
Python 5 Updated Jun 11, 2024

Fine-tune Stable Audio Open with DiT ControlNet.

Python 158 3 Updated Sep 2, 2024

🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

1,679 223 Updated Jun 6, 2024

SOTA discrete acoustic codec models with 40 tokens per second for audio language modeling

Python 667 39 Updated Sep 21, 2024

Boosting Self-Supervised Embeddings for Speech Enhancement

Python 42 4 Updated Jun 23, 2022

Scientific literature about Audio Effects

HTML 123 2 Updated Sep 3, 2024