Skip to content
View ljuvela's full-sized avatar

Highlights

  • Pro

Block or report ljuvela

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

an architecture for neural network inference in real-time audio applications

C++ 87 1 Updated Sep 23, 2024
Python 5,030 378 Updated Sep 24, 2024
Jupyter Notebook 41 3 Updated Aug 13, 2024

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Python 1,128 102 Updated Jul 11, 2024

Differentiable augmentation and robustness evaluation for audio

Python 3 Updated Sep 19, 2024

Official Repository of Unsupervised Lead Sheet Generation via Semantic Compression

Python 15 1 Updated Oct 23, 2023

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Python 1,909 504 Updated Jul 27, 2024

Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.

Python 926 87 Updated Sep 2, 2024

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

Python 1,774 527 Updated Oct 27, 2023

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 34,844 4,053 Updated Sep 23, 2024

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Python 6,693 1,229 Updated Dec 6, 2023

An unofficial PyTorch implementation of the audio LM VALL-E

Python 2,936 417 Updated May 10, 2023

Generative models for conditional audio generation

Python 2,537 235 Updated Jul 15, 2024

Flops counter for convolutional networks in pytorch framework

Python 2,785 309 Updated Jul 16, 2024

[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching

Jupyter Notebook 625 81 Updated Sep 23, 2024
Python 1,367 185 Updated Feb 11, 2024

An autoregressive character-level language model for making more things

Python 2,478 652 Updated Jun 4, 2024

Localized watermarking for AI-generated speech audios, with SOTA on robustness and very fast detector

Python 416 49 Updated Aug 28, 2024

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 10,777 1,050 Updated Aug 15, 2024

AI-based Audio Watermarking Tool

Python 212 29 Updated Jan 7, 2024

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 4,474 384 Updated Sep 23, 2024

✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models. The first work to correct hallucinations in MLLMs.

Python 599 29 Updated Jun 17, 2024

A DDSP-based neural voice synthesiser.

Jupyter Notebook 96 6 Updated Sep 7, 2024

DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.

Python 756 111 Updated Mar 26, 2024

Companion code for the Modern Real-Time Audio Programming course.

C++ 17 3 Updated Aug 30, 2023

ASVspoof 2021 Baseline Systems

Python 197 75 Updated Jun 6, 2024
Python 27 5 Updated Jul 16, 2023

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Python 20,661 2,104 Updated Jul 18, 2024

Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, Diverse Synthesis, and Generative Modeling and Fine-Grained …

Roff 281 40 Updated Apr 6, 2023

Fast CUDA implementation of (differentiable) soft dynamic time warping for PyTorch

Python 615 58 Updated Apr 3, 2024
Next