ljuvela

Lauri Juvela ljuvela

Assistant Professor in Machine Learning for Speech and Language Technology at Aalto University. Interested in generative Deep Learning for speech and audio.

104 followers · 15 following

Aalto University
Helsinki, Finland
https://orcid.org/0000-0002-2201-103X

Achievements

Highlights

Stars

anira-project / anira

an architecture for neural network inference in real-time audio applications

C++ 87 1 Updated Sep 23, 2024

kyutai-labs / moshi

Python 5,030 378 Updated Sep 24, 2024

ytsrt66589 / pyneuralfx

Jupyter Notebook 41 3 Updated Aug 13, 2024

descriptinc / descript-audio-codec

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Python 1,128 102 Updated Jul 11, 2024

ljuvela / DAREA

Differentiable augmentation and robustness evaluation for audio

Python 3 Updated Sep 19, 2024

ZacharyNovack / Lead-AE

Official Repository of Unsupervised Lead Sheet Generation via Semantic Compression

Python 15 1 Updated Oct 23, 2023

jik876 / hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Python 1,909 504 Updated Jul 27, 2024

asteroid-team / torch-audiomentations

Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.

Python 926 87 Updated Sep 2, 2024

ming024 / FastSpeech2

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

Python 1,774 527 Updated Oct 27, 2023

microsoft / DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 34,844 4,053 Updated Sep 23, 2024

jaywalnut310 / vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Python 6,693 1,229 Updated Dec 6, 2023

enhuiz / vall-e

An unofficial PyTorch implementation of the audio LM VALL-E

Python 2,936 417 Updated May 10, 2023

Stability-AI / stable-audio-tools

Generative models for conditional audio generation

Python 2,537 235 Updated Jul 15, 2024

sovrasov / flops-counter.pytorch

Flops counter for convolutional networks in pytorch framework

Python 2,785 309 Updated Jul 16, 2024

shivammehta25 / Matcha-TTS

[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching

Jupyter Notebook 625 81 Updated Sep 23, 2024

microsoft / NeuralSpeech

Python 1,367 185 Updated Feb 11, 2024

karpathy / makemore

An autoregressive character-level language model for making more things

Python 2,478 652 Updated Jun 4, 2024

facebookresearch / audioseal

Localized watermarking for AI-generated speech audios, with SOTA on robustness and very fast detector

Python 416 49 Updated Aug 28, 2024

facebookresearch / seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 10,777 1,050 Updated Aug 15, 2024

wavmark / wavmark

AI-based Audio Watermarking Tool

Python 212 29 Updated Jan 7, 2024

open-mmlab / Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 4,474 384 Updated Sep 23, 2024

BradyFU / Woodpecker

✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models. The first work to correct hallucinations in MLLMs.

Python 599 29 Updated Jun 17, 2024

yoyololicon / golf

A DDSP-based neural voice synthesiser.

Jupyter Notebook 96 6 Updated Sep 7, 2024

lmnt-com / diffwave

DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.

Python 756 111 Updated Mar 26, 2024

Neural-DSP / modern-rt-audio-course

Companion code for the Modern Real-Time Audio Programming course.

C++ 17 3 Updated Aug 30, 2023

asvspoof-challenge / 2021

ASVspoof 2021 Baseline Systems

Python 197 75 Updated Jun 6, 2024

dsuedholt / vocal-tract-grad

Python 27 5 Updated Jul 16, 2023

facebookresearch / audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Python 20,661 2,104 Updated Jul 18, 2024

NVIDIA / radtts

Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, Diverse Synthesis, and Generative Modeling and Fine-Grained …

Roff 281 40 Updated Apr 6, 2023

Maghoumi / pytorch-softdtw-cuda

Fast CUDA implementation of (differentiable) soft dynamic time warping for PyTorch

Python 615 58 Updated Apr 3, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Lauri Juvela ljuvela

Achievements

Achievements

Highlights

Block or report ljuvela

Stars

anira-project / anira

kyutai-labs / moshi

ytsrt66589 / pyneuralfx

descriptinc / descript-audio-codec

ljuvela / DAREA

ZacharyNovack / Lead-AE

jik876 / hifi-gan

asteroid-team / torch-audiomentations

ming024 / FastSpeech2

microsoft / DeepSpeed

jaywalnut310 / vits

enhuiz / vall-e

Stability-AI / stable-audio-tools

sovrasov / flops-counter.pytorch

shivammehta25 / Matcha-TTS

microsoft / NeuralSpeech

karpathy / makemore

facebookresearch / audioseal

facebookresearch / seamless_communication

wavmark / wavmark

open-mmlab / Amphion

BradyFU / Woodpecker

yoyololicon / golf

lmnt-com / diffwave

Neural-DSP / modern-rt-audio-course

asvspoof-challenge / 2021

dsuedholt / vocal-tract-grad

facebookresearch / audiocraft

NVIDIA / radtts

Maghoumi / pytorch-softdtw-cuda