water1905

4 followers · 19 following

Stars

AME430 / Towards-Training-Explainable-Singing-Quality-Assessment-Network-with-Augmented-Data

Code for the paper "Towards Training Explainable Singing Quality Assessment Network with Augmented Data"

Python 13 1 Updated Dec 7, 2021

mozilla / TTS

🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

Jupyter Notebook 9,235 1,238 Updated Nov 9, 2023

pytorch / audio

Data manipulation and transformation for audio signal processing, powered by PyTorch

Python 2,486 643 Updated Sep 20, 2024

zafarrafii / CQHC-Python

Constant-Q harmonic coefficients (CQHCs), a timbre feature designed for music signals.

Jupyter Notebook 26 1 Updated Feb 16, 2024

Apm5 / ImageNet_ResNet_Tensorflow2.0

Train ResNet on ImageNet in Tensorflow 2.0; complete training code for ResNet on ImageNet

Python 83 32 Updated Sep 25, 2020

facebookresearch / fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 30,184 6,373 Updated Sep 9, 2024

mimbres / neural-audio-fp

Python 175 25 Updated Aug 1, 2024

justinsalamon / audio_to_midi_melodia

Extract the melody from an audio file and export to MIDI

Python 562 103 Updated Apr 3, 2020

polarch / Array-Response-Simulator

A set of routines that simulate array responses for sensors with arbitrary geometry and directional characteristics.

MATLAB 47 14 Updated Nov 12, 2016

CoinCheung / pytorch-loss

label-smooth, amsoftmax, partial-fc, focal-loss, triplet-loss, lovasz-softmax. Maybe useful

Python 2,159 373 Updated Sep 29, 2022

pengzhiliang / MAE-pytorch

Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners

Python 2,585 344 Updated Jul 25, 2023

facebookresearch / ConvNeXt

Code release for ConvNeXt model

Python 5,705 692 Updated Jan 8, 2023

lucidrains / vit-pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Python 19,684 2,961 Updated Aug 28, 2024

haoheliu / voicefixer

General Speech Restoration

Python 999 130 Updated May 31, 2024

ildoonet / pytorch-gradual-warmup-lr

Gradually-Warmup Learning Rate Scheduler for PyTorch

Python 971 124 Updated Jul 15, 2021

wenet-e2e / WenetSpeech

A 10000+ hours dataset for Chinese speech recognition

Shell 490 48 Updated Jul 3, 2023

bytedance / music_source_separation

Python 1,257 193 Updated Apr 18, 2024

DTennant / reid_baseline_with_syncbn

Reimplementation of Bag of Tricks and A Strong Baseline for Deep Person Re-identification

Python 157 35 Updated Apr 3, 2020

seongmin-kye / CAP

Cross attentive pooling for speaker verification (IEEE SLT, 2021)

Python 12 6 Updated Dec 14, 2020

seongmin-kye / meta-SR

Pytorch implementation of Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length Pairs (Interspeech, 2020)

Python 73 19 Updated Sep 16, 2020

meinardmueller / libfmp

libfmp - Python package for teaching and learning Fundamentals of Music Processing (FMP)

Python 193 18 Updated Feb 21, 2024

michuanhaohao / reid-strong-baseline

Bag of Tricks and A Strong Baseline for Deep Person Re-identification

Python 2,243 573 Updated Apr 23, 2020

andreasveit / densenet-pytorch

A PyTorch Implementation for Densely Connected Convolutional Networks (DenseNets)

Python 456 141 Updated Feb 28, 2018

KinWaiCheuk / nnAudio

Audio processing by using pytorch 1D convolution network

Python 1,009 89 Updated Feb 13, 2024

facebookresearch / BinauralSpeechSynthesis

N/A

Python 163 19 Updated May 19, 2022

SoundScapeRenderer / ssr

Main source code repository for the SoundScape Renderer

C++ 132 52 Updated Sep 1, 2024

JensAhrens / soundfieldsynthesis

Matlab code for the book "Analytic Methods of Sound Field Synthesis"

MATLAB 23 10 Updated Jul 29, 2020

macosforge / alac

The Apple Lossless Audio Codec (ALAC) is a lossless audio codec developed by Apple and deployed on all of its platforms and devices.

C++ 342 63 Updated Jul 29, 2020

wq2012 / VoiceIdentityBook

《声纹技术：从核心算法到工程实践》 (Voice Identity Techniques: From Core Algorithms to Engineering Practice)

148 19 Updated Sep 12, 2022

shanwangshan / Low-latency_deep_clustering_for_speech_separation

Python 3 1 Updated Aug 6, 2019
