Stars
Code for the paper "Towards Training Explainable Singing Quality Assessment Network with Augmented Data"
🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
Data manipulation and transformation for audio signal processing, powered by PyTorch
Constant-Q harmonic coefficients (CQHCs), a timbre feature designed for music signals.
Train ResNet on ImageNet in TensorFlow 2.0; complete training code for ResNet on ImageNet
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Extract the melody from an audio file and export to MIDI
A set of routines that simulate array responses for sensors with arbitrary geometry and directional characteristics.
label-smooth, amsoftmax, partial-fc, focal-loss, triplet-loss, lovasz-softmax. May be useful.
Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch
Gradually-Warmup Learning Rate Scheduler for PyTorch
A 10000+ hours dataset for Chinese speech recognition
Reimplementation of Bag of Tricks and A Strong Baseline for Deep Person Re-identification
Cross attentive pooling for speaker verification (IEEE SLT, 2021)
PyTorch implementation of Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length Pairs (Interspeech, 2020)
libfmp - Python package for teaching and learning Fundamentals of Music Processing (FMP)
Bag of Tricks and A Strong Baseline for Deep Person Re-identification
A PyTorch Implementation for Densely Connected Convolutional Networks (DenseNets)
Audio processing using a PyTorch 1D convolutional network
Main source code repository for the SoundScape Renderer
Matlab code for the book "Analytic Methods of Sound Field Synthesis"
The Apple Lossless Audio Codec (ALAC) is a lossless audio codec developed by Apple and deployed on all of its platforms and devices.