Skip to content
View water1905's full-sized avatar

Block or report water1905

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Codes for paper -- Towards Training Explainable Singing Quality Assessment Network with Augmented Data

Python 13 1 Updated Dec 7, 2021

🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

Jupyter Notebook 9,235 1,238 Updated Nov 9, 2023

Data manipulation and transformation for audio signal processing, powered by PyTorch

Python 2,486 643 Updated Sep 20, 2024

Constant-Q harmonic coefficients (CQHCs), a timbre feature designed for music signals.

Jupyter Notebook 26 1 Updated Feb 16, 2024

Train ResNet on ImageNet in Tensorflow 2.0; ResNet 在ImageNet上完整训练代码

Python 83 32 Updated Sep 25, 2020

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 30,184 6,373 Updated Sep 9, 2024
Python 175 25 Updated Aug 1, 2024

Extract the melody from an audio file and export to MIDI

Python 562 103 Updated Apr 3, 2020

A set of routines that simulate array responses for sensors with arbitrary geometry and directional characteristics.

MATLAB 47 14 Updated Nov 12, 2016

label-smooth, amsoftmax, partial-fc, focal-loss, triplet-loss, lovasz-softmax. Maybe useful

Python 2,159 373 Updated Sep 29, 2022

Unofficial PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners

Python 2,585 344 Updated Jul 25, 2023

Code release for ConvNeXt model

Python 5,705 692 Updated Jan 8, 2023

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Python 19,684 2,961 Updated Aug 28, 2024

General Speech Restoration

Python 999 130 Updated May 31, 2024

Gradually-Warmup Learning Rate Scheduler for PyTorch

Python 971 124 Updated Jul 15, 2021

A 10000+ hours dataset for Chinese speech recognition

Shell 490 48 Updated Jul 3, 2023

Reimplementation of Bag of Tricks and A Strong Baseline for Deep Person Re-identification

Python 157 35 Updated Apr 3, 2020

Cross attentive pooling for speaker verification (IEEE SLT, 2021)

Python 12 6 Updated Dec 14, 2020

Pytorch implementation of Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length Pairs (Interspeech, 2020)

Python 73 19 Updated Sep 16, 2020

libfmp - Python package for teaching and learning Fundamentals of Music Processing (FMP)

Python 193 18 Updated Feb 21, 2024

Bag of Tricks and A Strong Baseline for Deep Person Re-identification

Python 2,243 573 Updated Apr 23, 2020

A PyTorch Implementation for Densely Connected Convolutional Networks (DenseNets)

Python 456 141 Updated Feb 28, 2018

Audio processing by using pytorch 1D convolution network

Python 1,009 89 Updated Feb 13, 2024

Main source code repository for the SoundScape Renderer

C++ 132 52 Updated Sep 1, 2024

Matlab code for the book "Analytic Methods of Sound Field Synthesis"

MATLAB 23 10 Updated Jul 29, 2020

The Apple Lossless Audio Codec (ALAC) is a lossless audio codec developed by Apple and deployed on all of its platforms and devices.

C++ 342 63 Updated Jul 29, 2020

《声纹技术:从核心算法到工程实践》

148 19 Updated Sep 12, 2022
Next