eric102004

Research Assistant at NTU SPML Lab

10 followers · 5 following

National Taiwan University
Taipei

Starred repositories

IDRnD / ReDimNet

The official PyTorch implementation of the Interspeech 2024 paper "Reshape Dimensions Network for Speaker Recognition"

Python 92 5 Updated Sep 3, 2024

HuangZiliAndy / SSL_for_multitalker

Adapting Self-Supervised Models to Multi-Talker Speech Recognition Using Speaker Embeddings

Shell 26 1 Updated Mar 16, 2023

LingweiMeng / Whisper-Sidecar

The implementation for "Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System".

Python 13 Updated Sep 20, 2024

sooftware / attentions

PyTorch implementation of some attentions for Deep Learning Researchers.

Python 511 70 Updated Mar 4, 2022

cooelf / AwesomeMRC

IJCAI 2021 Tutorial & code for Retrospective Reader for Machine Reading Comprehension (AAAI 2021)

Python 361 66 Updated Sep 6, 2023

psaylor / spoke-client

Spoke client-side library for audio and speech recognition

JavaScript 3 3 Updated Jul 9, 2015

psaylor / spoke

A framework for building speech-enabled websites.

JavaScript 9 2 Updated Jul 10, 2015

ffaisal93 / SD-QA

Jupyter Notebook 14 2 Updated Jun 4, 2022

YannickJadoul / Parselmouth

Praat in Python, the Pythonic way

C++ 1,052 115 Updated Aug 15, 2024

xwhan / pylucene-bm25

Lucene open-domain QA retrieval in python

Python 11 3 Updated Feb 18, 2021

googleapis / python-speech

This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-speech

357 210 Updated Oct 31, 2023

jim-schwoebel / voice_datasets

🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

1,679 223 Updated Jun 6, 2024

DanielLin94144 / DUAL-textless-SQA

Textless (ASR-transcript free) Spoken Question Answering. The official release of NMSQA dataset and the implementation of "DUAL: Textless Spoken Question Answering with Speech Discrete Unit Adaptiv…

Python 34 11 Updated Aug 10, 2023

danqi / acl2020-openqa-tutorial

ACL2020 Tutorial: Open-Domain Question Answering

834 85 Updated Jan 1, 2021

roger-tseng / self-supervised-vq-segmentation

Simple implementation of dynamic programming based phoneme segmentation method given in "Towards unsupervised phone and word segmentation using self-supervised vector-quantized neural networks" (ht…

Jupyter Notebook 6 3 Updated Jun 15, 2022

Uberi / speech_recognition

Speech recognition module for Python, supporting several engines and APIs, online and offline.

Python 8,334 2,396 Updated Aug 13, 2024

microsoft / ANCE

A novel embedding training algorithm that leverages ANN search and achieves SOTA retrieval on TREC DL 2019 and OpenQA benchmarks

Python 359 49 Updated Jun 12, 2023

dorianbrown / rank_bm25

A Collection of BM25 Algorithms in Python

Python 991 83 Updated May 28, 2024

facebookresearch / faiss

A library for efficient similarity search and clustering of dense vectors.

C++ 30,613 3,569 Updated Sep 23, 2024

huggingface / tokenizers

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

Rust 8,903 770 Updated Sep 17, 2024

google / sentencepiece

Unsupervised text tokenizer for Neural Network-based text generation.

C++ 10,105 1,165 Updated Sep 1, 2024

facebookresearch / DPR

Dense Passage Retriever: a set of tools and models for the open-domain Q&A task.

Python 1,703 299 Updated Apr 6, 2023

voidism / mfcc_extractor

Simple one-line scripts to extract reliable MFCC features with librosa and store them in an HDF5 file.

Python 2 Updated Sep 26, 2019

google-research / language

Shared repository for open-sourced projects from the Google AI Language team.

Python 1,606 346 Updated Aug 20, 2024

Lightning-AI / pytorch-lightning

Pretrain, finetune, and deploy AI models on multiple GPUs and TPUs with zero code changes.

Python 28,028 3,360 Updated Sep 20, 2024

Lightning-Universe / lightning-bolts

Toolbox of models, callbacks, and datasets for AI/ML researchers.

Python 1,679 320 Updated Sep 16, 2024

cyhuang-tw / AdaIN-VC

An unofficial implementation of the paper "One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Normalization".

Python 114 20 Updated May 27, 2021

CorentinJ / Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 52,208 8,732 Updated Aug 14, 2024

lixucuhk / Channel-wise-Gated-Res2Net

Implementation of the paper: Channel-wise Gated Res2Net: Towards Robust Detection of Synthetic Speech Attacks (INTERSPEECH 2021)

Shell 29 5 Updated Jul 21, 2021

Leethony / Additive-Margin-Softmax-Loss-Pytorch

Forked from cvqluu/Angular-Penalty-Softmax-Losses-Pytorch

Additive margin softmax loss in PyTorch

Python 45 13 Updated Jun 17, 2019

Starred topics

MATLAB

Arduino

Python

Deep learning