Skip to content
View eric102004's full-sized avatar
  • National Taiwan University
  • Taipei

Highlights

  • Pro

Block or report eric102004

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

The official pytorch implemention of the Intespeech 2024 paper "Reshape Dimensions Network for Speaker Recognition"

Python 92 5 Updated Sep 3, 2024

ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS

Shell 26 1 Updated Mar 16, 2023

The implementation for "Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System".

Python 13 Updated Sep 20, 2024

PyTorch implementation of some attentions for Deep Learning Researchers.

Python 511 70 Updated Mar 4, 2022

IJCAI 2021 Tutorial & code for Retrospective Reader for Machine Reading Comprehension (AAAI 2021)

Python 361 66 Updated Sep 6, 2023

Spoke client-side library for audio and speech recognition

JavaScript 3 3 Updated Jul 9, 2015

A framework for building speech-enabled websites.

JavaScript 9 2 Updated Jul 10, 2015
Jupyter Notebook 14 2 Updated Jun 4, 2022

Praat in Python, the Pythonic way

C++ 1,052 115 Updated Aug 15, 2024

Lucene open-domain QA retrieval in python

Python 11 3 Updated Feb 18, 2021

This library has moved to https://github.com/googleapis/google-cloud-python/tree/main/packages/google-cloud-speech

357 210 Updated Oct 31, 2023

🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

1,679 223 Updated Jun 6, 2024

Textless (ASR-transcript free) Spoken Question Answering. The official release of NMSQA dataset and the implementation of "DUAL: Textless Spoken Question Answering with Speech Discrete Unit Adaptiv…

Python 34 11 Updated Aug 10, 2023

ACL2020 Tutorial: Open-Domain Question Answering

834 85 Updated Jan 1, 2021

Simple implementation of dynamic programming based phoneme segmentation method given in "Towards unsupervised phone and word segmentation using self-supervised vector-quantized neural networks" (ht…

Jupyter Notebook 6 3 Updated Jun 15, 2022

Speech recognition module for Python, supporting several engines and APIs, online and offline.

Python 8,334 2,396 Updated Aug 13, 2024

A novel embedding training algorithm leveraging ANN search and achieved SOTA retrieval on Trec DL 2019 and OpenQA benchmarks

Python 359 49 Updated Jun 12, 2023

A Collection of BM25 Algorithms in Python

Python 991 83 Updated May 28, 2024

A library for efficient similarity search and clustering of dense vectors.

C++ 30,613 3,569 Updated Sep 23, 2024

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

Rust 8,903 770 Updated Sep 17, 2024

Unsupervised text tokenizer for Neural Network-based text generation.

C++ 10,105 1,165 Updated Sep 1, 2024

Dense Passage Retriever - is a set of tools and models for open domain Q&A task.

Python 1,703 299 Updated Apr 6, 2023

Simple one-line scripts to extract reliable MFCC features with librosa and store in HDF5 format file.

Python 2 Updated Sep 26, 2019

Shared repository for open-sourced projects from the Google AI Language team.

Python 1,606 346 Updated Aug 20, 2024

Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.

Python 28,028 3,360 Updated Sep 20, 2024

Toolbox of models, callbacks, and datasets for AI/ML researchers.

Python 1,679 320 Updated Sep 16, 2024

An unofficial implementation of the paper "One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Normalization".

Python 114 20 Updated May 27, 2021

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 52,208 8,732 Updated Aug 14, 2024

Implementation of the paper: Channel-wise Gated Res2Net: Towards Robust Detection of Synthetic Speech Attacks (INTERSPEECH 2021)

Shell 29 5 Updated Jul 21, 2021

Additive margin softmax loss in pytorch

Python 45 13 Updated Jun 17, 2019
Next