A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 6,117 653 Updated Sep 26, 2024

pika-online / funasr_seaco_paraformer_onnx_with_timestamp

修复funasr中seaco-paraformer导出onnx后没有时间戳的bug

Python 13 4 Updated Sep 12, 2024

anthropics / courses

Anthropic's educational courses

Jupyter Notebook 5,220 402 Updated Sep 18, 2024

Lightning-AI / LitServe

Lightning-fast serving engine for AI models. Flexible. Easy. Enterprise-scale.

Python 2,154 132 Updated Sep 26, 2024

kaliiiiiiiiii / undetected-playwright-python

Forked from microsoft/playwright-python

Undetected Python version of the Playwright testing and automation library.

Python 190 19 Updated May 16, 2024

cloud-org / stealth.min.js

The resulting JS file can be used in pure CDP implementations or to test the evasions in your devtools.

21 6 Updated May 30, 2022

Huanshere / VideoLingo

Netflix级字幕切割、翻译、对齐、甚至加上配音，一键全自动视频搬运AI字幕组

Python 1,948 197 Updated Sep 26, 2024

linyqh / NarratoAI

利用AI大模型，一键解说并剪辑视频； Using AI models to automatically provide commentary and edit videos with a single click.

Python 1,292 155 Updated Sep 26, 2024

yzGuu830 / efficient-speech-codec

[EMNLP 2024] ESC: Efficient Speech Coding with Cross-Scale Residual Vector Quantized Transformers

Jupyter Notebook 61 3 Updated Aug 23, 2024

astral-sh / uv

An extremely fast Python package and project manager, written in Rust.

Rust 21,073 622 Updated Sep 27, 2024

chen08209 / FlClash

A multi-platform proxy client based on ClashMeta,simple and easy to use, open-source and ad-free.

Dart 8,878 516 Updated Sep 26, 2024

modelscope / 3D-Speaker

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Python 1,104 95 Updated Aug 18, 2024

opendatalab / labelU

Data annotation toolbox supports image, audio and video data.

Python 789 71 Updated Aug 29, 2024

orange2ai / note_files

用来存放重要的公开文件

51 5 Updated Aug 16, 2024

Superheroff / douyin_uplod

抖音自动上传发布视频

Python 350 81 Updated Jun 27, 2024

innnky / MagVITS

VITS with phoneme-level prosody modeling based on MaskGIT

Python 74 7 Updated Aug 31, 2024

teableio / teable

✨ The Next Gen Airtable Alternative: No-Code Postgres

TypeScript 11,339 515 Updated Sep 26, 2024

OpenBMB / MiniCPM-V

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Python 12,066 847 Updated Sep 13, 2024

leandromoreira / digital_video_introduction

A hands-on introduction to video technology: image, video, codec (av1, vp9, h265) and more (ffmpeg encoding). Translations: 🇺🇸 🇨🇳 🇯🇵 🇮🇹 🇰🇷 🇷🇺 🇧🇷 🇪🇸

Jupyter Notebook 15,423 1,324 Updated Sep 7, 2023

jbilcke-hf / clapper

Clapper.app, a video synthesizer and sequencer designed for the age of AI cinema

TypeScript 1,994 182 Updated Sep 18, 2024

serengil / deepface

A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python

Python 11,914 2,026 Updated Sep 6, 2024

Rikorose / DeepFilterNet

Noise supression using deep filtering

Python 2,387 222 Updated Jul 31, 2024

rtvi-ai / rtvi-web-demo

Example UI implementing the RTVI web client

TypeScript 470 66 Updated Aug 8, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

lanyuer lanyuer

Achievements

Achievements

Block or report lanyuer

Stars

lifeiteng / OmniSenseVoice

kyutai-labs / moshi

openatx / uiautomator2

gojue / ecapture

JoeanAmier / TikTokDownloader

NearHuiwen / TiktokDouyinCrawler

Breakthrough / PySceneDetect

modelscope / FunASR