Stars
Omni SenseVoice: High-Speed Speech Recognition with word timestamps 🗣️🎯
Capturing SSL/TLS plaintext without a CA certificate using eBPF. Supported on Linux/Android kernels for amd64/arm64.
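Data collection tool for TikTok profiles/collections/live streams/videos/image posts/original sounds, and for Douyin profiles/videos/image posts/favorites/live streams/original sounds/collections/comments/accounts/search/trending lists
Scraper for international TikTok and mainland China Douyin, with the a-bogus and x-bogus algorithms reverse-engineered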
🎥 Python and OpenCV-based scene cut/transition detection program & library.
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Fixes the bug in funasr where seaco-paraformer produces no timestamps after being exported to ONNX
Anthropic's educational courses
Lightning-fast serving engine for AI models. Flexible. Easy. Enterprise-scale.
Undetected Python version of the Playwright testing and automation library.
The resulting JS file can be used in pure CDP implementations or to test the evasions in your devtools.
Netflix-level subtitle segmentation, translation, alignment, and even dubbing: a one-click, fully automated AI subtitle team for video localization
Using large AI models to automatically provide commentary and edit videos with a single click.
[EMNLP 2024] ESC: Efficient Speech Coding with Cross-Scale Residual Vector Quantized Transformers
An extremely fast Python package and project manager, written in Rust.
A multi-platform proxy client based on ClashMeta, simple and easy to use, open-source and ad-free.
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
Data annotation toolbox that supports image, audio, and video data.
VITS with phoneme-level prosody modeling based on MaskGIT
✨ The Next Gen Airtable Alternative: No-Code Postgres
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
A hands-on introduction to video technology: image, video, codec (av1, vp9, h265) and more (ffmpeg encoding). Translations: 🇺🇸 🇨🇳 🇯🇵 🇮🇹 🇰🇷 🇷🇺 🇧🇷 🇪🇸
Clapper.app, a video synthesizer and sequencer designed for the age of AI cinema
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
Noise suppression using deep filtering
Example UI implementing the RTVI web client