Stars
Noise supression using deep filtering
Open Source Autonomous Software Development System
Multilingual Voice Understanding Model
🔥🕷️ Crawl4AI: Open-source LLM Friendly Web Crawler & Scrapper
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks
本项目基于SadTalkers实现视频唇形合成的Wav2lip。通过以视频文件方式进行语音驱动生成唇形,设置面部区域可配置的增强方式进行合成唇形(人脸)区域画面增强,提高生成唇形的清晰度。使用DAIN 插帧的DL算法对生成视频进行补帧,补充帧间合成唇形的动作过渡,使合成的唇形更为流畅、真实以及自然。
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Crawl a site to generate knowledge files to create your own custom GPT from a URL
[TOG 2023] HAvatar: High-fidelity Head Avatar via Facial Model ConditionedNeural Radiance Field
Drag & drop UI to build your customized LLM flow
A guidance language for controlling large language models.
Your friendliest open source all-in-one automation tool ✨ Workflow automation tool 200+ integration / Enterprise automation tool / Zapier Alternative
Deploying n8n on Render (render.com) hosting, using separate Web Service (with Docker and Persistent Disk Storage) + Postgres DB.