Stars
自动化上传视频到社交媒体:抖音、小红书、视频号、tiktok、youtube、bilibili
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Robust Speech Recognition via Large-Scale Weak Supervision
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
一个拍照做题程序。输入一张包含数学计算题的图片,输出识别出的数学计算式以及计算结果。This is a mathematic expression recognition project.
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
基于Dify的企业微信知识库机器人,基于企微gpt知识库的bot机器人,能够自动回复企业微信中收到的消息。这个机器人能够处理私聊和群聊,还可以记住与用户的聊天内容,从而做出更加贴合上下文的回应。此外,您还可以设置白名单来控制机器人与哪些用户或群组交互。如需自己dify网站版的机器人WX:aiwis99
AnimateDiff for AUTOMATIC1111 Stable Diffusion WebUI
CodeGeeX2: A More Powerful Multilingual Code Generation Model
SoftVC VITS Singing Voice Conversion
Llama中文社区,Llama3在线体验和微调模型已开放,实时汇总最新Llama3学习资料,已将所有代码更新适配Llama3,构建最好的中文Llama大模型,完全开源可商用
基于企微gpt知识库的bot机器人,能够自动回复企业微信中收到的消息。这个机器人能够处理私聊和群聊,还可以记住与用户的聊天内容,从而做出更加贴合上下文的回应。此外,您还可以设置白名单来控制机器人与哪些用户或群组交互。如需对接自己的知识库网站WX:aiwis99
FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, le…
FlagAI (Fast LArge-scale General AI models) is a fast, easy-to-use and extensible toolkit for large-scale model.
[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
🚀🎬 ShortGPT - Experimental AI framework for youtube shorts / tiktok channel automation
FastAPI-Amis-Admin is a high-performance, efficient and easily extensible FastAPI admin framework. Inspired by django-admin, and has as many powerful functions as django-admin.
Simple and extensible administrative interface framework for Flask
Ready-to-use SRT / WebRTC / RTSP / RTMP / LL-HLS media server and media proxy that allows to read, publish, proxy, record and playback video and audio streams.
The most comprehensive database of Chinese poetry 🧶最全中华古诗词数据库, 唐宋两朝近一万四千古诗人, 接近5.5万首唐诗加26万宋诗. 两宋时期1564位词人,21050首词。
high performance coding with golang(Go 语言高性能编程,Go 语言陷阱,Gotchas,Traps)
Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Simplify declarative cloud-native development base on FastAPI and gRPC. https://bali-framework.github.io/bali/
可自动优化提示、免费开源、全平台傻瓜式 ChatGPT 本地客户端,支持断点续聊、修改历史对话、本地聊天记录存储导入导出、添加自己的 apikey