Stars
The world's simplest facial recognition api for Python and the command line
基于人脸识别的课堂考勤系统v2.0
A CNN based pytorch implementation on facial expression recognition (FER2013 and CK+), achieving 73.112% (state-of-the-art) in FER2013 and 94.64% in CK+ dataset
A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具,使用你的音色或任意声音来录制音频
A powerful baseline for image classification and face recognition with Pytorch
Real-Time and Accurate Full-Body Multi-Person Pose Estimation&Tracking System
智慧教室监考系统,作弊检测和考生点名功能(智慧教室的最后一个项目,尝试使用c++部署算法,使用TensorRT进行加速)
利用Pytorch设计完成的基于卷积神经网络实现的面部表情识别项目 —— A facial expression recognition project based on convolution neural network designed by Pytorch 【Plus版本】:https://github.com/hexiang10/face-recognition-plus
FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, le…
Speech To Speech: an effort for an open-sourced and modular GPT4-o
A powerful tool that translates ComfyUI workflows into executable Python code.
A set of nodes for ComfyUI that can composite layer and mask to achieve Photoshop like functionality.
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Unofficial implementation of BRIA RMBG Model for ComfyUI
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
Official PyTorch implementation for TOMM24 "SwinShadow: Shifted Window for Ambiguous Adjacent Shadow Detection"
Official Implementation of Web-based Visual Corpus Builder (Webvicob), ICDAR 2023
基于大模型搭建的聊天机器人,同时支持 微信公众号、企业微信应用、飞书、钉钉 等接入,可选择GPT3.5/GPT-4o/GPT-o1/ Claude/文心一言/讯飞星火/通义千问/ Gemini/GLM-4/Claude/Kimi/LinkAI,能处理文本、语音和图片,访问操作系统和互联网,支持基于自有知识库进行定制企业智能客服。
A clean customizable documentation theme for Sphinx
My implementation of "Patch n’ Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution"
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
Get bibtex of multiple references in a single line text, by python scraping Google Scholar.