-
Ph.D. student of Mechanical Engineering Department of Penn State University
- State College, PA, US
Stars
Interact with your documents using the power of GPT, 100% privately, no data leaks
🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
A Deep Learning based project for colorizing and restoring old images (and video!)
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
High-Resolution 3D Human Digitization from A Single Image.
Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.
A collaboration friendly studio for NeRFs
PyTorch implementation of the U-Net for image semantic segmentation with high quality images
all kinds of text classification models and more with deep learning
Real-Time High-Resolution Background Matting
Open Source Image and Video Restoration Toolbox for Super-resolution, Denoise, Deblurring, etc. Currently, it includes EDSR, RCAN, SRResNet, SRGAN, ESRGAN, EDVR, BasicVSR, SwinIR, ECBSR, etc. Also …
Pretrained Pytorch face detection (MTCNN) and facial recognition (InceptionResnet) models
Code for robust monocular depth estimation described in "Ranftl et. al., Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer, TPAMI 2022"
An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).
A collection of loss functions for medical image segmentation
A repository for storing models that have been inter-converted between various frameworks. Supported frameworks are TensorFlow, PyTorch, ONNX, OpenVINO, TFJS, TFTRT, TensorFlowLite (Float32/16/INT8…
Python library for loading and using triangular meshes.
FFCV: Fast Forward Computer Vision (and other ML workloads!)
Flops counter for convolutional networks in pytorch framework
Summary of related papers on visual attention. Related code will be released based on Jittor gradually.
A procedural Blender pipeline for photorealistic training image generation
YOLO5Face: Why Reinventing a Face Detector (https://arxiv.org/abs/2105.12931) ECCV Workshops 2022)
A Unified Framework for Surface Reconstruction
This repository contains the code for the paper "PIFu: Pixel-Aligned Implicit Function for High-Resolution Clothed Human Digitization"