-
Shanghai Jiao Tong University
- China
Lists (1)
Sort Name ascending (A-Z)
Stars
The codes for the paper of "A particle swarm optimization-based flexible convolutional auto-encoder for image classification" published by TNNLS
⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。
High-resolution models for human tasks.
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑🔬
[ICPR 2024] Exemplar-free continual deepfake detector that leverages CLIP and domain-specific multi-modal prompts
上海交通大学 LaTeX 论文模板 | Shanghai Jiao Tong University LaTeX Thesis Template
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Implementation of the paper "DeepLSD: Line Segment Detection and Refinement with Deep Image Gradients"
[CVPR 2023 Highlight] Perspective Fields for Single Image Camera Calibration
Code and pre-trained models for our paper "CLIPping the Deception: Adapting Vision-Language Models for Universal Deepfake Detection".
DiffusionFace: Towards a Comprehensive Dataset for Diffusion-Based Face Forgery Analysis
[CVPR 2024] Shadows Don’t Lie and Lines Can’t Bend! Generative Models don’t know Projective Geometry...for now
Official Pytorch Implementation for “DINO-Tracker: Taming DINO for Self-Supervised Point Tracking in a Single Video”
Code for the paper: Learning on Gradients: Generalized Artifacts Representation for GAN-Generated Images Detection
MARS5 speech model (TTS) from CAMB.AI
MaskSim: Detection of synthetic images by masked spectrum similarity analysis
The official implementation for LAA-Net: Localized Artifact Attention Network for Quality-Agnostic and Generalizable Deepfake Detection
Official implementation for CVPR2023 Paper "Re-IQA : Unsupervised Learning for Image Quality Assessment in the Wild"
[WACV 2024 Oral] - ARNIQA: Learning Distortion Manifold for Image Quality Assessment
[CVPR2023] Blind Image Quality Assessment via Vision-Language Correspondence: A Multitask Learning Perspective
A generative speech model for daily dialogue.
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
(ICCV'23) Learning to Upsample by Learning to Sample