  • @aim-uofa & Zhejiang University
  • Hangzhou,China
Starred repositories

Dense Contrastive Learning (DenseCL) for self-supervised representation learning, CVPR 2021 Oral.

Python 545 70 Updated Dec 26, 2023

3D Reconstruction with Spatial Memory

Python 330 12 Updated Sep 17, 2024

[ICML 2024] Official code repository for 3D embodied generalist agent LEO

Python 333 30 Updated Jul 30, 2024

Official code of PatchmatchNet (CVPR 2021 Oral)

Python 501 70 Updated May 28, 2022

🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)

Python 4,637 462 Updated Sep 9, 2024
Python 752 47 Updated Aug 13, 2024

Code for "Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text".

Python 264 14 Updated Jul 19, 2024

EVA Series: Visual Representation Fantasies from BAAI

Python 2,217 162 Updated Aug 1, 2024

Awesome Embodied Navigation: Concept, Paradigm and State-of-the-arts

75 2 Updated Sep 4, 2024

🙃 A delightful community-driven (with 2,400+ contributors) framework for managing your zsh configuration. Includes 300+ optional plugins (rails, git, macOS, hub, docker, homebrew, node, php, python…

Shell 172,561 25,821 Updated Sep 19, 2024

SegRefiner: Towards Model-Agnostic Segmentation Refinement with Discrete Diffusion Process

Python 151 9 Updated Jan 21, 2024

Official code for PointOdyssey: A Large-Scale Synthetic Dataset for Long-Term Point Tracking (ICCV 2023)

Python 110 5 Updated Feb 22, 2024

Grounding Image Matching in 3D with MASt3R

Python 803 46 Updated Aug 28, 2024

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 37,267 3,915 Updated Jul 28, 2024

Code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"

Python 507 20 Updated Sep 19, 2024

[RSS 2023] Diffusion Policy: Visuomotor Policy Learning via Action Diffusion

Python 1,340 249 Updated Jul 29, 2024

An implementation of Everything of Thoughts (XoT).

Python 119 13 Updated Feb 21, 2024

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Python 3,743 302 Updated Aug 21, 2024

IRS: A Large Synthetic Indoor Robotics Stereo Dataset for Disparity and Surface Normal Estimation

Python 108 15 Updated Jun 2, 2024

Official inference library for Mistral models

Jupyter Notebook 9,531 845 Updated Sep 20, 2024
Python 1,742 54 Updated Jun 28, 2024

F3RM: Feature Fields for Robotic Manipulation. Official repo for the paper "Distilled Feature Fields Enable Few-Shot Language-Guided Manipulation" (CoRL 2023).

Python 181 17 Updated Apr 26, 2024

[SIGGRAPH Asia 2024 (Journal Track)] StableNormal: Reducing Diffusion Variance for Stable and Sharp Normal

Python 309 14 Updated Sep 11, 2024

A toolbox for benchmarking SOTA discriminative and generative geometry estimation models.

Python 46 1 Updated Aug 29, 2024

Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation

Python 3,327 268 Updated Aug 14, 2024

[ECCV 2024] ShareGPT4V: Improving Large Multi-modal Models with Better Captions

Python 115 4 Updated Jul 1, 2024

GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型

Python 4,653 368 Updated Sep 11, 2024

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 7,892 479 Updated Sep 20, 2024

An open-source implementation for training LLaVA-NeXT.

Python 243 10 Updated Jun 12, 2024

✨✨Beyond LLaVA-HD: Diving into High-Resolution Large Multimodal Models

Python 128 7 Updated Jul 23, 2024