Skip to content
View tanhuajie's full-sized avatar

Block or report tanhuajie

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[NeurIPS 2024] Official code repository for MSR3D paper

10 Updated Sep 27, 2024

A ROS wrapper of the AprilTag 3 visual fiducial detector

C++ 364 340 Updated Jun 23, 2024

ROS2 node for AprilTag detection

C++ 152 92 Updated Jul 7, 2024

[ICLR 2023] SQA3D for embodied scene understanding and reasoning

Python 117 3 Updated Oct 13, 2023

Official implementation of ECCV24 paper "SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding"

Python 172 2 Updated Sep 29, 2024

[ICML 2024] Official code repository for 3D embodied generalist agent LEO

Python 343 30 Updated Jul 30, 2024

[CVPR 2024 & NeurIPS 2024] EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI

Python 460 34 Updated Sep 12, 2024

中文nlp解决方案(大模型、数据、模型、训练、推理)

Jupyter Notebook 2,871 358 Updated Sep 26, 2024

Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conversations}. Don't let the poverty limit your imagination! Tr…

Jupyter Notebook 359 20 Updated Sep 24, 2024

Code implementation for paper titled "HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision"

Python 17 2 Updated Apr 16, 2024

Democratization of RT-2 "RT-2: New model translates vision and language into action"

Python 356 47 Updated Jul 26, 2024

Benchmarking Knowledge Transfer in Lifelong Robot Learning

Jupyter Notebook 203 31 Updated Aug 22, 2024

Official codebase for "Any-point Trajectory Modeling for Policy Learning"

Python 152 16 Updated Aug 20, 2024

Ego4D Goal-Step: Toward Hierarchical Understanding of Procedural Activities (NeurIPS 2023)

Python 35 Updated Apr 15, 2024

[CoRL2024] Official repo of `A3VLM: Actionable Articulation-Aware Vision Language Model`

Python 65 3 Updated Sep 15, 2024

Pointcept: a codebase for point cloud perception research. Latest works: PTv3 (CVPR'24 Oral), PPT (CVPR'24), OA-CNNs (CVPR'24), MSC (CVPR'23)

Python 1,516 163 Updated Sep 7, 2024

[ICML 2024] 3D-VLA: A 3D Vision-Language-Action Generative World Model

Python 321 11 Updated Jul 9, 2024

[IROS24 Oral]ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models

Python 63 3 Updated Aug 22, 2024

[Embodied-AI-Survey-2024] Paper list and projects for Embodied AI

551 37 Updated Sep 26, 2024

Cobra: Extending Mamba to Multi-modal Large Language Model for Efficient Inference

Python 248 8 Updated Aug 19, 2024

Bullet Physics SDK: real-time collision detection and multi-physics simulation for VR, games, visual effects, robotics, machine learning etc.

C++ 12,486 2,867 Updated Aug 8, 2024

Gym-Styled UR5 arm with Robotiq-85 / 140 gripper in Bullet simulator

Python 199 34 Updated Apr 18, 2022

✨✨Latest Papers on Vision Mamba and Related Areas

187 10 Updated Sep 26, 2024

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Python 2,852 185 Updated Sep 19, 2024

Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone

Python 732 41 Updated Sep 9, 2024

Official Algorithm Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"

Python 761 85 Updated Apr 18, 2024
Python 15 Updated Jun 24, 2024

LLaRA: Large Language and Robotics Assistant

Python 141 3 Updated Sep 3, 2024
Next