Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conversations}. Don't let the poverty limit your imagination! Tr…

Jupyter Notebook 359 20 Updated Sep 24, 2024

Sid2697 / HOI-Ref

Code implementation for paper titled "HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision"

Python 17 2 Updated Apr 16, 2024

kyegomez / RT-2

Democratization of RT-2 "RT-2: New model translates vision and language into action"

Python 356 47 Updated Jul 26, 2024

EmbodiedGPT / EmbodiedGPT_Pytorch

Python 330 32 Updated Apr 26, 2024

Lifelong-Robot-Learning / LIBERO

Benchmarking Knowledge Transfer in Lifelong Robot Learning

Jupyter Notebook 203 31 Updated Aug 22, 2024

Large-Trajectory-Model / ATM

Official codebase for "Any-point Trajectory Modeling for Policy Learning"

Python 152 16 Updated Aug 20, 2024

facebookresearch / ego4d-goalstep

Ego4D Goal-Step: Toward Hierarchical Understanding of Procedural Activities (NeurIPS 2023)

Python 35 Updated Apr 15, 2024

changhaonan / A3VLM

[CoRL2024] Official repo of `A3VLM: Actionable Articulation-Aware Vision Language Model`

Python 65 3 Updated Sep 15, 2024

Pointcept / Pointcept

Pointcept: a codebase for point cloud perception research. Latest works: PTv3 (CVPR'24 Oral), PPT (CVPR'24), OA-CNNs (CVPR'24), MSC (CVPR'23)

Python 1,516 163 Updated Sep 7, 2024

UMass-Foundation-Model / 3D-VLA

[ICML 2024] 3D-VLA: A 3D Vision-Language-Action Generative World Model

Python 321 11 Updated Jul 9, 2024

SiyuanHuang95 / ManipVQA

[IROS24 Oral]ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models

Python 63 3 Updated Aug 22, 2024

EmbodiedGPT / EgoCOT_Dataset

36 Updated Apr 4, 2024

HCPLab-SYSU / Embodied_AI_Paper_List

[Embodied-AI-Survey-2024] Paper list and projects for Embodied AI

551 37 Updated Sep 26, 2024

h-zhao1997 / cobra

Cobra: Extending Mamba to Multi-modal Large Language Model for Efficient Inference

Python 248 8 Updated Aug 19, 2024

bulletphysics / bullet3

Bullet Physics SDK: real-time collision detection and multi-physics simulation for VR, games, visual effects, robotics, machine learning etc.

C++ 12,486 2,867 Updated Aug 8, 2024

ElectronicElephant / pybullet_ur5_robotiq

Gym-Styled UR5 arm with Robotiq-85 / 140 gripper in Bullet simulator

Python 199 34 Updated Apr 18, 2022

ReaFly / Awesome-Vision-Mamba

✨✨Latest Papers on Vision Mamba and Related Areas

187 10 Updated Sep 26, 2024

hustvl / Vim

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Python 2,852 185 Updated Sep 19, 2024

NVlabs / MambaVision

Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone

Python 732 41 Updated Sep 9, 2024

vimalabs / VIMA

Official Algorithm Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"

Python 761 85 Updated Apr 18, 2024

notFoundThisPerson / RoboCAS-v0

Python 15 Updated Jun 24, 2024

LostXine / LLaRA

LLaRA: Large Language and Robotics Assistant

Python 141 3 Updated Sep 3, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

tanhuajie tanhuajie

Block or report tanhuajie

Stars

MSR3D / MSR3D

AprilRobotics / apriltag_ros

christianrauch / apriltag_ros

SilongYong / SQA3D

scene-verse / SceneVerse

embodied-generalist / embodied-generalist

OpenRobotLab / EmbodiedScan

yuanzhoulvpi2017 / zero_nlp

Coobiw / MPP-LLaVA