Stars
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
Yet another paper list on 3D vision | Robotics | Embodied AI.
✨✨Latest Advances on Multimodal Large Language Models
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
Awesome deliberative prompting: How to ask LLMs to produce reliable reasoning and make reason-responsive decisions.
Awesome-LLM: a curated list of Large Language Model resources
A comprehensive list of Implicit Representations and NeRF papers relating to Robotics/RL domain, including papers, codes, and related websites
A curated list of 3D Vision papers relating to Robotics domain in the era of large models i.e. LLMs/VLMs, inspired by awesome-computer-vision, including papers, codes, and related websites
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
[CVPR 2024] "LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning"; an interactive Large Language 3D Assistant.
[ECCV 2024] Empowering 3D Visual Grounding with Reasoning Capabilities
Code & data for Grounded 3D-LLM with Referent Tokens
Awesome-LLM-3D: a curated list of Multi-modal Large Language Model resources for the 3D world
Leveraging Large Language Models for Visual Target Navigation
[CVPR 2024] 🏡Know Your Neighbors: Improving Single-View Reconstruction via Spatial Vision-Language Reasoning
[CVPR 2024] EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI
The Replica Dataset v1 as published in https://arxiv.org/abs/1906.05797.
[CVPR'24 Oral] Official repository of Point Transformer V3 (PTv3)
This is the source code of Part2Word: Learning Joint Embedding of Point Clouds and Text by Bidirectional Matching between Parts and Words
[ICML 2024] Official code repository for 3D embodied generalist agent LEO
(CVPR 2023) PLA: Language-Driven Open-Vocabulary 3D Scene Understanding & (CVPR2024) RegionPLC: Regional Point-Language Contrastive Learning for Open-World 3D Scene Understanding
[CVPR'23] OpenScene: 3D Scene Understanding with Open Vocabularies
[ECCV'20] Structured3D: A Large Photo-realistic Dataset for Structured 3D Modeling
A shifted-window-based transformer for 3D sparse tasks
Pointcept: a codebase for point cloud perception research. Latest works: PTv3 (CVPR'24 Oral), PPT (CVPR'24), OA-CNNs (CVPR'24), MSC (CVPR'23)
An open-source codebase for exploring autonomous driving pre-training
[NeurIPS'22] An official PyTorch implementation of PTv2.