ar-mine

Follow

🎯

Focusing

ar-mine

🎯

Focusing

Follow

3 followers · 4 following

Highlights

Pro

Starred repositories

lutzroeder / netron

Visualizer for neural network, deep learning and machine learning models

JavaScript 27,636 2,746 Updated Sep 24, 2024

AndrejOrsula / pymoveit2

Basic Python interface for MoveIt 2 built on top of ROS 2 actions and services

Python 135 53 Updated Sep 16, 2024

mst272 / LLM-Dojo

欢迎来到 LLM-Dojo，这里是一个开源大模型学习场所，使用简洁且易阅读的代码构建模型训练框架(支持各种主流模型如Qwen、Llama、GLM等等)、RLHF框架(DPO/CPO/KTO/PPO)等各种功能。👩‍🎓👨‍🎓

Python 232 17 Updated Sep 20, 2024

Meituan-AutoML / MobileVLM

Strong and Open Vision Language Assistant for Mobile Devices

Python 986 65 Updated Apr 15, 2024

YanjieZe / 3D-Diffusion-Policy

[RSS 2024] 3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representations

Python 386 35 Updated Sep 10, 2024

jgornet / predictive-coding-recovers-maps

67 3 Updated Jun 6, 2024

uml-robotics / fetchit2019

Code ran for the fetchit 2019 competition

C++ 4 2 Updated Mar 15, 2021

graspnet / graspnet-baseline

Baseline model for "GraspNet-1Billion: A Large-Scale Benchmark for General Object Grasping" (CVPR 2020)

Python 467 142 Updated Jun 13, 2024

langchain-ai / langchain

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 92,627 14,824 Updated Sep 24, 2024

longzw1997 / Open-GroundingDino

This is the third party implementation of the paper Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection.

Python 392 62 Updated Jun 25, 2024

IDEA-Research / GroundingDINO

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 6,345 658 Updated Aug 12, 2024

mbzuai-oryx / Video-LLaVA

PG-Video-LLaVA: Pixel Grounding in Large Multimodal Video Models

Python 237 11 Updated Jan 2, 2024

octo-models / octo

Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.

Python 773 151 Updated Jul 31, 2024

thuml / iVideoGPT

Official repo for "iVideoGPT: Interactive VideoGPTs are Scalable World Models", https://arxiv.org/abs/2405.15223

Python 59 3 Updated Sep 1, 2024

MrNeRF / awesome-3D-gaussian-splatting

Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.

5,706 329 Updated Sep 19, 2024

Jumpat / SegAnyGAussians

The official implementation of SAGA (Segment Any 3D GAussians)

Jupyter Notebook 554 40 Updated Jun 21, 2024

hkchengrex / XMem

[ECCV 2022] XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model

Python 1,725 191 Updated Mar 15, 2024

OpenGVLab / VisionLLM

VisionLLM Series

Python 852 20 Updated Sep 13, 2024

IDEA-Research / Grounded-Segment-Anything

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Jupyter Notebook 14,811 1,372 Updated Sep 5, 2024

mees / calvin

CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks

Python 365 56 Updated Sep 1, 2024

askforalfred / alfred

ALFRED - A Benchmark for Interpreting Grounded Instructions for Everyday Tasks

C 364 79 Updated Jul 25, 2024

LargeWorldModel / LWM

Python 7,088 549 Updated Aug 12, 2024

OpenGVLab / Instruct2Act

Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model

Python 323 20 Updated Jun 23, 2024

facebookresearch / r3m

Pre-training Reusable Representations for Robotic Manipulation Using Diverse Human Video Data

Python 285 45 Updated Mar 21, 2023

GT-RIPL / Awesome-LLM-Robotics

A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites

2,785 228 Updated Sep 9, 2024

openai / openai-cookbook

Examples and guides for using the OpenAI API

MDX 58,662 9,309 Updated Sep 23, 2024

google-deepmind / open_x_embodiment

Jupyter Notebook 785 54 Updated Aug 21, 2024

DAMO-NLP-SG / Video-LLaMA

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Python 2,722 242 Updated Jun 4, 2024

DirtyHarryLYL / LLM-in-Vision

Recent LLM-based CV and related works. Welcome to comment/contribute!

828 35 Updated Jun 5, 2024

UZ-SLAMLab / ORB_SLAM3

ORB-SLAM3: An Accurate Open-Source Library for Visual, Visual-Inertial and Multi-Map SLAM

C++ 6,425 2,526 Updated Jul 24, 2024

Starred topics

Ubuntu

Tensorflow

Raspberry Pi

Python

Minecraft

Deep learning

C

C++