Skip to content
View OedoSoldier's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Block or report OedoSoldier

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

拷贝漫画的第三方APP,优化阅读/下载体验

Kotlin 2,167 54 Updated Sep 15, 2024

LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models

Python 87 5 Updated May 15, 2024

The official GitHub page for ''Evaluating Object Hallucination in Large Vision-Language Models''

Python 166 6 Updated Mar 25, 2024

OMG-LLaVA and OMG-Seg codebase

Python 1,226 47 Updated Aug 16, 2024

Control for Universal Robots with python Dashboard, RealTime Interfaces, RTDE (to be discussed). If there is anything specific that needs to be done - suggest it to the discussion.

Python 6 2 Updated Jul 16, 2024

ROS2 Interface for Universal Robot CoBots Control with ur_rtde (Python, C++)

C++ 8 3 Updated Sep 3, 2024

[SIGGRAPH Asia 2022] Assemble Them All: Physics-Based Planning for Generalizable Assembly by Disassembly

C++ 139 15 Updated Mar 18, 2024

Data, tools, and documentation of the Fusion 360 Gallery Dataset

Jupyter Notebook 422 51 Updated Apr 23, 2022

Universal Manipulation Interface: In-The-Wild Robot Teaching Without In-The-Wild Robots

Python 608 118 Updated Sep 15, 2024

A Web based GUI for Universal Robots UR5 industrial robot

JavaScript 25 6 Updated Jul 17, 2020

📖 A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).

366 10 Updated Sep 15, 2024

[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization

Python 540 57 Updated Jun 7, 2024

Official PyTorch Implementation of Seeing the Image: Prioritizing Visual Correlation by Contrastive Alignment

Python 44 2 Updated Jun 13, 2024

[CVPR 2024] Alpha-CLIP: A CLIP Model Focusing on Wherever You Want

Jupyter Notebook 642 38 Updated Jul 30, 2024

使用 OpenAI 兼容接口自动生成小说、漫画、字幕、游戏脚本等内容文本中实体词语表的翻译辅助工具

Python 117 6 Updated Sep 19, 2024

适配轻小说/Galgame的日中翻译大模型

Python 2,211 75 Updated Aug 9, 2024

轻小说机翻网站,支持网络小说/文库小说/本地小说

Kotlin 355 37 Updated Sep 18, 2024

Official Algorithm Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"

Python 753 85 Updated Apr 18, 2024

Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Python 2,856 206 Updated Jul 27, 2024
Python 40 1 Updated May 6, 2024

Code for the paper "Disentangled Generative Models for Robust Prediction of System Dynamics"

Jupyter Notebook 14 3 Updated May 2, 2023

LayerDiffuse in pure diffusers without any GUI

Python 307 25 Updated Jun 16, 2024

Continuation of Clash Verge - A Clash Meta GUI based on Tauri (Windows, MacOS, Linux)

TypeScript 33,063 2,539 Updated Sep 19, 2024

A curated list of awesome papers on Embodied AI and related research/industry-driven resources.

242 5 Updated Jul 26, 2024

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Python 4,376 427 Updated Jul 30, 2024

Harnessing 1.4M GPT4V-synthesized Data for A Lite Vision-Language Model

Python 239 8 Updated Jun 25, 2024

Official implementation of "DreamPose: Fashion Image-to-Video Synthesis via Stable Diffusion"

Python 961 74 Updated Nov 2, 2023

Cobra: Extending Mamba to Multi-modal Large Language Model for Efficient Inference

Python 245 8 Updated Aug 19, 2024
Next