OedoSoldier

🎯

Focusing

OedoSoldier OedoSoldier

🎯

Focusing

Ph.D. student at UCAS

172 followers · 0 following

Achievements

Highlights

Starred repositories

fumiama / copymanga

拷贝漫画的第三方APP，优化阅读/下载体验

Kotlin 2,167 54 Updated Sep 15, 2024

42Shawn / LLaVA-PruMerge

LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models

Python 87 5 Updated May 15, 2024

RUCAIBox / POPE

Forked from AoiDragon/POPE

The official GitHub page for ''Evaluating Object Hallucination in Large Vision-Language Models''

Python 166 6 Updated Mar 25, 2024

lxtGH / OMG-Seg

OMG-LLaVA and OMG-Seg codebase

Python 1,226 47 Updated Aug 16, 2024

hganchev / python-ur-control

Control for Universal Robots with python Dashboard, RealTime Interfaces, RTDE (to be discussed). If there is anything specific that needs to be done - suggest it to the discussion.

Python 6 2 Updated Jul 16, 2024

SuperDiodo / ur_ros_rtde

ROS2 Interface for Universal Robot CoBots Control with ur_rtde (Python, C++)

C++ 8 3 Updated Sep 3, 2024

yunshengtian / Assemble-Them-All

[SIGGRAPH Asia 2022] Assemble Them All: Physics-Based Planning for Generalizable Assembly by Disassembly

C++ 139 15 Updated Mar 18, 2024

AutodeskAILab / Fusion360GalleryDataset

Data, tools, and documentation of the Fusion 360 Gallery Dataset

Jupyter Notebook 422 51 Updated Apr 23, 2022

real-stanford / universal_manipulation_interface

Universal Manipulation Interface: In-The-Wild Robot Teaching Without In-The-Wild Robots

Python 608 118 Updated Sep 15, 2024

CRH380B-6216L / ur5_webgui

A Web based GUI for Universal Robots UR5 industrial robot

JavaScript 25 6 Updated Jul 17, 2020

liuzhao1225 / YouDub-webui

Python 1,768 186 Updated May 14, 2024

showlab / Awesome-MLLM-Hallucination

📖 A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).

366 10 Updated Sep 15, 2024

FoundationVision / Groma

[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization

Python 540 57 Updated Jun 7, 2024

foundation-multimodal-models / CAL

Official PyTorch Implementation of Seeing the Image: Prioritizing Visual Correlation by Contrastive Alignment

Python 44 2 Updated Jun 13, 2024

SunzeY / AlphaCLIP

[CVPR 2024] Alpha-CLIP: A CLIP Model Focusing on Wherever You Want

Jupyter Notebook 642 38 Updated Jul 30, 2024

neavo / KeywordGacha

使用 OpenAI 兼容接口自动生成小说、漫画、字幕、游戏脚本等内容文本中实体词语表的翻译辅助工具

Python 117 6 Updated Sep 19, 2024

SakuraLLM / SakuraLLM

适配轻小说/Galgame的日中翻译大模型

Python 2,211 75 Updated Aug 9, 2024

FishHawk / auto-novel

轻小说机翻网站，支持网络小说/文库小说/本地小说

Kotlin 355 37 Updated Sep 18, 2024

vimalabs / VIMA

Official Algorithm Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"

Python 753 85 Updated Apr 18, 2024

PKU-YuanGroup / Video-LLaVA

Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Python 2,856 206 Updated Jul 27, 2024

DCDmllm / MorphTokens

Python 40 1 Updated May 6, 2024

stathius / sd-vae

Code for the paper "Disentangled Generative Models for Robust Prediction of System Dynamics"

Jupyter Notebook 14 3 Updated May 2, 2023

lllyasviel / LayerDiffuse_DiffusersCLI

LayerDiffuse in pure diffusers without any GUI

Python 307 25 Updated Jun 16, 2024

clash-verge-rev / clash-verge-rev

Continuation of Clash Verge - A Clash Meta GUI based on Tauri (Windows, MacOS, Linux)

TypeScript 33,063 2,539 Updated Sep 19, 2024

yunlongdong / Awesome-Embodied-AI

245 19 Updated Apr 15, 2024

haoranD / Awesome-Embodied-AI

A curated list of awesome papers on Embodied AI and related research/industry-driven resources.

242 5 Updated Jul 26, 2024

AILab-CVC / YOLO-World

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Python 4,376 427 Updated Jul 30, 2024

FreedomIntelligence / ALLaVA

Harnessing 1.4M GPT4V-synthesized Data for A Lite Vision-Language Model

Python 239 8 Updated Jun 25, 2024

johannakarras / DreamPose

Official implementation of "DreamPose: Fashion Image-to-Video Synthesis via Stable Diffusion"

Python 961 74 Updated Nov 2, 2023

h-zhao1997 / cobra

Cobra: Extending Mamba to Multi-modal Large Language Model for Efficient Inference

Python 245 8 Updated Aug 19, 2024

Starred topics

fine-grained-classification