- Stony Brook University
- NY
- www3.cs.stonybrook.edu/~kkahatapitiy/
- @kkahatapitiy
Stars
Official inference repo for FLUX.1 models
Code for paper "VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos"
[CVPR2024 Highlight] VBench: Comprehensive Benchmark Suite for Video Generative Models
Theia: Distilling Diverse Vision Foundation Models for Robot Learning
Latte: Latent Diffusion Transformer for Video Generation.
[ECCV 2024] Official Implementation of CoPT: Unsupervised Domain Adaptive Segmentation using Domain-Agnostic Text Embeddings
Official repo for AIGCBench: Comprehensive Evaluation of Image-to-Video Content Generated by AI
VideoSys: An easy and efficient system for video generation
GIF encoder based on libimagequant (pngquant). Squeezes maximum possible quality from the awful GIF format.
[NeurIPS 2021 Spotlight] Official code for "Focal Self-attention for Local-Global Interactions in Vision Transformers"
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.
Open-Sora: Democratizing Efficient Video Production for All
Lumina-T2X is a unified framework for Text to Any Modality Generation
Stable Diffusion web UI
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
[CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
Code for LifelongMemory: Leveraging LLMs for Answering Queries in Long-form Egocentric Videos
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
An unofficial PyTorch dataloader for Open X-Embodiment Datasets https://github.com/google-deepmind/open_x_embodiment
Official code for "FeatUp: A Model-Agnostic Framework for Features at Any Resolution" (ICLR 2024)
Code for our WACV 2021 paper "Exploiting the Redundancy in Convolutional Filters for Parameter Reduction"
Code for our AAAI 2023 paper "Weakly-guided Self-supervised Pretraining for Temporal Activity Detection"