Skip to content
View kkahatapitiya's full-sized avatar

Block or report kkahatapitiya

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official inference repo for FLUX.1 models

Python 13,958 991 Updated Sep 13, 2024

Code for paper "VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos"

Python 70 3 Updated Aug 6, 2024

[CVPR2024 Highlight] VBench - We Evaluate Video Generation

Python 495 23 Updated Sep 20, 2024
Python 13 1 Updated Jul 31, 2024

Theia: Distilling Diverse Vision Foundation Models for Robot Learning

Python 141 6 Updated Aug 30, 2024
Python 103 5 Updated Sep 23, 2024

Latte: Latent Diffusion Transformer for Video Generation.

Python 1,643 174 Updated Sep 18, 2024

[ECCV 2024] Official Implementation of CoPT: Unsupervised Domain Adaptive Segmentation using Domain-Agnostic Text Embeddings

3 Updated Jul 16, 2024

Official repo for AIGCBench: Comprehensive Evaluation of Image-to-Video Content Generated by AI

Python 28 Updated Jan 30, 2024

VideoSys: An easy and efficient system for video generation

Python 1,649 111 Updated Sep 23, 2024

GIF encoder based on libimagequant (pngquant). Squeezes maximum possible quality from the awful GIF format.

Rust 4,737 140 Updated Aug 31, 2024

[NeurIPS 2021 Spotlight] Official code for "Focal Self-attention for Local-Global Interactions in Vision Transformers"

Python 545 59 Updated Mar 27, 2022

Let us control diffusion models!

Python 29,830 2,693 Updated Feb 25, 2024

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Python 11,281 1,007 Updated Sep 20, 2024

LLaRA: Large Language and Robotics Assistant

Python 138 3 Updated Sep 3, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 21,678 2,091 Updated Aug 9, 2024

Lumina-T2X is a unified framework for Text to Any Modality Generation

Python 2,030 85 Updated Aug 6, 2024

Stable Diffusion web UI

Python 139,996 26,518 Updated Sep 9, 2024

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 6,035 535 Updated May 31, 2024

[CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts

Python 278 20 Updated Jul 17, 2024

Create Magic Story!

Jupyter Notebook 5,806 577 Updated Jul 24, 2024

Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Python 2,860 207 Updated Jul 27, 2024

The official Meta Llama 3 GitHub site

Python 26,228 2,954 Updated Aug 12, 2024

Code for LifelongMemory: Leveraging LLMs for Answering Queries in Long-form Egocentric Videos

Python 13 Updated May 17, 2024

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Jupyter Notebook 14,806 1,370 Updated Sep 5, 2024

An unofficial pytorch dataloader for Open X-Embodiment Datasets https://github.com/google-deepmind/open_x_embodiment

Python 9 1 Updated Apr 18, 2024

Multimodal Video Understanding Framework (MVU)

Python 23 Updated May 15, 2024

Official code for "FeatUp: A Model-Agnostic Frameworkfor Features at Any Resolution" ICLR 2024

Jupyter Notebook 1,334 78 Updated Jun 28, 2024

Code for our WACV 2021 paper "Exploiting the Redundancy in Convolutional Filters for Parameter Reduction"

Python 9 2 Updated Jan 6, 2021

Code for our AAAI 2023 paper "Weakly-guided Self-supervised Pretraining for Temporal Activity Detection"

Python 9 Updated Feb 5, 2023
Next