Shanghai Jiao Tong University
Shanghai, China
main.nuowen.pro
Stars
The official repository of Compact 3D Gaussian Representation for Radiance Field
Octree-GS: Towards Consistent Real-time Rendering with LOD-Structured 3D Gaussians
A multi-sensor capture system for free viewpoint video.
Code repo for paper "Low Latency Point Cloud Rendering with Learned Splatting", CVPRW 2024.
Implementation of Decision Transformer, Conservative Q-Learning, and Behavior Cloning in the offline reinforcement learning setting
Official implementation for "Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size", NeurIPS 2022, Offline RL Workshop
DT6A / ReBRAC
Forked from tinkoff-ai/ReBRAC
Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC
[ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization"
Codes accompanying the paper "Score Regularized Policy Optimization through Diffusion Behavior" (ICLR 2024).
An offline deep reinforcement learning library
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
Open standard for machine learning interoperability
CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making
Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)
A DDPM implementation in Jax for continuous space modeling.
JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.
Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
[NeurIPS 2023] T2T: From Distribution Learning in Training to Gradient Search in Testing for Combinatorial Optimization
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
The source code of team Schaferct in 2nd Bandwidth Prediction of MMSys'24.
This gym leverages NS3 and WebRTC, which can be used by reinforcement learning or other methods to build a Bandwidth Controller for WebRTC