confiwent

🎣

Focusing

Nuowen Kan confiwent

PhD at Shanghai Jiao Tong University

26 followers · 10 following

Shanghai Jiao Tong University
Shanghai, China
main.nuowen.pro

Stars

rl

14 repositories

tinkoff-ai / CORL

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Python · 1,063 stars · 124 forks · Updated Aug 3, 2023

kzl / decision-transformer

Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.

Python · 2,334 stars · 441 forks · Updated Apr 29, 2024

philippe-eecs / IDQL

Repo for Implicit Diffusion Q-Learning

Python · 86 stars · 11 forks · Updated Dec 5, 2023

ikostrikov / jaxrl

JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.

Jupyter Notebook · 609 stars · 65 forks · Updated Oct 26, 2022

philippe-eecs / JaxDDPM

A DDPM implementation in Jax for continuous space modeling.

Python · 8 stars · Updated Apr 24, 2023

felix-thu / AlignIQL

Official Jax implementation of AlignIQL.

Python · 3 stars · Updated May 29, 2024

frt03 / generalized_dt

Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)

Python · 65 stars · 4 forks · Updated Aug 8, 2022

thu-ml / SRPO

Codes accompanying the paper "Score Regularized Policy Optimization through Diffusion Behavior" (ICLR 2024).

Python · 36 stars · 1 fork · Updated Feb 10, 2024

ryanxhr / IVR

[ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization"

Python · 41 stars · 7 forks · Updated Jul 27, 2023

ikostrikov / implicit_q_learning

Python · 226 stars · 38 forks · Updated Jan 23, 2022

DT6A / ReBRAC

Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC

Jupyter Notebook · 11 stars · Updated Oct 22, 2023

tinkoff-ai / lb-sac

Official implementation for "Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size", NeurIPS 2022, Offline RL Workshop

Python · 21 stars · 2 forks · Updated Feb 27, 2023

TroddenSpade / Decision-Transformer-on-Offline-Reinforcement-Learning

Implementation of Decision Transformer, Conservative Q-Learning, and Behavior Cloning in Offline Reinforcement Learning setting

Jupyter Notebook · 23 stars · Updated Oct 5, 2022

takuseno / d3rlpy

An offline deep reinforcement learning library

Python · 1,292 stars · 230 forks · Updated Sep 21, 2024
