Skip to content
View pickxiguapi's full-sized avatar

Block or report pickxiguapi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. CleanDiffuserTeam/CleanDiffuser CleanDiffuserTeam/CleanDiffuser Public

    CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making

    Jupyter Notebook 314 28

  2. Clean-Offline-RLHF Clean-Offline-RLHF Public

    Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024)

    Python 31 2

  3. Uni-RLHF-Platform Uni-RLHF-Platform Public

    Uni-RLHF platform for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback" (ICLR2024)

    Python 29 1

  4. euclid-iclr2023 euclid-iclr2023 Public

    Official implementation for "EUCLID: Towards efficient unsupervised reinforcement learning with multi-choice dynamics model" (ICLR2023)

    Python 1

  5. ED2 ED2 Public

    Forked from ED2-source-code/ED2

    the ED2 implementation

    Python

  6. Mini-Uni-RLHF Mini-Uni-RLHF Public

    Minimal implementation for easy-to-use RLHF annotation

    Python 1