Contents Menu Expand Light mode Dark mode Auto light/dark mode
Logo

Get Start

  • Installation
  • Usage Video

mathematical theory

  • Notations
  • Vector and Martrix
  • Lagrange Duality

Base RL Algorithm

  • Trust Region Policy Optimization
  • Proximal Policy Optimization Algorithms

Safe RL Algorithm

  • Constrained Policy Optimization
  • Projection-Based Constrained Policy Optimization
  • First Order Constrained Optimization in Policy Space
  • Lagrangian Methods

baserl api

  • Base on-policy Algorithms

saferl api

  • First Order Algorithms
  • Second Order Algorithms
  • The Lagrange Algorithms
  • Penalty Function Algorithms

common

  • OmniSafe Buffer
  • OmniSafe Experiment Grid
  • OmniSafe Lagrange Multiplier
  • OmniSafe Normalizer
  • OmniSafe Logger

Utils

  • OmniSafe Config
  • OmniSafe Distributed
  • OmniSafe Math
  • OmniSafe Model Utils
  • OmniSafe Tools

Models

  • OmniSafe Actor
  • OmniSafe Critic
Back to top
Copyright © 2022, OmniSafe Team
Made with Sphinx and @pradyunsg's Furo