@@ -28,7 +28,7 @@ See documentation for the full list of included features.
 - [Augmented Random Search (ARS)](https://arxiv.org/abs/1803.07055)
 - [Quantile Regression DQN (QR-DQN)](https://arxiv.org/abs/1710.10044)
 - [PPO with invalid action masking (MaskablePPO)](https://arxiv.org/abs/2006.14171)
-- [PPO with recurrent policy (RecurrentPPO aka PPO LSTM)](https://ppo-details.cleanrl.dev//2021/11/05/ppo-implementation-details/)
+- [PPO with recurrent policy (RecurrentPPO aka PPO LSTM)](https://iclr-blog-track.github.io/2022/03/25/ppo-implementation-details/)
 - [Truncated Quantile Critics (TQC)](https://arxiv.org/abs/2005.04269)
 - [Trust Region Policy Optimization (TRPO)](https://arxiv.org/abs/1502.05477)
 - [Batch Normalization in Deep Reinforcement Learning (CrossQ)](https://openreview.net/forum?id=PczQtTsTIX)
 # Changelog

+## Release 2.8.1a0 (WIP)
+
+### Breaking Changes:
+
+### New Features:
+
+### Bug Fixes:
+
+### Deprecations:
+
+### Others:
+
+### Documentation:
+- Fixed dead link for `RecurrentPPO`.
+
+
 ## Release 2.8.0 (2026-04-01)

 ### Breaking Changes: