Skip to content

Commit 7d2e1e9

Browse files
Tanki777Cemal Yilmaz
andauthored
Fix PPO blog post link recurrentppo (#330)
* Fixed dead link. * Updated changelog. --------- Co-authored-by: Cemal Yilmaz <Cemal.Yilmaz@telematik-zentrum.de>
1 parent 50d8a8f commit 7d2e1e9

2 files changed

Lines changed: 17 additions & 1 deletion

File tree

README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -28,7 +28,7 @@ See documentation for the full list of included features.
2828
- [Augmented Random Search (ARS)](https://arxiv.org/abs/1803.07055)
2929
- [Quantile Regression DQN (QR-DQN)](https://arxiv.org/abs/1710.10044)
3030
- [PPO with invalid action masking (MaskablePPO)](https://arxiv.org/abs/2006.14171)
31-
- [PPO with recurrent policy (RecurrentPPO aka PPO LSTM)](https://ppo-details.cleanrl.dev//2021/11/05/ppo-implementation-details/)
31+
- [PPO with recurrent policy (RecurrentPPO aka PPO LSTM)](https://iclr-blog-track.github.io/2022/03/25/ppo-implementation-details/)
3232
- [Truncated Quantile Critics (TQC)](https://arxiv.org/abs/2005.04269)
3333
- [Trust Region Policy Optimization (TRPO)](https://arxiv.org/abs/1502.05477)
3434
- [Batch Normalization in Deep Reinforcement Learning (CrossQ)](https://openreview.net/forum?id=PczQtTsTIX)

docs/misc/changelog.md

Lines changed: 16 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -2,6 +2,22 @@
22

33
# Changelog
44

5+
## Release 2.8.1a0 (WIP)
6+
7+
### Breaking Changes:
8+
9+
### New Features:
10+
11+
### Bug Fixes:
12+
13+
### Deprecations:
14+
15+
### Others:
16+
17+
### Documentation:
18+
- Fixed dead link for `RecurrentPPO`.
19+
20+
521
## Release 2.8.0 (2026-04-01)
622

723
### Breaking Changes:

0 commit comments

Comments
 (0)