arXiv
Open Access
2026
Nightmare Dreamer: Dreaming About Unsafe States And Planning Ahead
Oluwatosin Oseni
Shengjie Wang
Jun Zhu
Micah Corah
Abstrak
Reinforcement Learning (RL) has shown remarkable success in real-world applications, particularly in robotics control. However, RL adoption remains limited due to insufficient safety guarantees. We introduce Nightmare Dreamer, a model-based Safe RL algorithm that addresses safety concerns by leveraging a learned world model to predict potential safety violations and plan actions accordingly. Nightmare Dreamer achieves nearly zero safety violations while maximizing rewards. Nightmare Dreamer outperforms model-free baselines on Safety Gymnasium tasks using only image observations, achieving nearly a 20x improvement in efficiency.
Penulis (4)
O
Oluwatosin Oseni
S
Shengjie Wang
J
Jun Zhu
M
Micah Corah
Akses Cepat
Informasi Jurnal
- Tahun Terbit
- 2026
- Bahasa
- en
- Sumber Database
- arXiv
- Akses
- Open Access ✓