Risk-Aware RL
RL that considers not just expected return but also variance or tail risk of outcomes. CVaR (Conditional Value at Risk) optimization minimizes worst-case scenario performance rather than average performance. Risk-aware RL is important for physical robots where rare catastrophic outcomes (hardware damage, human injury) must be avoided even at the cost of average performance.
Robot LearningRLSafety