- Multivariant Normal Distribuation
- KL Divergence
- Entropy
- Hinge Loss
- Pooled Variance/Scale new data
- Generalized Advantage Estimation
- Trust Region Policy Optimization
- Deep Q-Network
- Deep Deterministic Policy Gradient
- Multi-Goal Reinforcement Learning
- Proximal Policy Optimization