World Models

Introduction

Dreamer V2

World Model learning

Figure 1: World Model Learning.

Behavior Learning

Figure 2: Behavior learning of actor-critic in the model latent states.

Latent Diffusion Planning for Imitation Learning

Self-supervised world models

There is no turning back: A self-supervised approach for reversibility-aware reinforcement learning

Reversibility

Reversibility estimation via classification

Temporal Difference Learning for Model Predictive Control (TD-MPC) <d-cite key”hansen2022temporal”></d-cite>

Task-Oriented Latent Dynamics Model

Learning Latent Dynamics for Planning from Pixels

Latent Space Plannning

Recurrent State Space Model (RSSM)

Latent Overshooting

DINO-WM

- Learning world models to capture stability features (don’t focus on image reconstruction or reward prediction, then what to focus on?!?!)