World Models

Introduction

Dreamer V2

World Model learning

Figure 1: World Model Learning.

Behavior Learning

Figure 2: Behavior learning of actor-critic in the model latent states.

Latent Diffusion Planning for Imitation Learning

Self-supervised world models

There is no turning back: A self-supervised approach for reversibility-aware reinforcement learning

Reversibility

Reversibility estimation via classification

Temporal Difference Learning for Model Predictive Control (TD-MPC) <d-cite key”hansen2022temporal”></d-cite>

Task-Oriented Latent Dynamics Model

Learning Latent Dynamics for Planning from Pixels

Latent Space Plannning

Recurrent State Space Model (RSSM)

Latent Overshooting