Posts

Showing posts from 2026

ICLR 2025 World Model Workshop Notes

  https://iclr.cc/virtual/2025/workshop/24000 highlights world model is a model given states and actions, predict next states world model can be used for simulation and training RL agent short horizon task: combine local reward and global value function diffusion model can be combined with/replace transformer or VAE to speed up inference post training Robotics one model for high level instruction generation and one for low level motion generation use vision language model simulate with world model and pick best trajectory before execute in real environment World model can be pure video frame prediction encode video into latent space and apply diffusion model or transformer model for generation one component of model to infer latent action from video frames could also concatenate video token with action token and make next frame prediction Quality diversity algorithms find a diverse set of high performing agent(policies) and combine them exploration and goal s...