The reading map will eventually connect papers, diagrams, and intuition-focused explanations.
Draft order
- Start with the core idea: predict representations, not raw pixels.
- Move to image-based pretraining.
- Extend the idea to video and temporal understanding.
- Track open questions around world models and planning.
This is only a scaffold. The detailed notes can be added one paper at a time.