Vision-Language-Action models: long-context, history-aware, multi-modal, reasoning, and planning.
Robotic Learning: building diverse robot datasets, training large-scale robot policies on this data, and developing approaches for scalably evaluating robot foundation models.