Publications

(2026). Addressing Limitations of Slot Attention using a Multiscale Hierarchical Approach. Manuscript.

Google Scholar

(2025). Using Neural Language Models for Long-term Action Anticipation from Videos. US Patent App. 18/539,746.

Google Scholar

(2025). Object-centric Video Representation for Action Prediction. US Patent App. 18/539,590.

Google Scholar

(2023). Object-centric Video Representation for Long-term Action Anticipation. WACV 2024.

PDF Source Document Google Scholar

(2023). AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos?. ICLR 2024.

PDF Source Document Google Scholar