top of page

publications

gt.png

One Flow-Transformer for Imagination and Control

Rabiul Awal, Jinseong Jeong, Ankur Sikarwar, Parisa Kordjamshidi, Andrii Zadaianchuk, Sai Rajeswar, Paul Hongsuck Seo, Aishwarya Agrawal

In Review

Screenshot 2026-06-05 at 4.15.58 PM.png

How and What to Imagine? Visual Thinking in Unified Multimodal Models for Cross-View Spatial Reasoning
Qian Yang, Ankur Sikarwar, Huy Le, Le Zhang, Zhuan Shi, Perouz Taslakian, Aishwarya Agrawal

In Review

image.png

Communicating about Space: Language-Mediated Spatial Integration Across Partial Views

Ankur Sikarwar*,  Debangan Mishra*, Sudarshan Nikhil, Ponnurangam Kumaraguru, Aishwarya Agrawal (* Denotes equal contribution)

In Review

teaser.png

The Promise of RL for Autoregressive Image Editing
Saba Ahmadi*, Rabiul Awal*, Ankur Sikarwar*, Amirhossein Kazemnejad*, Ge Ya Luo, Juan A. Rodriguez, Sai Rajeswar, Siva Reddy, Christopher Pal, Benno Krojer, Aishwarya Agrawal (* Denotes equal contribution)

NeurIPS 2025

HC_tWt4aYAA3viW.jpeg

Human or Machine? Turing Tests for Vision and Language

Mengmi Zhang, Giorgia Dellaferrera, Ankur Sikarwar, Marcelo Armendariz, Noga Mudrik, Prachi Agrawal, Spandan Madan, Mranmay Shetty, Andrei Barbu, Haochen Yang, Tanishq Kumar, Shui’Er Han, Aman Raj Singh, Meghna Sadwani, Stella Dellaferrera, Michele Pizzochero, Brandon Tang, Hanspeter Pfister, Gabriel Kreiman

Nature Human Behaviour 2026

curriculum.png

Learning to Learn: How to Continuously Teach Humans and Machines
Parantak Singh, You Li, Ankur Sikarwar, Weixan Lei, Daniel Gao, Morgan Bruce Talbot,
Ying Sun, Mike Zheng Shou, Gabriel Kreiman, Mengmi Zhang

ICCV 2023

On the Efficacy of Co-Attention Transformer Layers in Visual Question Answering

Ankur Sikarwar, Gabriel Kreiman

Preprint

word_cloud_4.png
reascannnn_edited.jpg
seco_11_edited.jpg
coattn_1.png

Decoding the Enigma: Benchmarking Humans and AIs on the Many Facets of Working Memory

Ankur Sikarwar, Mengmi Zhang

NeurIPS 2023, Datasets and Benchmarks Track

When Can Transformers Ground and Compose: Insights from Compositional Generalization Benchmarks
Ankur Sikarwar, Arkil Patel, Navin Goyal

EMNLP 2022 [Oral]

Reason from Context with Self-supervised Learning
Xiao Liu, Ankur Sikarwar, Joo Hwee Lim, Gabriel Kreiman, Zenglin Shi, Mengmi Zhang

In Review

© 2023 Ankur Sikarwar

bottom of page