new RSS 2026 Workshop on Semantics for Robotics (SemRob) -- Poster
IMBench: A Benchmark for Intuitive Robotic Manipulation
Anurag Maurya, Sukhvansh Jain, Prajwal Avhad, Gautham Balachandran,
Ziyi Zhou, Atharva Kshirsagar, Satyam Singh, Bowen Li,
Rishabh Mukund, Ritul Singh, Jatin Vira, Suvonil Chatterjee, Devesh K. Jha
Humans combine reasoning and motor control to solve complex manipulation tasks under diverse constraints.
They build an understanding of the physical world that helps them convert reasoning into actions and quickly
adapt to new scenes, tasks, and rules. We refer to this capability as intuitive manipulation.
Existing benchmarks fail to capture this integration: they evaluate physical reasoning in isolation from
execution, or measure policy performance without requiring explicit reasoning. We introduce IMBench,
a benchmark designed to evaluate intuitive manipulation as an integrated capability spanning perception,
physical reasoning, action generation, and iterative execution. Our tasks require models to infer
task-relevant physical structure and generate feasible action sequences under explicit constraints,
including contact-rich manipulation, tool use, and multi-stage dependencies. We introduce a benchmark of
35 tasks, 14K filtered trajectories, and scalable tools for generating
diverse scenarios.