| NeurIPS25 | Code | Heterogeneous Adversarial Play in Interactive Environments |
| NeurIPS25 | Code | DrivAerStar: An Industrial-Grade CFD Dataset for Vehicle Aerodynamic Optimization |
| NeurIPS25 | Data | DrivAerStar: An Industrial-Grade CFD Dataset for Vehicle Aerodynamic Optimization |
| NeurIPS25 | Code | GlobalTomo: A global dataset for physics-ML seismic wavefield modeling and FWI |
| NeurIPS25 | Data | GlobalTomo: A global dataset for physics-ML seismic wavefield modeling and FWI |
| NeurIPS25 | Code | Taccel: Scaling Up Vision-based Tactile Robotics via High-performance GPU Simulation |
| T-RO25 | Code | Integration of Robot and Scene Kinematics for Sequential Mobile Manipulation Planning |
| RA-L25 | Code | B*: Efficient and Optimal Base Placement for Fixed-Base Manipulators |
| CoRL25 | Code | CLONE: Closed-Loop Whole-Body Humanoid Teleoperation for Long-Horizon Tasks |
| IROS25 | Code | Ag2x2: Robust Agent-Agnostic Visual Representations for Zero-Shot Bimanual Manipulation |
| CogSci25 | Code | Probing and Inducing Combinational Creativity in Vision-Language Models |
| CogSci25 | Dataset | Probing and Inducing Combinational Creativity in Vision-Language Models |
| CogSci25 | Code | A simulation-heuristics dual-process model for intuitive physics |
| CogSci25 | Dataset | A simulation-heuristics dual-process model for intuitive physics |
| CVPR25 | Code | GROVE: A Generalized Reward for Learning Open-Vocabulary Physical Skill |
| CVPR25 | Code | Dynamic Motion Blending for Versatile Motion Editing |
| CVPR25 | Dataset | Dynamic Motion Blending for Versatile Motion Editing |
| TRO24 | Code | Tac-Man: Tactile-Informed Prior-Free Manipulation of Articulated Objects |
| NeurIPS24 | Code | PhyRecon: Physically Plausible Neural Scene Reconstruction |
| SIGGRAPHAsia24 | Code | Autonomous Character-Scene Interaction Synthesis from Text Instruction |
| SIGGRAPHAsia24 | Data | Autonomous Character-Scene Interaction Synthesis from Text Instruction |
| IROS24 | Code | Ag2Manip: Learning Novel Manipulation Skills with Agent-Agnostic Visual and Action Representations |
| IROS24 | Code | PreAfford: Universal Affordance-Based Pre-Grasping for Diverse Objects and Environments |
| ECCV24 | Code | Zero-Shot Image Feature Consensus with Deep Functional Maps |
| CogSci24 | Code | Evaluating and Modeling Social Intelligence: A Comparative Study of Human and AI Capabilities |
| CVPR24 | Code | Move as You Say, Interact as You Can: Language-guided Human Motion Generation with Scene Affordance |
| CVPR24 | Code | AnySkill: Learning Open-Vocabulary Physical Skill for Interactive Agents |
| CVPR24 | Code | Scaling Up Dynamic Human-Scene Interaction Modeling |
| CVPR24 | HuggingFace | Scaling Up Dynamic Human-Scene Interaction Modeling |
| RAL24 | Code | Grasp Multiple Objects with One Hand |
| RAL24 | Dataset | Grasp Multiple Objects with One Hand |
| 3DV24 | Code | Single-view 3D Scene Reconstruction with High-fidelity Shape and Texture |
| 3DV24 | Dataset | Single-view 3D Scene Reconstruction with High-fidelity Shape and Texture |
| ICLR24 | Code | I-PHYRE: Interactive Physical Reasoning |
| ICLR24 | Code | SparseDFF: Sparse-View Feature Distillation for One-Shot Dexterous Manipulation |
| ICLR24 | Code | Neural-Symbolic Recursive Machine for Systematic Generalization |
| NeurIPS23 | Code | ProBio: A Protocol-guided Multimodal Dataset for Molecular Biology Lab |
| NeurIPS23 | Dataset | ProBio: A Protocol-guided Multimodal Dataset for Molecular Biology Lab |
| NeurIPS23 | Code | Active Reasoning in an Open-World Environment |
| NeurIPS23 | Code | ChimpAct: A Longitudinal Dataset for Understanding Chimpanzee Behaviors |
| NeurIPS23 | Data Examples | ChimpAct: A Longitudinal Dataset for Understanding Chimpanzee Behaviors |
| NeurIPS23 | Dataset | ChimpAct: A Longitudinal Dataset for Understanding Chimpanzee Behaviors |
| NeurIPS23 | Code & Dataset | Evaluating and Inducing Personality in Pre-trained Language Models |
| NeurIPS23 | Online Game | Interactive Visual Reasoning under Uncertainty |
| NeurIPS23 | Code | Interactive Visual Reasoning under Uncertainty |
| NeurIPS23 | Data Examples | Interactive Visual Reasoning under Uncertainty |
| ICCV23 | Code | Full-Body Articulated Human-Object Interaction |
| ICCV23 | Dataset | Full-Body Articulated Human-Object Interaction |
| ICCV23 | Code | X-VoE: Measuring eXplanatory Violation of Expectation in Physical Events |
| ICCV23 | Dataset | X-VoE: Measuring eXplanatory Violation of Expectation in Physical Events |
| IROS23 | Code | Part-level Scene Reconstruction Affords Robot Interaction |
| ICML23 | Code | MEWL: Few-shot multimodal word learning with referential uncertainty |
| ICML23 | Dataset | MEWL: Few-shot multimodal word learning with referential uncertainty |
| ICML23 | Code | On the Complexity of Bayesian Generalization |
| CVPR23 | Code | Diffusion-based Generation, Optimization, and Planning in 3D Scenes |
| CVPR23 | Checkpoints | Diffusion-based Generation, Optimization, and Planning in 3D Scenes |
| CVPR23 | HuggingFace | Diffusion-based Generation, Optimization, and Planning in 3D Scenes |
| ICLR23 | Code | Understanding Embodied Reference with Touch-Line Transformer |
| ICLR23 | Code | A Minimalist Dataset for Systematic Generalization of Perception, Syntax, and Semantics |
| ICLR23 | Dataset | A Minimalist Dataset for Systematic Generalization of Perception, Syntax, and Semantics |
| ICRA23 | Code | GenDexGrasp: Generalizable Dexterous Grasping |
| ICRA23 | Dataset | GenDexGrasp: Generalizable Dexterous Grasping |
| ICRA23 | Code | Rearrange Indoor Scenes for Human-Robot Co-Activity |
| NeurIPS22 | Code | On the Learning Mechanisms in Physical Reasoning |
| NeurIPS22 | Code | Emergent Graphical Conventions in a Visual Communication Game |
| NeurIPS22 | Code | HUMANISE: Language-conditioned Human Motion Generation in 3D Scenes |
| NeurIPS22 | Dataset | HUMANISE: Language-conditioned Human Motion Generation in 3D Scenes |
| IJCV22 | Code | Scene Reconstruction with Functional Objects for Robot Autonomy |
| ScienceRobotics22 | Code & Data | In situ bidirectional human-robot value alignment |
| ECCV22 | Code | Learning Algebraic Representation for Systematic Generalization in Abstract Reasoning |
| ECCV22 | Dataset | Learning Algebraic Representation for Systematic Generalization in Abstract Reasoning |
| IROS22 | Code | Planning Sequential Tasks on Contact Graph |
| RA-L22 | Code | Understanding Physical Effects for Effective Tool-use |
| ICML22 | Code | Latent Diffusion Energy-Based Model for Interpretable Text Modeling |
| RA-L22 | Code | Object Gathering with a Tethered Robot Duo |
| NeurIPS21 | Code | Unsupervised Foreground Extraction via Deep Region Competition |
| ICCV21 | Code | YouRefIt: Embodied Reference Understanding with Language and Gesture |
| ICCV21 | Dataset | YouRefIt: Embodied Reference Understanding with Language and Gesture |
| ICCV21 | Code | Spatio-temporal Self-Supervised Representation Learning for 3D Point Clouds |
| RA-L22 | Code | Synthesizing Diverse and Physically Stable Grasps with Arbitrary Hand Structures using Differentiable Force Closure Estimator |
| IROS21 | Code | Consolidating Kinematic Models to Promote Coordinated Mobile Manipulations |
| IROS21 | Code | Efficient Task Planning for Mobile Manipulation: a Virtual Kinematic Chain Perspective |
| IROS21 | Code | Communicative Learning with Natural Gestures for Embodied Navigation Agents with Human-in-the-Scene |
| CVPR21 | Code | Learning Triadic Belief Dynamics in Nonverbal Communication from Videos |
| CVPR21 | Data | Learning Triadic Belief Dynamics in Nonverbal Communication from Videos |
| CVPR21 | Code | ACRE: Abstract Causal Reasoning Beyond Covariation |
| CVPR21 | Dataset | ACRE: Abstract Causal Reasoning Beyond Covariation |
| CVPR21 | Code | Abstract Spatial-Temporal Reasoning via Probabilistic Abduction and Execution |
| ICRA21 | Code | Reconstructing Interactive 3D Scene by Panoptic Mapping and CAD Model Alignments |
| ICRA21 | Code | Congestion-aware Multi-agent Trajectory Prediction for Collision Avoidance |
| IJNME21 | Code | Lagrangian‐Eulerian Multi‐Density Topology Optimization with the Material Point Method |
| ECCV20 | Code | LEMMA: A Multi-view Dataset for Learning Multi-agent Multi-task Activities |
| ECCV20 | Dataset | LEMMA: A Multi-view Dataset for Learning Multi-agent Multi-task Activities |
| IROS20 | Code | Human-Robot Interaction in a Shared Augmented Reality Workspace |
| IROS20 | Code | Graph-based Hierarchical Knowledge Representation for Robot Task Transfer from Virtual to Physical World |
| SIGGRAPH20 | Code | A Massively Parallel and Scalable Multi-GPU Material Point Method |
| SIGGRAPH20 | Code | IQ-MPM: An Interface Quadrature Material Point Method for Non-sticky Strongly Two-Way Coupled Nonlinear Solids and Fluids |
| ICRA20 | Code | Congestion-aware Evacuation Routing using Augmented Reality Devices |
| ScienceRobotics19 | Code | A tale of two explanations: Enhancing human trust by explaining robot behavior |
| AAAI20 | Code | Theory-based Causal Transfer: Integrating Instance-level Induction and Abstract-level Structure Learning |
| AAAI20 | Code | Machine Number Sense: A Dataset of Visual Arithmetic Problems for Abstract and Relational Reasoning |
| AAAI20 | Dataset | Machine Number Sense: A Dataset of Visual Arithmetic Problems for Abstract and Relational Reasoning [Google Drive] [Baidu Netdisk] |
| NeurIPS19 | Code | Learning Perceptual Inference by Contrasting |
| ICCV19 | Code | Holistic++ Scene Understanding: Single-view 3D Holistic Scene Parsing and Human Pose Estimation with Human-Object Interaction and Physical Commonsense |
| IROS19 | Code | Learning Virtual Grasp with Failed Demonstrations via Bayesian Inverse Reinforcement Learning |
| TURC19 | Code | VRGym: A Virtual Testbed for Physical and Interactive AI |
| CVPR19 | Code | RAVEN: A Dataset for Relational and Analogical Visual rEasoNing |
| CVPR19 | Dataset | RAVEN-10000 |
| ICRA19 | Code | High-Fidelity Grasping in Virtual Reality using a Glove-based System |
| ICRA19 | Code | Self-Supervised Incremental Learning for Sound Source Localization in Complex Indoor Environment |
| AAAI19 | Code | MetaStyle: Three-Way Trade-Off Among Speed, Flexibility and Quality in Neural Style Transfer |
| NeurIPS18 | Code | Cooperative Holistic Scene Understanding: Unifying 3D Object, Layout, and Camera Pose Estimation |
| ECCV18 | Code | Holistic 3D Scene Parsing and Reconstruction from a Single RGB Image |
| CogSci18 | Code | Human Causal Transfer: Challenges for Deep Reinforcement Learning |
| CVPR18 | Code | Human-centric Indoor Scene Synthesis Using Stochastic Grammar |
| SIGGRAPH18 | Code | A Moving Least Squares Material Point Method with Displacement Discontinuity and Two-Way Rigid Body Coupling |
| ICRA18 | Code | Interactive Robot Knowledge Patching using Augmented Reality |
| ICRA18 | Code | Unsupervised Learning of Hierarchical Models for Hand-Object Interactions |
| IROS17 | Code | Feeling the Force: Integrating Force and Pose for Fluent Discovery through Imitation Learning to Open Medicine Bottles |
| IROS17 | Code | A Glove-based System for Studying Hand-Object Manipulation via Joint Pose and Force Sensing |
| SIGGRAPHAsia16Workshop | Code | A Virtual Reality Platform for Dynamic Human-Scene Interaction |
| CVPR16 | Code | FEM Simulation for Inferring Forces during Human-scene Interactions |
| CVPR16 | Data | Watertight 3D Scene Dataset |
| CVPR15 | Code | Kinect2 Toolbox |
| CVPR15 | Data | Tool 3D Dataset |