Song-Chun Zhu
Latest
- [NeurIPS24] PhyRecon: Physically Plausible Neural Scene Reconstruction
- [IROS24] Ag2Manip: Learning Novel Manipulation Skills with Agent-Agnostic Visual and Action Representations
- [ScienceAdvances24] Human-level few-shot concept induction through minimax entropy learning
- [ICLR24] Neural-Symbolic Recursive Machine for Systematic Generalization
- [NeurIPS23] Evaluating and Inducing Personality in Pre-trained Language Models
- [ICCV23] X-VoE: Measuring eXplanatory Violation of Expectation in Physical Events
- [IROS23] Learning a Causal Transition Model for Object Cutting
- [IROS23] Part-level Scene Reconstruction Affords Robot Interaction
- [ICML23] On the Complexity of Bayesian Generalization
- [AIR22] Artificial Social Intelligence: A Comparative and Holistic View
- [CVPR23] Diffusion-based Generation, Optimization, and Planning in 3D Scenes
- [ICLR23] A Minimalist Dataset for Systematic Generalization of Perception, Syntax, and Semantics
- [ICRA23] Rearrange Indoor Scenes for Human-Robot Co-Activity
- [Engineering23] A Reconfigurable Data Glove for Reconstructing Physical and Virtual Grasps
- [NeurIPS22] Emergent Graphical Conventions in a Visual Communication Game
- [IJCV22] Scene Reconstruction with Functional Objects for Robot Autonomy
- [ScienceRobotics22] In situ bidirectional human-robot value alignment
- [ECCV22] Learning Algebraic Representation for Systematic Generalization in Abstract Reasoning
- [ECCV22Workshop] PartAfford: Part-level Affordance Discovery from 3D Objects
- [IROS22] Sequential Manipulation Planning on Scene Graph
- [RA-L/IROS22] Understanding Physical Effects for Effective Tool-use
- [ICML22] Latent Diffusion Energy-Based Model for Interpretable Text Modeling
- [AAIL21] Patching interpretable And-Or-Graph knowledge representation using augmented reality
- [NeurIPS21] Unsupervised Foreground Extraction via Deep Region Competition
- [RA-L21] Synthesizing Diverse and Physically Stable Grasps with Arbitrary Hand Structures using Differentiable Force Closure Estimator
- [ICCV21] YouRefIt: Embodied Reference Understanding with Language and Gesture
- [ICCV21] Spatio-temporal Self-Supervised Representation Learning for 3D Point Clouds
- [IROS21] Consolidating Kinematic Models to Promote Coordinated Mobile Manipulations
- [IROS21] Efficient Task Planning for Mobile Manipulation: a Virtual Kinematic Chain Perspective
- [ICLR21Workshop] HALMA: Humanlike Abstraction Learning Meets Affordance in Rapid Problem Solving
- [ACL-Findings21] GRICE: A Grammar-based Dataset for Recovering Implicature and Conversational rEasoning
- [CVPR21] Learning Triadic Belief Dynamics in Nonverbal Communication from Videos
- [CVPR21] ACRE: Abstract Causal Reasoning Beyond Covariation
- [CVPR21] Abstract Spatial-Temporal Reasoning via Probabilistic Abduction and Execution
- [ICRA21] Reconstructing Interactive 3D Scene by Panoptic Mapping and CAD Model Alignments
- [ICRA21] Congestion-aware Multi-agent Trajectory Prediction for Collision Avoidance
- [ECCV20] LEMMA: A Multi-view Dataset for Learning Multi-agent Multi-task Activities
- [IROS20] Human-Robot Interaction in a Shared Augmented Reality Workspace
- [IROS20] Graph-based Hierarchical Knowledge Representation for Robot Task Transfer from Virtual to Physical World
- [Engineering20] Dark, Beyond Deep: A Paradigm Shift to Cognitive AI with Humanlike Common Sense
- [SIGGRAPH20] A Massively Parallel and Scalable Multi-GPU Material Point Method
- [ICRA20] Joint Inference of States, Robot Knowledge, and Human (False-)Beliefs
- [ICRA20] Congestion-aware Evacuation Routing using Augmented Reality Devices
- [ScienceRobotics19] A tale of two explanations: Enhancing human trust by explaining robot behavior
- [AAAI20] Theory-based Causal Transfer: Integrating Instance-level Induction and Abstract-level Structure Learning
- [AAAI20] Machine Number Sense: A Dataset of Visual Arithmetic Problems for Abstract and Relational Reasoning
- [NeurIPS19] Learning Perceptual Inference by Contrasting
- [NeurIPS19] PerspectiveNet: 3D Object Detection from a Single RGB Image via Perspective Points
- [ICCV19] Holistic++ Scene Understanding: Single-view 3D Holistic Scene Parsing and Human Pose Estimation with Human-Object Interaction and Physical Commonsense
- [IROS19] Learning Virtual Grasp with Failed Demonstrations via Bayesian Inverse Reinforcement Learning
- [CogSci19] Decomposing Human Causal Learning: Bottom-up Associative Learning and Top-down Schema Reasoning
- [CVPR19] RAVEN: A Dataset for Relational and Analogical Visual Reasoning
- [TURC19] VRGym: A Virtual Testbed for Physical and Interactive AI
- [ICRA19] High-Fidelity Grasping in Virtual Reality using a Glove-based System
- [ICRA19] Self-Supervised Incremental Learning for Sound Source Localization in Complex Indoor Environment
- [AAAI19] MetaStyle: Three-Way Trade-Off Among Speed, Flexibility and Quality in Neural Style Transfer
- [AAAI19] Mirroring without Overimitation: Learning Functionally Equivalent Manipulation Actions
- [NeurIPS18] Cooperative Holistic Scene Understanding: Unifying 3D Object, Layout, and Camera Pose Estimation
- [ECCV18] Holistic 3D Scene Parsing and Reconstruction from a Single RGB Image
- [IJCV18] Configurable 3D Scene Synthesis and 2D Image Rendering with Per-Pixel Ground Truth using Stochastic Grammars
- [CogSci18] Human Causal Transfer: Challenges for Deep Reinforcement Learning
- [CVPR18] Human-centric Indoor Scene Synthesis Using Stochastic Grammar
- [ICRA18] Interactive Robot Knowledge Patching using Augmented Reality
- [ICRA18] Unsupervised Learning using Hierarchical Models for Hand-Object Interactions
- [AAAI18] Tracking Occluded Objects and Recovering Incomplete Trajectories by Reasoning about Containment Relations and Human Actions
- [IROS17] A Glove-based System for Studying Hand-Object Manipulation via Joint Pose and Force Sensing
- [IROS17] Feeling the Force: Integrating Force and Pose for Fluent Discovery through Imitation Learning to Open Medicine Bottles
- [CogSci17] Consistent Probabilistic Simulation Underlying Human Judgment in Substance Dynamics
- [TVCG16] The Martian: Examining Human Physical Judgments Across Virtual Gravity Fields
- [SIGGRAPHAsia16Workshop] A Virtual Reality Platform for Dynamic Human-Scene Interaction
- [IJCAI16] What is Where: Inferring Containment Relations from Videos
- [CVPR16] Inferring Forces and Learning Human Utilities From Videos
- [CogSci16] Probabilistic Simulation Predicts Human Performance on Viscous Fluid-Pouring Problem
- [CVPR15] Understanding Tools: Task-Oriented Object Modeling, Learning and Recognition
- [CogSci15] Evaluating Human Cognition of Containing Relations with Physical Simulation