PKU CoRe Lab
PKU CoRe Lab
Home
People
Research
Themes
Publications
Downloads
Videos
Talks
Scene Parsing
[NeurIPS23] ProBio: A Protocol-guided Multimodal Dataset for Molecular Biology Lab
The challenge of replicating research results has posed a significant impediment to the field of molecular biology. The advent of …
Jieming Cui
,
Ziren Gong
,
Baoxiong Jia
,
Siyuan Huang
,
Zilong Zheng
,
Jianzhu Ma
,
Yixin Zhu
PDF
Cite
Code
Dataset
Poster
Video
Supp
Web
北大AI院官微
[NeurIPS23] ChimpACT: A Longitudinal Dataset for Understanding Chimpanzee Behaviors
Understanding the behavior of non-human primates is crucial for improving animal welfare, modeling social behavior, and gaining …
Xiaoxuan Ma
,
Stephan Kaufhold
,
Jiajun Su
,
Wentao Zhu
,
Jack Terwilliger
,
Andres Meza
,
Yixin Zhu
,
Federico Rossano
,
Yizhou Wang
PDF
Cite
Code
Poster
Video
Supp
Web
Dataset Examples
Request Dataset
北大AI院官微
[ICLR23] Understanding Embodied Reference with Touch-Line Transformer
We study embodied reference understanding, the task of locating referents using embodied gestural signals and language references. …
Yang Li
,
Xiaoxue Chen
,
Hao Zhao
,
Jiangtao Gong
,
Guyue Zhou
,
Federico Rossano
,
Yixin Zhu
PDF
Cite
Code
Poster
Video
Web
北大新闻网
北大AI院官网
北大新工科官微
北大AI院官微
Visually Grounded Reasoning
[Vision meets Cognition]
Topics include affordance, container, functionality, hoi, intent, scene parsing, scene reconstruction, and …
Jieming Cui
,
Kai Jia
,
Nan Jiang
Aug 1, 2022
[ICCV21] YouRefIt: Embodied Reference Understanding with Language and Gesture
We study the machine’s understanding of embodied reference: One agent uses both language and gesture to refer to an object to …
Yixin Chen
,
Qing Li
,
Deqian Kong
,
Yik Lun Kei
,
Song-Chun Zhu
,
Tao Gao
,
Yixin Zhu
,
Siyuan Huang
PDF
Cite
Code
Dataset
Poster
Video
Supp
Web
[CVPR21] Learning Triadic Belief Dynamics in Nonverbal Communication from Videos
Humans possess a unique social cognition capability; nonverbal communication can convey rich social information among agents. In …
Lifeng Fan
,
Shuwen Qiu
,
Zilong Zheng
,
Tao Gao
,
Song-Chun Zhu
,
Yixin Zhu
PDF
Cite
Code
Dataset
Video
Supp
[ECCV20] LEMMA: A Multi-view Dataset for Learning Multi-agent Multi-task Activities
Understanding and interpreting human actions is a long-standing challenge and a critical indicator of perception in artificial …
Baoxiong Jia
,
Yixin Chen
,
Siyuan Huang
,
Yixin Zhu
,
Song-Chun Zhu
PDF
Cite
Code
Dataset
Supp
Presentation
Web
[ICRA20] Joint Inference of States, Robot Knowledge, and Human (False-)Beliefs
Aiming to understand how human (false-)belief—a core socio-cognitive ability—would affect human interactions with robots, …
Tao Yuan
,
Hangxin Liu
,
Lifeng Fan
,
Zilong Zheng
,
Tao Gao
,
Yixin Zhu
,
Song-Chun Zhu
PDF
Cite
Video
Presentation
[NeurIPS19] PerspectiveNet: 3D Object Detection from a Single RGB Image via Perspective Points
Detecting 3D objects from a single RGB image is intrinsically ambiguous, thus requiring appropriate prior knowledge and intermediate …
Siyuan Huang
,
Yixin Chen
,
Tao Yuan
,
Siyuan Qi
,
Yixin Zhu
,
Song-Chun Zhu
PDF
Cite
Poster
Supp
[ICCV19] Holistic++ Scene Understanding: Single-view 3D Holistic Scene Parsing and Human Pose Estimation with Human-Object Interaction and Physical Commonsense
We propose a new 3D holistic++ scene understanding problem, which jointly tackles two tasks from a single-view image: (i) holistic …
Yixin Chen
,
Siyuan Huang
,
Tao Yuan
,
Yixin Zhu
,
Siyuan Qi
,
Song-Chun Zhu
PDF
Cite
Code
Poster
Video
Supp
Web
»
Cite
×