Publication

ProBio: A Protocol-guided Multimodal Dataset for Molecular Biology Lab

Jieming Cui* , , Baoxiong Jia* , Siyuan Huang , Zilong Zheng , Jianzhu Ma , Yixin Zhu .
Advances in Neural Information Processing System (NeurIPS) 2023 (Track on Datasets and Benchmarks)

X-VoE: Measuring eXplanatory Violation of Expectation in Physical Events

, , Baoxiong Jia , Zeyu Zhang , Chi Zhang , Yixin Zhu , Song-Chun Zhu .
International Conference on Computer Vision (ICCV) 2023 (Oral)

ARNOLD: A Benchmark for Language-Grounded Task Learning with Continuous States in Realistic Scenes

Ran Gong* , Jiangyong Huang* , Yizhou Zhao , Haoran Geng , Xiaofeng Gao , , , , Demetri Terzopoulos , Song-Chun Zhu , Baoxiong Jia , Siyuan Huang .
International Conference on Computer Vision (ICCV) 2023
LangRob@CoRL 2022 (* indicates equal contribution.)

Learning a Causal Transition Model for Object Cutting

International Conference on Intelligent Robots and Systems (IROS) 2023
(* indicates equal contribution.)

Diffusion-based Generation, Optimization, and Planning in 3D Scenes

Conference on Computer Vision and Pattern Recognition (CVPR) 2023
(* indicates equal contribution.)

Improving Unsupervised Object-centric Learning with Query Optimization

Baoxiong Jia* , Yu Liu* , Siyuan Huang .
International Conference on Learning Represetnations (ICLR) 2023
(* indicates equal contribution.)

EgoTaskQA: Understanding Human Tasks in Egocentric Videos

Baoxiong Jia , , Song-Chun Zhu , Siyuan Huang .
Advances in Neural Information Processing System (NeurIPS) 2022 (Track on Datasets and Benchmarks)

Learning Algebraic Representation for Systematic Generalization in Contextual Decision Processes

European Conference on Computer Vision (ECCV) 2022
(* indicates equal contribution.)

Latent Diffusion Energy-Based Model for Interpretable Text Modeling

, Sirui Xie , Xiaojian Ma , Baoxiong Jia , , Ruiqi Gao , Yixin Zhu , Song-Chun Zhu , Ying Nian Wu .
International Conference on Machine Learning (ICML) 2022

ACRE: Abstract Causal REasoning Beyond Covariation

IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2021
(* indicates equal contribution.)

Abstract Spatial-Temporal Reasoning via Probabilistic Abduction and Execution

Chi Zhang* , Baoxiong Jia* , Song-Chun Zhu , Yixin Zhu .
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2021
(* indicates equal contribution.)

LEMMA: A Multi-view Dataset for Learning Multi-agent Multi-task Activities

European Conference on Computer Vision (ECCV) 2020

A Generalized Earley Parser for Human Activity Parsing and Prediction

Siyuan Qi , Baoxiong Jia , Siyuan Huang , Ping Wei , Song-Chun Zhu .
Transactions on Pattern Analysis and Machine Intelligence (TPAMI) 2020

Learning Perceptual Inference by Contrasting

Conference on Neural Information Processing Systems (NeurIPS) 2019 (Spotlight)
(* indicates equal contribution.)

RAVEN: A Dataset for Relational and Analogical Visual rEasoNing

Chi Zhang* , Feng Gao* , Baoxiong Jia , Yixin Zhu , Song-Chun Zhu .
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2019
(* indicates equal contribution.)

Learning Human-Object Interactions by Graph Parsing Neural Networks

Siyuan Qi* , Wenguan Wang* , Baoxiong Jia , , Song-Chun Zhu .
European Conference on Computer Vision (ECCV) 2018
(* indicates equal contribution.)

Generalized Earley Parser: Bridging Symbolic Grammars and Sequence Data for Future Prediction

Siyuan Qi , Baoxiong Jia , Song-Chun Zhu .
International Conference on Machine Learning (ICML) 2018

Mining User Reviews for Mobile App Comparison

Yuanchun Li , Baoxiong Jia , Yao Guo , .
ACM International Joint Conference on Pervasive and Ubiquitous Computing (UbiComp) 2017


Baoxiong Jia © 2022. All rights reserved.