📄Publications

* denotes equal contribution & joint lead authorship and † denotes corresponding author.

2024

  1. Cradle: Empowering Foundation Agents towards General Computer Control
    Weihao Tan, Wentao Zhang, Xinrun Xu, Haochong Xia, Ziluo Ding, Boyu Li, Bohan Zhou, Junpeng Yue, Jiechuan Jiang, Yewen Li, Ruyi An, Molei Qin, Chuqiao Zong, Longtao Zheng, YuJie Wu, Xiaoqiang Chai, Yifei Bi, Tianbao Xie, Pengjie Gu, Xiyun Li, Ceyao Zhang, Long Tian, Chaojie Wang, Xinrun Wang, Börje F. Karlsson, Bo An, Shuicheng YAN, Zongqing Lu
    Neural Information Processing Systems 2024 Workshop on Open-World Agents (NeurIPS 2024 Workshop)
  1. UniCode: Learning a Unified Codebook for Multimodal Large Language Models
    Sipeng Zheng, Bohan Zhou, Yicheng Feng, Zongqing Lu
    European Conference on Computer Vision 2024 (ECCV 2024)
  1. Pre-trained Visual Dynamics Representations for Efficient Policy Learning
    Hao Luo, Bohan Zhou, Zongqing Lu
    European Conference on Computer Vision 2024 (ECCV 2024)
  1. Towards General Computer Control: A Multi Modal Agent For Red Dead Redemption II As A Case Study
    Weihao Tan, Ziluo Ding, Wentao Zhang, Boyu Li, Bohan Zhou, Junpeng Yue, Haochong Xia, Jiechuan Jiang, Longtao Zheng, Xinrun Xu, Yifei Bi, Pengjie Gu, Xinrun Wang, Börje F. Karlsson, Bo An, Zongqing Lu
    International Conference on Learning Representations 2024 Workshop on LLM Agents (ICLR 2024 Workshop)

2023

  1. Learning from Visual Observation via Offline Pretrained State-to-Go Transformer
    Bohan Zhou, Ke Li, Jiechuan Jiang, Zongqing Lu
    Neural Information Processing Systems 2023 (NeurIPS 2023)
  2. GFIE: A Dataset and Baseline for Gaze-Following from 2D to 3D in Indoor Environments
    Zhengxi Hu, Yuxue Yang, Xiaolin Zhai, Dingye Yang, Bohan Zhou, Jingtai Liu†
    Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition 2023 (CVPR 2023)

2022

  1. Gaze Target Estimation Inspired by Interactive Attention
    Zhengxi Hu , Kunxu Zhao , Bohan Zhou, Hang Guo , Shichao Wu , Yuxue Yang, Jingtai Liu†
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY 2022 (TCSVT 2022)