2024 Cradle: Empowering Foundation Agents towards General Computer Control Weihao Tan, Wentao Zhang, Xinrun Xu, Haochong Xia, Ziluo Ding, Boyu Li, Bohan Zhou, Junpeng Yue, Jiechuan Jiang, Yewen Li, Ruyi An, Molei Qin, Chuqiao Zong, Longtao Zheng, YuJie Wu, Xiaoqiang Chai, Yifei Bi, Tianbao Xie, Pengjie Gu, Xiyun Li, Ceyao Zhang, Long Tian, Chaojie Wang, Xinrun Wang, Börje F. Karlsson, Bo An, Shuicheng YAN, Zongqing Lu† Neural Information Processing Systems 2024 Workshop on Open-World Agents (NeurIPS 2024 Workshop) UniCode: Learning a Unified Codebook for Multimodal Large Language Models Sipeng Zheng, Bohan Zhou, Yicheng Feng, Zongqing Lu† European Conference on Computer Vision 2024 (ECCV 2024) Pre-trained Visual Dynamics Representations for Efficient Policy Learning Hao Luo, Bohan Zhou, Zongqing Lu† European Conference on Computer Vision 2024 (ECCV 2024) Towards General Computer Control: A Multi Modal Agent For Red Dead Redemption II As A Case Study Weihao Tan, Ziluo Ding, Wentao Zhang, Boyu Li, Bohan Zhou, Junpeng Yue, Haochong Xia, Jiechuan Jiang, Longtao Zheng, Xinrun Xu, Yifei Bi, Pengjie Gu, Xinrun Wang, Börje F. Karlsson, Bo An, Zongqing Lu† International Conference on Learning Representations 2024 Workshop on LLM Agents (ICLR 2024 Workshop) 2023 Learning from Visual Observation via Offline Pretrained State-to-Go Transformer Bohan Zhou, Ke Li, Jiechuan Jiang, Zongqing Lu† Neural Information Processing Systems 2023 (NeurIPS 2023) GFIE: A Dataset and Baseline for Gaze-Following from 2D to 3D in Indoor Environments Zhengxi Hu, Yuxue Yang, Xiaolin Zhai, Dingye Yang, Bohan Zhou, Jingtai Liu† Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition 2023 (CVPR 2023) 2022 Gaze Target Estimation Inspired by Interactive Attention Zhengxi Hu , Kunxu Zhao , Bohan Zhou, Hang Guo , Shichao Wu , Yuxue Yang, Jingtai Liu† IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY 2022 (TCSVT 2022)