教师简介:
刘阳,副教授,硕士生导师。主要研究方向为具身智能、多模态空间感知与推理、因果推断。已累计发表论文50余篇,包括TPAMI,TIP,TMECH,TKDE,CVPR,ICCV,ACM MM,NeurIPS等,5篇一作/通讯期刊论文入选ESI高被引,4篇会议论文入选Oral/Highlight,1篇论文入选TMech期刊热门文章榜单前2。出版中文专著《多模态大模型:新一代人工智能技术范式》,销量过万,获得电子工业出版社年度优秀作者。主编英文专著 "Multimodal Large Models: A New Paradigm of Artificial Intelligence" 一本,参编中文专著一本。主持国家自然科学基金面上、青年(C类)、重点项目子课题、鹏城实验室“揭榜挂帅”、华为企业合作等项目十余项。担任国际权威期刊Pattern Recognition编委。指导学生获得2023中国软件大会机器人大模型与具身智能挑战赛优胜奖。获得2023广东省第三届计算机科学青年学术秀一等奖,2025年度广东省学位与研究生教育学会优秀教学成果二等奖(1/2)。
招生信息:欢迎2027年9月入学的夏令营/保研同学联系,具备硬件开发经验、熟悉ROS系统的同学优先考虑,请发送个人介绍和简历到liuy856@mail.sysu.edu.cn。
全年招收科研实习生,并提供生活补贴。
团队拥有充足计算资源和机器人硬件设备,支撑具身智能体的软硬件研发。
招生要求:对科研有浓厚兴趣,基础扎实,具备独立思考能力,自驱力强,品行端正,身心健康的学生。
研究领域:
具身智能:三维空间推理、视觉语言导航、机器人操控、机器人系统集成
多模态推理:数学题推理、视觉问答、医学报告生成
因果推理:因果表征学习、因果强化学习、反事实推理



获奖及荣誉:
广东省学位与研究生教育学会优秀教学成果二等奖(1/2),2025
电子工业出版社年度优秀作者(1/1),2024
广东省第三届计算机科学青年学术秀一等奖(1/1),2023
中国软件大会机器人大模型与具身智能挑战赛优胜奖(指导老师),2023
科研项目:
1. 国家自然科学基金面上项目,2026.01-2029.12,主持
2. 国家自然科学基金青年项目(C类),2021.01-2023.12,主持
3. 国家自然科学基金重点项目,2025.01-2029.12,课题负责人
4. 鹏城国家实验室“揭榜挂帅”项目,2025.05-2026.05,主持
5. 华为技术合作项目,2025.06-2026.06,联合主持
6. 广东省自然科学基金面上项目,2025.01-2027.12,主持
7. 广东省自然科学基金面上项目, 2023.01-2025.12,主持
8. 广东省自然科学基金面上项目,2021.01-2023.12,主持
9. 广州市科技计划项目,2023.04-2025.04,主持
10. 博士后自然科学基金面上项目,2020.08-2022.08,主持
教授课程:
高等代数(本科生核心课程、专业必修课)
多模态大模型原理与应用(本科生专业选修课,课程负责人)
生成式人工智能(研究生专业选修课)
主要学术兼职:
Pattern Recognition 期刊 副编辑(Associate Editor)
Embodied Intelligence and Robotics 期刊 副编辑(Associate Editor)
Embodied Intelligence 期刊 副编辑(Associate Editor)
中国图学学会高级会员
中国图学学会 可视化与认知计算专委会 委员
中国自动化学会 具身智能专委会 委员
中国图象图形学学会 多媒体专委会 委员
中国图象图形学学会 视觉大数据专委会 委员
中国指挥与控制学会 具身智能专委会 首届委员
ACM广州分会执行委员会 委员
广东省图象图形学会青工委委员
广东省图象图形学会视觉专委会 副秘书长
代表性论著:
[英文专著-26] Liang Lin, Yang Liu; Multimodal Large Models: A New Paradigm of Artificial Intelligence, Springer, 2026.
[中文专著-24] 刘阳, 林倞;《多模态大模型:新一代人工智能技术范式》,电子工业出版社,2024. [畅销书,销量过万]
[ROBOT-25] 刘阳,柏永杰,林倞,面向人机物高效融合与协作的具身智能技术体系,机器人,2025. [中国科协学术年会论文]
[CVPR-26] Yongjie Bai#, Zhouxia Wang#, Yang Liu*, Kaijun Luo, Yifan Wen, Mingtong Dai, Weixing Chen, Ziliang Chen, Mingtong Dai, Yongsen Zheng, Lingbo Liu, Guanbin Li, Liang Lin; Learning to See and Act: Task-Aware Virtual View Exploration for Robotic Manipulation, IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2026. [CCF-A]
[CVPR-26] Yuwei Ning, Ganlong Zhao, Yipeng Qin, Si Liu, Yang Liu, Liang Lin, Guanbin Li ; LookasideVLN: Direction-Aware Aerial Vision-and-Language Navigation, IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2026. [CCF-A]
[CVPR-26] Ziliang Chen, Tianang Xiao, Jusheng Zhang, Yongsen Zheng, Yang Liu, Zhao-Rong Lai, Liang Lin; A Causal Marriage between VLM and IRM from Understanding to Reasoning, IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2026. [CCF-A]
[ACM MM-25] Zeming Wei#, Junyi Lin#, Yang Liu*, Weixing Chen, Jingzhou Luo, Guanbin Li, Liang Lin; 3DAffordSplat: Efficient Affordance Reasoning with 3D Gaussians, ACM International Conference on Multimedia (ACM MM), 2025. [CCF-A] [Oral]
[ICCV-25] Kaixuan Jiang, Yang Liu*, Weixing Chen, Jingzhou Luo, Ziliang Chen, Ling Pan, Guanbin Li, Liang Lin; Beyond the Destination: A Novel Benchmark for Exploration-Aware Embodied Question Answering, IEEE/CVF International Conference on Computer Vision (ICCV), 2025. [CCF-A]
[CVPR-25] Xinshuai Song#, Weixing Chen#, Yang Liu*, Weikai Chen, Guanbin Li, Liang Lin; Towards Long-Horizon Vision-Language Navigation: Platform, Benchmark and Method, IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2025. [CCF-A]
[CVPR-25] Jingzhou Luo, Yang Liu*, Weixing Chen, Zhen Li, Yaowei Wang, Guanbin Li, Liang Lin; DSPNet: Dual-vision Scene Perception for Robust 3D Question Answering, IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2025. [CCF-A]
[CVPR-25] Weixing Chen, Yang Liu*, Binglin Chen, Jiandong Su, Yongsen Zheng, Liang Lin; Cross-modal Causal Relation Alignment for Video Question Grounding, IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2025. [CCF-A] [Highlight]
[TMECH-25] Yang Liu, Weixing Chen, Yongjie Bai, Xiaodan Liang, Guanbin Li, Wen Gao, Liang Lin; Aligning cyber space with physical world: A comprehensive survey on embodied ai, IEEE/ASME Transactions on Mechatronics (TMECH), 2025. [中科院一区] [期刊Popular榜单Top-2] [ESI高被引]
[TKDE-25] Yang Liu, Binglin Chen, Yongsen Zheng, Lechao Cheng, Guanbin Li, Liang Lin, ODMixer: Fine-grained Spatial-temporal MLP for Metro Origin-Destination Prediction, IEEE Transactions on Knowledge and Data Engineering (TKDE), 2025. [CCF-A] [ESI高被引]
[TIP-25] Weixing Chen, Yang Liu*, Ce Wang, Jiarui Zhu, Guanbin Li, Cheng-Lin Liu, Liang Lin, Cross-Modal Causal Intervention for Radiology Report Generation, IEEE Transactions on Image Processing (TIP), 2025. [CCF-A] ] [ESI高被引]
[TPAMI-23] Yang Liu, Guanbin Li, Liang Lin; Cross-Modal Causal Relational Reasoning for Event-Level Visual Question Answering, IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023. [CCF-A] [ESI高被引]
[ICCV-23] Hong Yan, Yang Liu*, Yushen Wei, Zhen Li, Guanbin Li, Liang Lin; SkeletonMAE: Graph-based Masked Autoencoder for Skeleton Sequence Pre-training, IEEE/CVF International Conference on Computer Vision (ICCV), 2023. [CCF-A]
[TII-23] Yuying Zhu, Yang Zhang, Lingbo Liu, Yang Liu*, Guanbin Li, Mingzhi Mao, Liang Lin; Hybrid-Order Representation Learning for Electricity Theft Detection, IEEE Transactions on Industrial Informatics (TII), 2023. [中科院一区TOP]
[INS-23] Kuo Wang, Lingbo Liu, Yang Liu*, Guanbin Li, Liang Lin; Urban Regional Function Guided Traffic Flow Prediction, Information Sciences (INS), 2023. [中科院一区TOP]
[ACM MM-23] Yushen Wei#, Yang Liu#, Hong Yan, Guanbin Li, Liang Lin; Visual Causal Scene Refinement for Video Question Answering; ACM International Conference on Multimedia (ACM MM), 2023. [CCF-A] [Oral]
[IJCAI-23] Junfan Lin, Yuying Zhu, Lingbo Liu, Yang Liu*; Guanbin Li, Liang Lin; DenseLight: Efficient Control for Large-scale Traffic Signals with Dense Feedback, International Joint Conference on Artificial Intelligence (IJCAI), 2023. [CCF-A]
[TIP-22] Yang Liu, Keze Wang, Lingbo Liu, Haoyuan Lan, Liang Lin; TCGL: Temporal Contrastive Graph for Self-supervised Video Representation Learning, IEEE Transactions on Image Processing (TIP), 2022. [CCF-A][ESI高被引]
[TIP-21] Yang Liu, Keze Wang, Guanbin Li, Liang Lin; Semantics-aware Adaptive Knowledge Distillation for Sensor-to-Vision Action Recognition, IEEE Transactions on Image Processing (TIP), 2021. [CCF-A]
[TIP-20] Yang Liu, Zhaoyang Lu, Jing Li, Tao Yang, Chao Yao, Deep Image-to-Video Adaptation and Fusion Networks for Action Recognition, IEEE Transactions on Image Processing (TIP), 2020. [CCF-A]
[TCSVT-19] Yang Liu, Zhaoyang Lu, Jing Li, Tao Yang; Hierarchically Learned View-Invariant Representations for Cross View Action Recognition, IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2019. [中科院一区TOP]
[MIR-22] Yang Liu, Yushen Wei, Hong Yan, Guanbin Li, Liang Lin; Causal Reasoning Meets Visual Representation Learning: A Prospective Study, Machine Intelligence Research (MIR), 2022. [中科院一区]



