Xue Yang (杨学)



🇨🇳 Biography

Xue Yang is a Researcher at OpenGVLab, Shanghai AI Laboratory, collaborating with Prof. Jifeng Dai and Dr. Xizhou Zhu. His research interests include Deep Learning and Computer Vision, with a focus on Generic/Oriented Object Detection/Instance Segmentation, AI Agents, and Vision-Language Models.

Xue Yang received the B.E. degree from the School of Information Science and Engineering, Central South University, Hunan, China, in 2016. He received the M.S. degree from the School of Electronic, Electrical and Communication Engineering, University of Chinese Academy of Sciences, Beijing, China, in 2019. He obtained the Ph.D. degree from the Wu Honor Class (吴文俊人工智能博士班), Department of Computer Science and Engineering, Shanghai Jiao Tong University, Shanghai, China, in 2023. His research advisor is Prof. Junchi Yan.

Xue Yang has published about 50 papers at top-tier international CV/ML/AI conferences and journals, such as TPAMI, IJCV, CVPR, ECCV, ICCV, ICML, NeurIPS, ICLR, AAAI and ACM MM. He is also the leading contributor to the MMRotate, AlphaRotate and JDet open-source projects for oriented object detection, which have 8000+ stars on GitHub. Xue Yang won the SJTU Outstanding Doctoral Dissertation (2023), CCF Outstanding Doctoral Dissertation Award (2023), CCF-CV Academic Emerging Scholar (2022), Shanghai Outstanding Graduates (2023), Doctoral National Scholarship (2021/2022), and SJTU Scholar Star Nomination Award (2021), and was also selected for the World's Top 2% Scientists List (2023-2024).

I will be joining the Department of Automation, Shanghai Jiao Tong University, as an Assistant Professor in the spring of 2025. I am looking for self-motivated students (Master 2025 spring & fall, Ph.D. 2026 spring & fall) and interns/visitors to join ReThinkLab, with the goal of doing impactful work on Computer Vision, Vision-Language Models, Remote Sensing (AI4RS), etc. Please do not hesitate to contact me via email.

🔥 News

📝 Recent Works [Full List]

( * indicates equal contribution, indicates corresponding author, # indicates project lead)

Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training
Gen Luo*, Xue Yang*, Wenhan Dou*, Zhaokai Wang*, Jifeng Dai, Yu Qiao, Xizhou Zhu

PointOBB-v2: Towards Simpler, Faster, and Stronger Single Point Supervised Oriented Object Detection
Botao Ren*, Xue Yang*, Yi Yu*, Junwei Luo, Zhidong Deng

5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks
Dongshuo Yin*, Leiyi Hu*, Bin Li, Youqun Zhang, Xue Yang

Towards Vision-Language Geo-Foundation Model: A Survey
Yue Zhou, Litong Feng, Yiping Ke, Xue Jiang, Junchi Yan, Xue Yang, Wayne Zhang

STAR: A First-Ever Dataset and A Large-Scale Benchmark for Scene Graph Generation in Large-Size Satellite Imagery
Yansheng Li, Linlin Wang, Tingzhu Wang, Xue Yang, Junwei Luo, Qi Wang, Youming Deng, Wenbin Wang, Xian Sun, Haifeng Li, Bo Dang, Yongjun Zhang, Yi Yu, Junchi Yan

Parameter-Inverted Image Pyramid Networks
Xizhou Zhu*, Xue Yang*, Zhaokai Wang*, Hao Li, Wenhan Dou, Junqi Ge, Lewei Lu, Yu Qiao, Jifeng Dai
Advances in Neural Information Processing Systems (NeurIPS, CCF-A), Vancouver, Canada, Spotlight, 2024

Drones Help Drones: A Collaborative Framework for Multi-Drone Object Trajectory Prediction and Beyond
Zhechao Wang, Peirui Cheng, Mingxin Chen, Pengju Tian, Zhirui Wang, Xinming Li, Xue Yang, Xian Sun
Advances in Neural Information Processing Systems (NeurIPS, CCF-A), Vancouver, Canada, 2024

Toward Open Vocabulary Aerial Object Detection with CLIP-Activated Student-Teacher Learning
Yan Li, Weiwei Guo, Xue Yang, Ning Liao, Dunyun He, Jiaqi Zhou, Wenxian Yu
In Proceedings of the European Conference on Computer Vision (ECCV, CCF-B, Tsinghua-A), MiCo Milano, Italy, 2024

E2E-MFD: Towards End-to-End Synchronous Multimodal Fusion Detection
Jiaqing Zhang, Mingxiang Cao, Xue Yang, Weiying Xie, Jie Lei, Daixun Li, Geng Yang, Wenbo Huang, Yunsong Li
Advances in Neural Information Processing Systems (NeurIPS, CCF-A), Vancouver, Canada, Oral, 2024
📝 LLMs and AI Agent [Full List]

Auto MC-Reward: Automated Dense Reward Design with Large Language Models for Minecraft
Hao Li*, Xue Yang*, Zhaokai Wang*, Xizhou Zhu, Jie Zhou, Yu Qiao, Xiaogang Wang, Hongsheng Li, Lewei Lu, Jifeng Dai
In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR, CCF-A), Seattle WA, USA, 2024

InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language
Zhaoyang Liu*, Yinan He*, Wenhai Wang*, Weiyun Wang*, Yi Wang*, Shoufa Chen*, Qinglong Zhang*, Yang Yang, Qingyun Li, Jiashuo Yu, Kunchang Li, Zhe Chen, Xue Yang, Xizhou Zhu, Yali Wang, Limin Wang, Ping Luo, Jifeng Dai, Yu Qiao
📝 PEFT

FLoRA: Low-Rank Core Space for N-dimension
Chongjie Si*, Xuehui Wang*, Xue Yang, Zhengqin Xu, Qingyun Li, Jifeng Dai, Yu Qiao, Xiaokang Yang, Wei Shen
📝 Oriented Object Detection [Full List]

Point2RBox: Combine Knowledge from Synthetic Visual Patterns for End-to-end Oriented Object Detection with Single Point Supervision
Yi Yu*, Xue Yang*, Qingyun Li, Feipeng Da, Jifeng Dai, Yu Qiao, Junchi Yan
In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR, CCF-A), Seattle WA, USA, 2024

AlphaRotate: A Rotation Detection Benchmark using TensorFlow
Xue Yang, Yue Zhou, Wenlong Liao, Tao He, Junchi Yan
In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP, CCF-B), Seoul, Korea, 2024

H2RBox-v2: Incorporating Symmetry for Boosting Horizontal Box Supervised Oriented Object Detection
Yi Yu*, Xue Yang*, Qingyun Li, Yue Zhou, Gefan Zhang, Feipeng Da, Junchi Yan
Advances in Neural Information Processing Systems (NeurIPS, CCF-A), New Orleans, Louisiana, USA, 2023

H2RBox: Horizontal Box Annotation is All You Need for Oriented Object Detection
Xue Yang, Gefan Zhang, Wentong Li, Xuehui Wang, Yue Zhou, Junchi Yan
In International Conference on Learning Representations (ICLR, Tsinghua-A), Kigali, Rwanda, 2023

The KFIoU Loss for Rotated Object Detection
Xue Yang, Yue Zhou, Gefan Zhang, Jirui Yang, Wentao Wang, Junchi Yan, Xiaopeng Zhang, Qi Tian
In International Conference on Learning Representations (ICLR, Tsinghua-A), Kigali, Rwanda, 2023

Detecting Rotated Objects as Gaussian Distributions and Its 3-D Generalization
Xue Yang, Gefan Zhang, Xiaojiang Yang, Yue Zhou, Wentao Wang, Jin Tang, Tao He, Junchi Yan
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI, CCF-A), 2022

MMRotate: A Rotated Object Detection Benchmark using PyTorch
Yue Zhou*, Xue Yang*, Gefan Zhang, Jiabao Wang, Yanyi Liu, Liping Hou, Xue Jiang, Xingzhao Liu, Junchi Yan, Chengqi Lyu, Wenwei Zhang, Kai Chen
In Proceedings of the 30th ACM International Conference on Multimedia (ACM MM, CCF-A), Lisboa, Portugal, Open Source Software Competition, Oral, 2022

SCRDet++: Detecting Small, Cluttered and Rotated Objects via Instance-Level Feature Denoising and Rotation Loss Smoothing
Xue Yang, Junchi Yan, Wenlong Liao, Xiaokang Yang, Jin Tang, Tao He
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI, CCF-A), 2022

On the Arbitrary-Oriented Object Detection: Classification based Approaches Revisited
Xue Yang, Junchi Yan
International Journal of Computer Vision (IJCV, CCF-A), 2022

Learning High-Precision Bounding Box for Rotated Object Detection via Kullback-Leibler Divergence
Xue Yang, Xiaojiang Yang, Jirui Yang, Qi Ming, Wentao Wang, Qi Tian, Junchi Yan
Advances in Neural Information Processing Systems (NeurIPS, CCF-A), Virtual, 2021

Rethinking Rotated Object Detection with Gaussian Wasserstein Distance Loss
Xue Yang, Junchi Yan, Qi Ming, Wentao Wang, Xiaopeng Zhang, Qi Tian
In International Conference on Machine Learning (ICML, CCF-A), Virtual, 2021

Dense Label Encoding for Boundary Discontinuity Free Rotation Detection
Xue Yang, Liping Hou, Yue Zhou, Wentao Wang, Junchi Yan
In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR, CCF-A), Virtual, 2021

Learning Modulated Loss for Rotated Object Detection
Wen Qian, Xue Yang, Silong Peng, Junchi Yan, Yue Guo
In Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence (AAAI, CCF-A), Vancouver, Canada (Virtual), 2021

R3Det: Refined Single-Stage Detector with Feature Refinement for Rotating Object
Xue Yang, Junchi Yan, Ziming Feng, Tao He
In Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence (AAAI, CCF-A), Vancouver, Canada (Virtual), 2021

Arbitrary-Oriented Object Detection with Circular Smooth Label
Xue Yang, Junchi Yan
In Proceedings of the European Conference on Computer Vision (ECCV, CCF-B, Tsinghua-A), Glasgow, Scotland, UK (Virtual), 2020

SCRDet: Towards More Robust Detection for Small, Cluttered and Rotated Objects
Xue Yang, Jirui Yang, Junchi Yan, Yue Zhang, Tengfei Zhang, Zhi Guo, Xian Sun, Kun Fu
In Proceedings of the IEEE International Conference on Computer Vision (ICCV, CCF-A), Seoul, Korea, 2019

Automatic Ship Detection in Remote Sensing Images from Google Earth of Complex Scenes Based on Multiscale Rotation Dense Feature Pyramid Networks
Xue Yang, Hao Sun, Kun Fu, Jirui Yang, Xian Sun, Menglong Yan, Zhi Guo
In Remote Sensing, 2018
📝 Object Detection, Instance Segmentation and Beyond [Full List]

PatchDCT: Patch Refinement for High Quality Instance Segmentation
Qinrou Wen, Jirui Yang, Xue Yang, Kewei Liang
In International Conference on Learning Representations (ICLR, Tsinghua-A), Kigali, Rwanda, 2023
📝 Scene Text Detection and Recognition [Full List]

Bridging Synthetic and Real Worlds for Pre-training Scene Text Detectors
Tongkun Guan, Wei Shen, Xue Yang, Xuehui Wang, Xiaokang Yang
In Proceedings of the European Conference on Computer Vision (ECCV, CCF-B, Tsinghua-A), MiCo Milano, Italy, 2024

Self-supervised Character-to-Character Distillation for Text Recognition
Tongkun Guan, Wei Shen, Xue Yang, Qi Feng, Zekun Jiang, Xiaokang Yang
In Proceedings of the IEEE International Conference on Computer Vision (ICCV, CCF-A), Paris, France, 2023

Self-supervised Implicit Glyph Attention for Text Recognition
Tongkun Guan, Chaochen Gu, Jingzheng Tu, Xue Yang, Qi Feng, Yudi Zhao, Wei Shen
In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR, CCF-A), Vancouver, Canada, 2023
📝 Low-Level Vision [Full List]

Dual-path Image Inpainting with Auxiliary GAN Inversion
Wentao Wang, Li Niu, Jianfu Zhang, Xue Yang, Liqing Zhang
In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR, CCF-A), New Orleans, Louisiana, USA, 2022

Parallel Multi-Resolution Fusion Network for Image Inpainting
Wentao Wang, Jianfu Zhang, Li Niu, Haoyu Ling, Xue Yang, Liqing Zhang
In Proceedings of the IEEE International Conference on Computer Vision (ICCV, CCF-A), Virtual, 2021
📚 Academic Activities

Conference Area Chair/Reviewer
  • IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Reviewer: 2021-2024
  • IEEE/CVF International Conference on Computer Vision (ICCV). Reviewer: 2023
  • European Conference on Computer Vision (ECCV). Reviewer: 2022/2024
  • Asian Conference on Computer Vision (ACCV). Reviewer: 2024
  • Neural Information Processing Systems (NeurIPS). Reviewer: 2022-2024
  • International Conference on Machine Learning (ICML). Reviewer: 2022-2024
  • International Conference on Learning Representations (ICLR). AC: 2025; Reviewer: 2024
  • AAAI Conference on Artificial Intelligence (AAAI). Reviewer: 2022-2025
  • ACM International Conference on Multimedia (ACM MM). Reviewer: 2021-2024
Journal Reviewer
  • IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
  • International Journal of Computer Vision (IJCV)
  • IEEE Transactions on Image Processing (TIP)
  • Pattern Recognition (PR)
  • IEEE Transactions on Neural Networks and Learning Systems (TNNLS)
  • IEEE Transactions on Multimedia (TMM)
  • IEEE Transactions on Circuits and Systems for Video Technology (TCSVT)
  • IEEE Transactions on Geoscience and Remote Sensing (TGRS)
  • IEEE Geoscience and Remote Sensing Letters (GRSL)
  • IEEE Transactions on Intelligent Transportation Systems (TITS)
  • ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM)
  • Remote Sensing
Tech. Talks
🎓 Education

B.E. degree from School of Information Science and Engineering, Central South University, Hunan, China
Sep. 2012 - July 2016

Ph.D. in CV, Department of Computer Science and Engineering, Shanghai Jiao Tong University, Shanghai, China
Sep. 2019 - July 2023
🧑🏻‍💻 Internship and Cooperation

                       
🎖 Awards

💻 Demos



👬 People

Close Collaborators
Yue Zhou (PostDoc., SJTU), Yi Yu (PostDoc., SEU), Gefan Zhang (Master, SJTU), Wentao Wang (Ph.D., SJTU), Jirui Yang (Master, UCAS), Qi Ming (Ph.D., BIT), Xuehui Wang (Ph.D., SJTU), Qingyun Li (Ph.D., HIT), Tongkun Guan (Ph.D., SJTU), Junwei Luo (Master, WHU), Wentong Li (Ph.D., ZJU), Jiaqing Zhang (Ph.D., Xidian), Zhaokai Wang (Ph.D., SJTU & PJLAB), Botao Ren (Ph.D., Tsinghua)
Students
Yifan Zhou (Master, SJTU, 2024-), Wei Zhang (Master, SJTU, 2024-), Mingxin Liu (Master, SJTU, 2024-)
Qipeng Liu (Ph.D., SJTU, 2024-)