📝 Summary

Conference: CVPR/ICCV/ECCV (7/3/3), NeurIPS/ICML/ICLR (2/1/3), AAAI (2), ACM MM (1)

Journal: TPAMI (2), IJCV (1), SCIENTIA SINICA Informationis (1), IEEE Transactions (4)

Award: ESI Hot Cited Paper (1), ESI Highly Cited Paper (7), Most Influential Paper (2)

( * indicates equal contribution, indicates corresponding author, # indicates project lead)

📝 LLMs and AI Agent

Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training
Gen Luo*, Xue Yang*, Wenhan Dou*, Zhaokai Wang*, Jifeng Dai, Yu Qiao, Xizhou Zhu
citations
[project page]
[解读]

Towards Vision-Language Geo-Foundation Model: A Survey
Yue Zhou, Litong Feng, Yiping Ke, Xue Jiang, Junchi Yan, Xue Yang, Wayne Zhang
citations
[Awesome-VLGFM]
[解读]

Auto MC-Reward: Automated Dense Reward Design with Large Language Models for Minecraft
Hao Li*, Xue Yang*, Zhaokai Wang*, Xizhou Zhu, Jie Zhou, Yu Qiao, Xiaogang Wang, Hongsheng Li, Lewei Lu, Jifeng Dai
In Proceedings of the IEEE Computer Vision and Pattern Recognition (CVPR, CCF-A), Seattle WA, USA, 2024
citations
[project page]

InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language
Zhaoyang Liu*, Yinan He*, Wenhai Wang*, Weiyun Wang*, Yi Wang*, Shoufa Chen*, Qinglong Zhang*, Yang Yang, Qingyun Li, Jiashuo Yu, Kunchang Li, Zhe Chen, Xue Yang, Xizhou Zhu, Yali Wang, Limin Wang, Ping Luo, Jifeng Dai, Yu Qiao
citations
[InternGPT]
[Demo]
📝 PEFT

5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks
Dongshuo Yin*, Leiyi Hu*, Bin Li, Youqun Zhang, Xue Yang
citations
[Mona-PyTorch]

FLoRA: Low-Rank Core Space for N-dimension
Chongjie Si*, Xuehui Wang*, Xue Yang, Zhengqin Xu, Qingyun Li, Jifeng Dai, Yu Qiao, Xiaokang Yang, Wei Shen
citations
[FLoRA-PyTorch]
[解读]
📝 Oriented Object Detection

PointOBB-v2: Towards Simpler, Faster, and Stronger Single Point Supervised Oriented Object Detection
Botao Ren*, Xue Yang*, Yi Yu*, Junwei Luo, Zhidong Deng
citations
[PointOBB-v2-PyTorch]

STAR: A First-Ever Dataset and A Large-Scale Benchmark for Scene Graph Generation in Large-Size Satellite Imagery
Yansheng Li, Linlin Wang, Tingzhu Wang, Xue Yang, Junwei Luo, Qi Wang, Youming Deng, Wenbin Wang, Xian Sun, Haifeng Li, Bo Dang, Yongjun Zhang, Yi Yu, Junchi Yan
citations
[SGG-ToolKit], [STAR-MMRotate], [STAR-MMDetection]
[解读]
[project page]

Theoretically Achieving Continuous Representation of Oriented Bounding Boxes
Zikai Xiao, Guoye Yang, Xue Yang, Taijiang Mu, Junchi Yan, Shimin Hu
In Proceedings of the IEEE Computer Vision and Pattern Recognition (CVPR, CCF-A), Seattle WA, USA, 2024
citations
[JDet]

Point2RBox: Combine Knowledge from Synthetic Visual Patterns for End-to-end Oriented Object Detection with Single Point Supervision
Yi Yu*, Xue Yang*, Qingyun Li, Feipeng Da, Jifeng Dai, Yu Qiao, Junchi Yan
In Proceedings of the IEEE Computer Vision and Pattern Recognition (CVPR, CCF-A), Seattle WA, USA, 2024
citations
[Point2RBox-MMRotate]
[解读]

PointOBB: Learning Oriented Object Detection via Single Point Supervision
Junwei Luo, Xue Yang#, Yu Yi, Qingyun Li, Junchi Yan, Yansheng Li
In Proceedings of the IEEE Computer Vision and Pattern Recognition (CVPR, CCF-A), Seattle WA, USA, 2024
citations
[PointOBB-Pytorch]
[解读]

ARS-DETR: Aspect Ratio-Sensitive Detection Transformer for Aerial Oriented Object Detection
Ying Zeng, Yushi Chen, Xue Yang, Qingyun Li, Junchi Yan
IEEE Transactions on Geoscience and Remote Sensing (TGRS, CCF-B), 2024
citations
[ARS-DETR-PyTorch]

AlphaRotate: A Rotation Detection Benchmark using TensorFlow
Xue Yang, Yue Zhou, Wenlong Liao, Tao He, Junchi Yan
In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP, CCF-B), Seoul, Korea, 2024
citations
[docs]
[AlphaRotate-TF]

P2RBox: Point Prompt Oriented Object Detection with SAM
Guangming Cao*, Xuehui Yu*, Wenwen Yu, Xumeng Han, Xue Yang, Guorong Li, Jianbin Jiao, Zhenjun Han
citations

H2RBox-v2: Incorporating Symmetry for Boosting Horizontal Box Supervised Oriented Object Detection
Yi Yu*, Xue Yang*, Qingyun Li, Yue Zhou, Gefan Zhang, Feipeng Da, Junchi Yan
Advances in Neural Information Processing Systems (NeurIPS, CCF-A), New Orleans, Louisiana, USA, 2023
citations
[H2RBox-v2-MMRotate]
[解读]

G-Rep: Gaussian Representation for Arbitrary-Oriented Object Detection
Liping Hou, Ke Lu, Xue Yang, Yuqiu Li, Jian Xue
In Remote Sensing, 2023
citations
[paper]
[G-Rep-PyTorch]

H2RBox: Horizontal Box Annotation is All You Need for Oriented Object Detection
Xue Yang, Gefan Zhang, Wentng Li, Xuehui Wang, Yue Zhou, Junchi Yan
In International Conference on Learning Representations (ICLR, Tsinghua-A), Kigali, Rwanda, 2023
citations
[H2RBox-PyTorch], [H2RBox-MMRotate]
[H2RBox-Jittor], [H2RBox-JDet]
[解读], [知乎视频解读]
[B站视频解读]

The KFIoU Loss for Rotated Object Detection
Xue Yang, Yue Zhou, Gefan Zhang, Jirui Yang, Wentao Wang, Junchi Yan, Xiaopeng Zhang, Qi Tian
In International Conference on Learning Representations (ICLR, Tsinghua-A), Kigali, Rwanda, 2023
citations
[KFIoU-TF], [KFIoU-PyTorch], [KFIoU-Jittor]
[解读]

Task Interleaving and Orientation Estimation for High-Precision Oriented Object Detection in Aerial Images
Qi Ming, Lingjuan Miao, Zhiqiang Zhou, Junjie Song, Yunpeng Dong, Xue Yang
ISPRS Journal of Photogrammetry and Remote Sensing (ISPRS), 2023
citations
[paper]
[TIOE-PyTorch]

PVT-SAR: An Arbitrarily Oriented SAR Ship Detector with Pyramid Vision Transformer
Yue Zhou, Xue Jiang, Guozheng Xu, Xue Yang, Xingzhao Liu
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing (JSTARS), 2022
citations
[paper]

Detecting Rotated Objects as Gaussian Distributions and Its 3-D Generalization
Xue Yang, Gefan Zhang, Xiaojiang Yang, Yue Zhou, Wentao Wang, Jin Tang, Tao He, Junchi Yan
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI, CCF-A), 2022
citations
[AlphaRotate], [MMRotate], [mmdet3d-gaussian], [JDet]
[GWD解读], [KLD解读], [知乎视频解读]
[视频解读]

MMRotate: A Rotated Object Detection Benchmark using PyTorch
Yue Zhou*, Xue Yang*, Gefan Zhang, Jiabao Wang, Yanyi Liu, Liping Hou, Xue Jiang, Xingzhao Liu, Junchi Yan, Chengqi Lyu, Wenwei Zhang, Kai Chen
In Proceedings of the 30th ACM International Conference on Multimedia (ACM MM, CCF-A), Lisboa, Portugal, Open Source Software Competition, Oral, 2022
citations
[MMRotate]
[Video]

RSDet++: Point-based Modulated Loss for More Accurate Rotated Object Detection
Wen Qian, Xue Yang, Silong Peng, Junchi Yan, Xiujuan Zhang
IEEE Transactions on Circuits and Systems for Video Technology (TCSVT, CCF-B), 2022
citations
[RSDet++-TF]

SCRDet++: Detecting Small, Cluttered and Rotated Objects via Instance-Level Feature Denoising and Rotation Loss Smoothing
Xue Yang, Junchi Yan, Wenlong Liao, Xiaokang Yang, Jin Tang, Tao He
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI, CCF-A), 2022
citations
[IoU-Smooth L1 Loss-TF], [DOTA-DOAI]
[S2TLD]
[project page]

On the Arbitrary-Oriented Object Detection: Classification based Approaches Revisited
Xue Yang, Junchi Yan
International Journal of Computer Vision (IJCV, CCF-A), 2022
citations
[CSL-TF], [DCL-TF], [OHDet-TF]
[OHD-SJTU]
[project page]

Learning High-Precision Bounding Box for Rotated Object Detection via Kullback-Leibler Divergence
Xue Yang, Xiaojiang Yang, Jirui Yang, Qi Ming, Wentao Wang, Qi Tian, Junchi Yan
Advances in Neural Information Processing Systems (NeurIPS, CCF-A), Virtual, 2021
citations
[KLD-TF], [KLD-PyTorch], [KLD-Jittor]
[slides], [poster]
[解读], [知乎视频解读]
[视频解读]

Optimization for Arbitrary-Oriented Object Detection via Representation Invariance Loss
Qi Ming, Lingjuan Miao, Zhiqiang Zhou, Xue Yang, Yunpeng Dong
IEEE Geoscience and Remote Sensing Letters (GRSL, CCF-C), 2021
citations
[RIDet-TF], [RIDet-PyTorch]

Sparse Label Assignment for Oriented Object Detection in Aerial Images
Qi Ming, Lingjuan Miao, Zhiqiang Zhou, Junjie Song, Xue Yang
In Remote Sensing, 2021
citations
[paper]
[SLA-PyTorch]

Rethinking Rotated Object Detection with Gaussian Wasserstein Distance Loss
Xue Yang, Junchi Yan, Qi Ming, Wentao Wang, Xiaopeng Zhang, Qi Tian
In International Conference on Machine Learning (ICML, CCF-A), Virtual, 2021
citations
[GWD-TF], [GWD-PyTorch], [GWD-Jittor]
[slides], [poster]
[解读], [知乎视频解读]
[视频解读]

Dense Label Encoding for Boundary Discontinuity Free Rotation Detection
Xue Yang, Liping Hou, Yue Zhou, Wentao Wang, Junchi Yan
In Proceedings of the IEEE Computer Vision and Pattern Recognition (CVPR, CCF-A), Virtual, 2021
citations
[RetinaNet-DCL-TF], [R3Det-DCL-TF]
[slides], [poster]
[解读]

Learning Modulated Loss for Rotated Object Detection
Wen Qian, Xue Yang, Silong Peng, Junchi Yan, Yue Guo
In Proceedings of the Thirty-Five AAAI Conference on Artificial Intelligence (AAAI, CCF-A), Vancouver, Canada (Virtual), 2021
citations
[RSDet-TF] [RSDet-Jittor]
[slides], [poster]
[解读]

R3Det: Refined Single-Stage Detector with Feature Refinement for Rotating Object
Xue Yang, Junchi Yan, Ziming Feng, Tao He
In Proceedings of the Thirty-Five AAAI Conference on Artificial Intelligence (AAAI, CCF-A), Vancouver, Canada (Virtual), 2021
citations
[R3Det-TF], [R3Det-PyTorch]
[slides], [poster] [期刊版本《中国科学:信息科学》]

Arbitrary-Oriented Object Detection with Circular Smooth Label
Xue Yang, Junchi Yan
In Proceedings of the European Conference on Computer Vision (ECCV, CCF-B, Tsinghua-A), Glasgow, Scotland, UK (Virtual), 2020
citations
[CSL-RetinaNet-TF] [CSL-RetinaNet-PyTorch] [CSL-RetinaNet-Jittor]
[slides]
[解读]

SCRDet: Towards More Robust Detection for Small, Cluttered and Rotated Objects
Xue Yang, Jirui Yang, Junchi Yan, Yue Zhang, Tengfei Zhang, Zhi Guo, Sun Xian, Kun Fu
In Proceedings of the IEEE International Conference on Computer Vision (ICCV, CCF-A), Seoul, Korea, 2019
citations
[IoU-Smooth L1 Loss-TF], [R2CNN++-TF]
[poster]
[解读]

Position detection and direction prediction for arbitrary-oriented ships via multitask rotation region convolutional neural network
Xue Yang, Hao Sun, Xian Sun, Menglong Yan, Zhi Guo, Kun Fu
In IEEE Access, 2018
citations
[paper]
[R2CNN_HEAD_FPN-TF]

Object Detection With Head Direction in Remote Sensing Images Based on Rotational Region CNN
Xue Yang, Kun Fu, Hao Sun, Xian Sun, Menglong Yan, Wenhui Diao, Zhi Guo
In IEEE International Geoscience and Remote Sensing Symposium (IGARSS), 2018
citations
[paper]

Automatic Ship Detection in Remote Sensing Images from Google Earth of Complex Scenes Based on Multiscale Rotation Dense Feature Pyramid Networks
Xue Yang, Hao Sun, Kun Fu, Jirui Yang, Xian Sun, Menglong Yan, Zhi Guo
In Remote Sensing, 2018
citations
[R-DFPN-TF]
📝 Object Detection, Instance Segmentation and Beyond

Parameter-Inverted Image Pyramid Networks
Xizhou Zhu*, Xue Yang*, Zhaokai Wang*, Hao Li, Wenhan Dou, Junqi Ge, Lewei Lu, Yu Qiao, Jifeng Dai
Advances in Neural Information Processing Systems (NeurIPS, CCF-A), Vancouver, Canada, Spotlight, 2024
citations
[PIIP-PyTorch]
[解读]

Drones Help Drones: A Collaborative Framework for Multi-Drone Object Trajectory Prediction and Beyond
Zhechao Wang, Peirui Cheng, Mingxin Chen, Pengju Tian, Zhirui Wang, Xinming Li, Xue Yang, Xian Sun
Advances in Neural Information Processing Systems (NeurIPS, CCF-A), Vancouver, Canada, 2024
citations
[DHD-PyTorch]

Toward Open Vocabulary Aerial Object Detection with CLIP-Activated Student-Teacher Learning
Yan Li, Weiwei Guo, Xue Yang, Ning Liao, Dunyun He, Jiaqi Zhou, Wenxian Yu
In Proceedings of the European Conference on Computer Vision (ECCV, CCF-B, Tsinghua-A), MiCo Milano, Italy, 2024
citations
[CastDet-PyTorch]

E2E-MFD: Towards End-to-End Synchronous Multimodal Fusion Detection
Jiaqing Zhang, Mingxiang Cao, Jie Lei, Weiying Xie, Daixun Li, Wenbo Huang, Yunsong Li, Xue Yang
Advances in Neural Information Processing Systems (NeurIPS, CCF-A), Vancouver, Canada, Oral, 2024
citations
[E2E-MFD-PyTorch]

PatchDCT: Patch Refinement for High Quality Instance Segmentation
Qinrou Wen, Jirui Yang, Xue Yang, Kewei Liang
In International Conference on Learning Representations (ICLR, Tsinghua-A), Kigali, Rwanda, 2023
citations
[PatchDCT-PyTorch]
[解读], [知乎视频解读]
[B站视频解读]

Rethinking Classification and Localization for Cascade R-CNN
Ang Li, Xue Yang, Chongyang Zhang
In Proceedings of the 30th British Machine Vision Conference (BMVC, CCF-C), Cardiff, Wales, UK, 2019
citations

A Densely Connected End-to-End Neural Network for Multiscale and Multiscene SAR Ship Detection
Jiao Jiao, Yue Zhang, Hao Sun, Xue Yang, Xun Gao, Wen Hong, Kun Fu, Xian Sun
In IEEE Access, 2018
citations
[paper]
📝 Scene Text Detection and Recognition

Bridging Synthetic and Real Worlds for Pre-training Scene Text Detectors
Tongkun Guan, Wei Shen, Xue Yang, Xuehui Wang, Xiaokang Yang
In Proceedings of the European Conference on Computer Vision (ECCV, CCF-B, Tsinghua-A), MiCo Milano, Italy, 2024
citations
[FreeReal-PyTorch]

Self-supervised Character-to-Character Distillation for Text Recognition
Tongkun Guan, Wei Shen, Xue Yang, Qi Feng, Zekun Jiang, Xiaokang Yang
In Proceedings of the IEEE International Conference on Computer Vision (ICCV, CCF-A), Paris, France, 2023
citations
[CCD-PyTorch]
[解读]

Self-supervised Implicit Glyph Attention for Text Recognition
Tongkun Guan, Chaochen Gu, Jingzheng Tu, Xue Yang, Qi Feng, Yudi Zhao, Wei Shen
In Proceedings of the IEEE Computer Vision and Pattern Recognition (CVPR, CCF-A), Vancouver, Canada, 2023
citations
[SIGA-PyTorch]
[解读]
📝 Low-Level Vision

Dual-path Image Inpainting with Auxiliary GAN Inversion
Wentao Wang, Li Niu, Jianfu Zhang, Xue Yang, Liqing Zhang
In Proceedings of the IEEE Computer Vision and Pattern Recognition (CVPR, CCF-A), New Orleans, Louisiana, USA, 2022
citations
[paper], [poster]

Parallel Multi-Resolution Fusion Network for Image Inpainting
Wentao Wang, Jianfu Zhang, Li Niu, Haoyu Ling, Xue Yang, Liqing Zhang
In Proceedings of the IEEE International Conference on Computer Vision (ICCV, CCF-A), Virtual, 2021
citations
[paper], [poster]