Associate Professor

Xin Yu

Email:: xin.yu@uq.edu.au

Positions

Affiliate of ARC COE for Children and Families Over the Lifecourse: ARC COE for Children and Families Over the Lifecourse; Faculty of Humanities, Arts and Social Sciences

Affiliate of Centre for Enterprise AI: Centre for Enterprise AI; Faculty of Engineering, Architecture and Information Technology

Honorary Associate Professor: School of Electrical Engineering and Computer Science; Faculty of Engineering, Architecture and Information Technology

Background

My name is Xin Yu, an Associate Professor at the University of Queensland. I am an Australian Research Council Discovery Early Career Researcher Award 2023-2025 (DECRA) recipient and an awardee of the prestigious Google Research Scholar Program in 2021. I am also a Google Visiting Faculty. Previously, I was a research fellow at the Australian National University (ANU). I received my PhD degree from the Australian National Unversity under the supervision of Prof. Richard Hartley, Prof. Fatih Porikli and Dr. Basura Fernando. I also received a PhD degree from Tsinghua University supervised by Prof. Li Zhang. I am interested in Computer Vision and Machine Learning topics.

My research topics includes various computer vision and machine learning tasks, especially in efficient low-level image processing, image retrieval and localization, action recognition, 3D pose estimation, visual navigation and sign language recognition and translation.

Availability

Associate Professor Xin Yu is:: Available for supervision

Research impacts

One of my research papers has been awarded "Best Paper Honorable Mention" award in the premium computer vision conference WACV 2020, and one paper has been nominated for the Best Paper Award in CVPR 2020.

I was awarded the Outstanding Reviewer Award in ECCV 2020, CVPR 2021 and ICCV 2021. CVPR, ICCV and ECCV are internationally world-leading computer vision and machine learning conferences. My research interests include deep learning techniques, image processing, and computer vision tasks. I am a program committee member of top-tier computer vision and machine learning conferences, such as CVPR, ICCV, ECCV, ICML, ICLR and NeurIPS, and a reviewer of prestigious journals, such as TPAMI, IJCV and TIP.

I am happy to supervise self-motivated PhD and MPhil students. If you are an undergraduate student and willing to conduct your honour project, please drop me an email.

Search Professor Xin Yu’s works on UQ eSpace

182 works between 2011 and 2026

All (182) Journal Article (61) Conference Publication (119) Book Chapter (2)

2026

Conference Publication

Mebm: Exploring the Synergy of Mixture of Experts in Background Matting

Wang, Yiru, Lu, Ming, Tian, Senmao, Yu, Xin and Zhang, Shunli (2026). Mebm: Exploring the Synergy of Mixture of Experts in Background Matting. IEEE. doi: 10.1109/icassp55912.2026.11464270

Mebm: Exploring the Synergy of Mixture of Experts in Background Matting

2026

Conference Publication

Content-Aware Model Slimming for Image Super-Resolution with Large Input

Tian, Senmao, Hong, Gangyi, Wang, Shuyun, Yu, Xin and Zhang, Shunli (2026). Content-Aware Model Slimming for Image Super-Resolution with Large Input. IEEE. doi: 10.1109/icassp55912.2026.11461944

Content-Aware Model Slimming for Image Super-Resolution with Large Input

2026

Journal Article

DFBSNet: Dual frequency-domain branch fusion and selection network for hyperspectral anomaly detection

Yao, Yiming, Wang, Qing, Zhao, Dong, You, Mingtao, Xiang, Pei, Asano, Yuta, Yu, Xin, Wang, Chao, Zhou, Huixin and Ren, Jinchang (2026). DFBSNet: Dual frequency-domain branch fusion and selection network for hyperspectral anomaly detection. Pattern Recognition, 180 113967, 113967. doi: 10.1016/j.patcog.2026.113967

DFBSNet: Dual frequency-domain branch fusion and selection network for hyperspectral anomaly detection

2026

Journal Article

Cluster-aware prompt ensemble learning for few-shot vision-language model adaptation

Chen, Zhi, Yu, Xin, Tao, Xiaohui, Li, Yan and Huang, Zi (2026). Cluster-aware prompt ensemble learning for few-shot vision-language model adaptation. Pattern Recognition, 172 (C) 112596. doi: 10.1016/j.patcog.2025.112596

Cluster-aware prompt ensemble learning for few-shot vision-language model adaptation

2026

Journal Article

Compression-Oriented Video Super-Resolution

Wang, Shuyun, Liu, Yanbin, Lu, Ming, Wu, Zhuojie, Tian, Senmao, Guo, Yandong and Yu, Xin (2026). Compression-Oriented Video Super-Resolution. IEEE Transactions on Image Processing, PP (99), 1-1. doi: 10.1109/tip.2026.3682128

Compression-Oriented Video Super-Resolution

2026

Conference Publication

Augment to Segment: Tackling Pixel-Level Imbalance in Wheat Disease and Pest Segmentation

Wei, Tianqi, Yu, Xin, Chen, Zhi, Chapman, Scott and Huang, Zi (2026). Augment to Segment: Tackling Pixel-Level Imbalance in Wheat Disease and Pest Segmentation. Springer Science and Business Media Deutschland GmbH. doi: 10.1007/978-981-95-6196-4_3

Augment to Segment: Tackling Pixel-Level Imbalance in Wheat Disease and Pest Segmentation

2026

Journal Article

Safe and Reliable Diffusion Models via Subspace Projection

Chen, Huiqiang, Zhu, Tianqing, Wang, Linlin, Yu, Xin, Gao, Longxiang and Zhou, Wanlei (2026). Safe and Reliable Diffusion Models via Subspace Projection. IEEE Transactions on Dependable and Secure Computing, PP (99), 1-14. doi: 10.1109/TDSC.2026.3692493

Safe and Reliable Diffusion Models via Subspace Projection

2026

Journal Article

Distributed Zero-Shot Learning for Visual Recognition

Chen, Zhi, Luo, Yadan, Huang, Zi, Li, Jingjing, Wang, Sen and Yu, Xin (2026). Distributed Zero-Shot Learning for Visual Recognition. IEEE Transactions on Multimedia, PP (99), 1-12. doi: 10.1109/TMM.2026.3673561

Distributed Zero-Shot Learning for Visual Recognition

2026

Conference Publication

Dynamic Orchestration of Multi-agent System for Real-World Multi-image Agricultural VQA

Ke, Yan, Yu, Xin, Du, Heming, Chapman, Scott and Huang, Helen (2026). Dynamic Orchestration of Multi-agent System for Real-World Multi-image Agricultural VQA. Springer Science and Business Media Deutschland GmbH. doi: 10.1007/978-981-95-6196-4_11

Dynamic Orchestration of Multi-agent System for Real-World Multi-image Agricultural VQA

2026

Book Chapter

High-Resolution and Multimodal Optogenetic fMRI of Brain Dynamics

He, Yi, Yuan, Jianyu, Liang, Mingyao, Xie, Zeping and Yu, Xin (2026). High-Resolution and Multimodal Optogenetic fMRI of Brain Dynamics. Neuromethods. (pp. 3-18) New York, NY: Springer US. doi: 10.1007/978-1-0716-5178-0_1

High-Resolution and Multimodal Optogenetic fMRI of Brain Dynamics

2026

Journal Article

Preface

Liu, Miaomiao, Yu, Xin, Xu, Chang and Song, Yiliao (2026). Preface. Lecture Notes in Computer Science, 16370 LNAI, v-vi.

Preface

2025

Journal Article

Hyperspectral video object tracking with cross-modal spectral complementary and memory prompt network

Jiang, Wenhao, Zhao, Dong, Wang, Chen, Yu, Xin, Arun, Pattathal V., Asano, Yuta, Xiang, Pei and Zhou, Huixin (2025). Hyperspectral video object tracking with cross-modal spectral complementary and memory prompt network. Knowledge-Based Systems, 330 (Part B) 114595, 1-16. doi: 10.1016/j.knosys.2025.114595

Hyperspectral video object tracking with cross-modal spectral complementary and memory prompt network

2025

Journal Article

Analytical Survey of Learning with Low-Resource Data: From Analysis to Investigation

Cao, Xiaofeng, Xu, Mingwei, Yu, Xin, Yao, Jiangchao, Ye, Wei, Huang, Shengjun, Zhang, Minling, Tsang, Ivor, Ong, Yew-Soon, Kwok, James T. and Shen, Heng Tao (2025). Analytical Survey of Learning with Low-Resource Data: From Analysis to Investigation. ACM Computing Surveys, 58 (6) 3773075, 1-47. doi: 10.1145/3773075

Analytical Survey of Learning with Low-Resource Data: From Analysis to Investigation

2025

Conference Publication

3DRealCar: An In-the-Wild RGB-D Car Dataset with 360-Degree Views

Du, Xiaobiao, Wang, Yida, Sun, Haiyang, Wu, Zhuojie, Sheng, Hongwei, Wang, Shuyun, Ying, Jiaying, Lu, Ming, Zhu, Tianqing, Zhan, Kun and Yu, Xin (2025). 3DRealCar: An In-the-Wild RGB-D Car Dataset with 360-Degree Views. IEEE. doi: 10.1109/iccv51701.2025.02458

3DRealCar: An In-the-Wild RGB-D Car Dataset with 360-Degree Views

2025

Conference Publication

LDPose: Towards Inclusive Human Pose Estimation for Limb-Deficient Individuals in the Wild

Ying, Jiaying, Du, Heming, Zhang, Kaihao, Li, Lincheng and Yu, Xin (2025). LDPose: Towards Inclusive Human Pose Estimation for Limb-Deficient Individuals in the Wild. IEEE. doi: 10.1109/iccv51701.2025.00920

LDPose: Towards Inclusive Human Pose Estimation for Limb-Deficient Individuals in the Wild

2025

Conference Publication

Cross-View Isolated Sign Language Recognition via View Synthesis and Feature Disentanglement

Shen, Xin, Wang, Xinyu, Shen, Lei, Zhang, Kaihao and Yu, Xin (2025). Cross-View Isolated Sign Language Recognition via View Synthesis and Feature Disentanglement. IEEE. doi: 10.1109/iccv51701.2025.01920

Cross-View Isolated Sign Language Recognition via View Synthesis and Feature Disentanglement

2025

Conference Publication

Robust audio-visual segmentation via audio-guided visual convergent alignment

Liu, Chen, Li, Peike, Yang, Liying, Wang, Dadong, Li, Lincheng and Yu, Xin (2025). Robust audio-visual segmentation via audio-guided visual convergent alignment. 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN USA, 10-17 June 2025. Piscataway, NJ USA: Institute of Electrical and Electronics Engineers. doi: 10.1109/cvpr52734.2025.02693

Robust audio-visual segmentation via audio-guided visual convergent alignment

2025

Conference Publication

EasyCraft: a robust and efficient framework for automatic avatar crafting

Wang, Suzhen, Chen, Weijie, Zhang, Wei, Zhao, Minda, Li, Lincheng, Zhang, Rongsheng, Hu, Zhipeng and Yu, Xin (2025). EasyCraft: a robust and efficient framework for automatic avatar crafting. 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN USA, 10-17 June 2025. New York, NY USA: IEEE Computer Society. doi: 10.1109/CVPR52734.2025.00524

EasyCraft: a robust and efficient framework for automatic avatar crafting

2025

Conference Publication

Dynamic derivation and elimination: audio visual segmentation with enhanced audio semantics

Liu, Chen, Yang, Liying, Li, Peike, Wang, Dadong, Li, Lincheng and Yu, Xin (2025). Dynamic derivation and elimination: audio visual segmentation with enhanced audio semantics. 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN USA, 10-17 June 2025. Piscataway, NJ USA: Institute of Electrical and Electronics Engineers. doi: 10.1109/cvpr52734.2025.00298

Dynamic derivation and elimination: audio visual segmentation with enhanced audio semantics

2025

Conference Publication

Blind bitstream-corrupted video recovery via metadata-guided diffusion model

Wang, Shuyun, Zhang, Hu, Shen, Xin, Wang, Dadong and Yu, Xin (2025). Blind bitstream-corrupted video recovery via metadata-guided diffusion model. 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN USA, 10-17 June 2025. New York, NY USA: IEEE Computer Society. doi: 10.1109/CVPR52734.2025.02139

Blind bitstream-corrupted video recovery via metadata-guided diffusion model

Current funding

2025 - 2026

Creation of an interactive online seaweed production map to support policy-making for the Indonesian seaweed industry

KONEKSI Environment and Climate Change Extension Support

Open grant
2024 - 2027

AI-Empowered and Video-Based uplift of Paralympic classification systems (AQIRP project administered by Follow Me AI)

Follow Me AI Pty Ltd

Open grant
2023 - 2028

Breaking the Communication Barrier for the Australian Deaf Community: Vision Based Australian Sign Language Translation and Production

Google Asia Pacific Pte Ltd

Open grant
2023 - 2028

ARC Research Hub to Advance Timber for Australia's Future Built Environment

ARC Industrial Transformation Research Hubs

Open grant
2023 - 2027

Analytics for the Australian Grains Industry (AAGI)

Grains Research & Development Corporation

Open grant

Past funding

2024

Breaking the Communication Barrier for the Australian Deaf Community: Vision Based Australian Sign Language Translation and Production

Google Inc

Open grant
2023 - 2024

Developing applications of satellite imagery for modelling environmental and social impacts of climate change on seaweed farming in Indonesia (KONEKSI Grant administered by Griffith University)

Griffith University

Open grant
2023 - 2025

Advancing Human Perception: Countering Evolving Malicious Fake Visual Data

ARC Discovery Early Career Researcher Award

Open grant
2023 - 2025

Two-way Auslan: Automatic Machine Translation of Australian Sign Language (ARC Discovery Project administered by ANU)

The Australian National University

Open grant

Availability

Associate Professor Xin Yu is:: Available for supervision

Looking for a supervisor? Read our advice on how to choose a supervisor.

Supervision history

Current supervision

Doctor Philosophy

Compressed Video Restoration

Principal Advisor

Other advisors: Dr Miao Xu, Dr Heming Du
Doctor Philosophy

Understanding Human Intention and Performance

Principal Advisor

Other advisors: Dr Heming Du, Dr Miao Xu
Doctor Philosophy

Integrating Deep Learning and Remote Sensing for Precision Agriculture in Staple Crops

Principal Advisor

Other advisors: Dr Miao Xu
Doctor Philosophy

Human Understanding in Sports

Principal Advisor

Other advisors: Associate Professor Sen Wang, Dr Heming Du
Doctor Philosophy

Towards Comprehensive Australian Sign Language Understanding: Datasets, Methodologies, and Systems

Principal Advisor

Other advisors: Professor Helen Huang, Dr Heming Du
Doctor Philosophy

Human Posture Recognition Applied to Physical Activity

Principal Advisor

Other advisors: Professor Sean Tweedy
Doctor Philosophy

Combating evolving deceptive fake visual information through deepfake detection

Principal Advisor

Other advisors: Dr Miao Xu
Doctor Philosophy

Understanding Human Movements and Sport Performance Analysis

Principal Advisor

Other advisors: Dr Miao Xu
Doctor Philosophy

Understanding Human Intention and Performance

Principal Advisor

Other advisors: Associate Professor Sen Wang
Doctor Philosophy

Towards knowledge discovery from imperfect and evolving data

Associate Advisor

Other advisors: Dr Miao Xu
Doctor Philosophy

Effective Visual Data Compression

Associate Advisor

Other advisors: Associate Professor Sen Wang, Dr Heming Du
Doctor Philosophy

Pose Estimation for Human with Disabilities

Associate Advisor

Other advisors: Dr Heming Du
Doctor Philosophy

The Unlabeled Truth: Rethinking Medical Imaging Supervision for Foundation Models in the Wild

Associate Advisor

Other advisors: Associate Professor Mahsa Baktashmotlagh, Dr Heming Du
Doctor Philosophy

Remote Sensing Analysis in computer vision

Associate Advisor

Other advisors: Professor Helen Huang
Doctor Philosophy

Data driven approaches for smart farming

Associate Advisor

Other advisors: Professor Helen Huang
Doctor Philosophy

Multimodal foundation model design and analysis

Associate Advisor

Other advisors: Dr Miao Xu, Dr Heming Du
Doctor Philosophy

Enhancing Robustness and Generalizability in Computational Models

Associate Advisor

Other advisors: Associate Professor Mahsa Baktashmotlagh

Completed supervision

2026

Doctor Philosophy

Towards Data-Driven Analysis of Handheld Fundus Videos

Principal Advisor

Other advisors: Associate Professor Mahsa Baktashmotlagh, Dr Heming Du
2026

Doctor Philosophy

Object-Centric Audio-Visual Alignment for Sounding Source Segmentation

Principal Advisor

Other advisors: Associate Professor Sen Wang

Enquiries

For media enquiries about Associate Professor Xin Yu's areas of expertise, story ideas and help finding experts, contact our Media team:

communications@uq.edu.au

External profiles

Personal links

Update my profile

Xin Yu

Overview

Background

Availability

Research impacts

Works

Mebm: Exploring the Synergy of Mixture of Experts in Background Matting

Content-Aware Model Slimming for Image Super-Resolution with Large Input

DFBSNet: Dual frequency-domain branch fusion and selection network for hyperspectral anomaly detection

Cluster-aware prompt ensemble learning for few-shot vision-language model adaptation

Compression-Oriented Video Super-Resolution

Augment to Segment: Tackling Pixel-Level Imbalance in Wheat Disease and Pest Segmentation

Safe and Reliable Diffusion Models via Subspace Projection

Distributed Zero-Shot Learning for Visual Recognition

Dynamic Orchestration of Multi-agent System for Real-World Multi-image Agricultural VQA

High-Resolution and Multimodal Optogenetic fMRI of Brain Dynamics

Preface

Hyperspectral video object tracking with cross-modal spectral complementary and memory prompt network

Analytical Survey of Learning with Low-Resource Data: From Analysis to Investigation

3DRealCar: An In-the-Wild RGB-D Car Dataset with 360-Degree Views

LDPose: Towards Inclusive Human Pose Estimation for Limb-Deficient Individuals in the Wild

Cross-View Isolated Sign Language Recognition via View Synthesis and Feature Disentanglement

Robust audio-visual segmentation via audio-guided visual convergent alignment

EasyCraft: a robust and efficient framework for automatic avatar crafting

Dynamic derivation and elimination: audio visual segmentation with enhanced audio semantics

Blind bitstream-corrupted video recovery via metadata-guided diffusion model

Funding

Current funding

Past funding

Supervision

Availability

Supervision history

Current supervision

Compressed Video Restoration

Understanding Human Intention and Performance

Integrating Deep Learning and Remote Sensing for Precision Agriculture in Staple Crops

Human Understanding in Sports

Towards Comprehensive Australian Sign Language Understanding: Datasets, Methodologies, and Systems

Human Posture Recognition Applied to Physical Activity

Combating evolving deceptive fake visual information through deepfake detection

Understanding Human Movements and Sport Performance Analysis

Understanding Human Intention and Performance

Towards knowledge discovery from imperfect and evolving data

Effective Visual Data Compression

Pose Estimation for Human with Disabilities

The Unlabeled Truth: Rethinking Medical Imaging Supervision for Foundation Models in the Wild

Remote Sensing Analysis in computer vision

Data driven approaches for smart farming

Multimodal foundation model design and analysis

Enhancing Robustness and Generalizability in Computational Models

Completed supervision

Towards Data-Driven Analysis of Handheld Fundus Videos

Object-Centric Audio-Visual Alignment for Sounding Source Segmentation

Media

Enquiries