Associate Professor

Xin Yu

Email:: xin.yu@uq.edu.au

Positions

Affiliate of ARC COE for Children and Families Over the Lifecourse: ARC COE for Children and Families Over the Lifecourse; Faculty of Humanities, Arts and Social Sciences

Affiliate of Centre for Enterprise AI: Centre for Enterprise AI; Faculty of Engineering, Architecture and Information Technology

Honorary Associate Professor: School of Electrical Engineering and Computer Science; Faculty of Engineering, Architecture and Information Technology

Background

My name is Xin Yu, an Associate Professor at the University of Queensland. I am an Australian Research Council Discovery Early Career Researcher Award 2023-2025 (DECRA) recipient and an awardee of the prestigious Google Research Scholar Program in 2021. I am also a Google Visiting Faculty. Previously, I was a research fellow at the Australian National University (ANU). I received my PhD degree from the Australian National Unversity under the supervision of Prof. Richard Hartley, Prof. Fatih Porikli and Dr. Basura Fernando. I also received a PhD degree from Tsinghua University supervised by Prof. Li Zhang. I am interested in Computer Vision and Machine Learning topics.

My research topics includes various computer vision and machine learning tasks, especially in efficient low-level image processing, image retrieval and localization, action recognition, 3D pose estimation, visual navigation and sign language recognition and translation.

Availability

Associate Professor Xin Yu is:: Available for supervision

Research impacts

One of my research papers has been awarded "Best Paper Honorable Mention" award in the premium computer vision conference WACV 2020, and one paper has been nominated for the Best Paper Award in CVPR 2020.

I was awarded the Outstanding Reviewer Award in ECCV 2020, CVPR 2021 and ICCV 2021. CVPR, ICCV and ECCV are internationally world-leading computer vision and machine learning conferences. My research interests include deep learning techniques, image processing, and computer vision tasks. I am a program committee member of top-tier computer vision and machine learning conferences, such as CVPR, ICCV, ECCV, ICML, ICLR and NeurIPS, and a reviewer of prestigious journals, such as TPAMI, IJCV and TIP.

I am happy to supervise self-motivated PhD and MPhil students. If you are an undergraduate student and willing to conduct your honour project, please drop me an email.

Search Professor Xin Yu’s works on UQ eSpace

179 works between 2011 and 2026

All (179) Journal Article (60) Conference Publication (117) Book Chapter (2)

2024

Conference Publication

MM-WLAuslan: multi-view multi-modal word-level Australian Sign Language recognition dataset

Shen, Xin, Du, Heming, Sheng, Hongwei, Wang, Shuyun, Chen, Hui, Chen, Huiqiang, Wu, Zhuojie, Du, Xiaobiao, Ying, Jiaying, Lu, Ruihan, Xu, Qingzheng and Yu, Xin (2024). MM-WLAuslan: multi-view multi-modal word-level Australian Sign Language recognition dataset. 38th Annual Conference on Neural Information Processing Systems (NeurIPS 2024), Vancouver, BC, Canada, 10-15 December 2024. San Mateo, CA, United States: Morgan Kaufmann Publishers. doi: 10.52202/079017-2227

MM-WLAuslan: multi-view multi-modal word-level Australian Sign Language recognition dataset

2023

Journal Article

Calligraphy Font generation via explicitly modeling location-aware glyph component deformations

Zhao, Minda, Qi, Xingqun, Hu, Zhipeng, Li, Lincheng, Zhang, Yongqiang, Huang, Zi and Yu, Xin (2023). Calligraphy Font generation via explicitly modeling location-aware glyph component deformations. IEEE Transactions on Multimedia, 26, 5939-5950. doi: 10.1109/tmm.2023.3342690

Calligraphy Font generation via explicitly modeling location-aware glyph component deformations

2023

Journal Article

DMMG: Dual min-max games for self-supervised skeleton-based action recognition

Guan, Shannan, Yu, Xin, Huang, Wei, Fang, Gengfa and Lu, Haiyan (2023). DMMG: Dual min-max games for self-supervised skeleton-based action recognition. IEEE Transactions on Image Processing, 33, 395-407. doi: 10.1109/tip.2023.3338410

DMMG: Dual min-max games for self-supervised skeleton-based action recognition

2023

Conference Publication

Learning efficient unsupervised satellite image-based building damage detection

Zhang, Yiyun, Wang, Zijian, Luo, Yadan, Yu, Xin and Huang, Zi (2023). Learning efficient unsupervised satellite image-based building damage detection. 2023 IEEE International Conference on Data Mining (ICDM), Shanghai, China, 1-4 December 2023. Piscataway, NJ, United States: IEEE. doi: 10.1109/icdm58522.2023.00206

Learning efficient unsupervised satellite image-based building damage detection

2023

Conference Publication

Context-based masking for spontaneous venous pulsations detection

Sheng, Hongwei, Yu, Xin, Li, Xue and Golzan, Mojtaba (2023). Context-based masking for spontaneous venous pulsations detection. 36th Australasian Joint Conference on Artificial Intelligence, AJCAI 2023, Brisbane, QLD Australia, 28 November –1 December 2023. Singapore: Springer. doi: 10.1007/978-981-99-8388-9_42

Context-based masking for spontaneous venous pulsations detection

2023

Conference Publication

A new perspective of weakly supervised 3D instance segmentation via bounding boxes

Yu, Qingtao, Du, Heming and Yu, Xin (2023). A new perspective of weakly supervised 3D instance segmentation via bounding boxes. 36th Australasian Joint Conference on Artificial Intelligence, AJCAI 2023, Brisbane, QLD Australia, 28 November –1 December 2023. Singapore: Springer. doi: 10.1007/978-981-99-8388-9_9

A new perspective of weakly supervised 3D instance segmentation via bounding boxes

2023

Conference Publication

Toward a unified framework for RGB and RGB-D visual navigation

Du, Heming, Huang, Zi, Chapman, Scott and Yu, Xin (2023). Toward a unified framework for RGB and RGB-D visual navigation. 36th Australasian Joint Conference on Artificial Intelligence, AJCAI 2023, Brisbane, QLD Australia, 28 November –1 December 2023. Singapore: Springer. doi: 10.1007/978-981-99-8391-9_29

Toward a unified framework for RGB and RGB-D visual navigation

2023

Conference Publication

Towards reliable and efficient vegetation segmentation for Australian wheat data analysis

Yuan, Bowen, Wang, Zijian and Yu, Xin (2023). Towards reliable and efficient vegetation segmentation for Australian wheat data analysis. 34th Australasian Database Conference (ADC), Melbourne, NSW Australia, 1-3 November 2023. Cham, Switzerland: Springer Cham. doi: 10.1007/978-3-031-47843-7_9

Towards reliable and efficient vegetation segmentation for Australian wheat data analysis

2023

Conference Publication

Audio-visual segmentation by exploring cross-modal mutual semantics

Liu, Chen, Li, Peike Patrick, Qi, Xingqun, Zhang, Hu, Li, Lincheng, Wang, Dadong and Yu, Xin (2023). Audio-visual segmentation by exploring cross-modal mutual semantics. MM '23: The 31st ACM International Conference on Multimedia, Ottawa, ON Canada, 29 October - 3 November 2023. New York, NY United States: Association for Computing Machinery. doi: 10.1145/3581783.3612373

Audio-visual segmentation by exploring cross-modal mutual semantics

2023

Conference Publication

DyGait: exploiting dynamic representations for high-performance gait recognition

Wang, Ming, Guo, Xianda, Lin, Beibei, Yang, Tian, Zhu, Zheng, Li, Lincheng, Zhang, Shunli and Yu, Xin (2023). DyGait: exploiting dynamic representations for high-performance gait recognition. IEEE/CVF International Conference on Computer Vision (ICCV), Paris, France, 2-6 October 2023. Piscataway, NJ, United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/iccv51070.2023.01235

DyGait: exploiting dynamic representations for high-performance gait recognition

2023

Conference Publication

Gait recognition with mask-based regularization

Shen, Chuanfu, Lin, Beibei, Zhang, Shunli, Yu, Xin, Huang, George Q. and Yu, Shiqi (2023). Gait recognition with mask-based regularization. IEEE International Joint Conference on Biometrics (IJCB), Ljubljana, Slovenia, 25-28 September 2023. New York, NY, United States: IEEE. doi: 10.1109/ijcb57857.2023.10449112

Gait recognition with mask-based regularization

2023

Journal Article

Deep idempotent network for efficient single image blind deblurring

Mao, Yuxin, Wan, Zhexiong, Dai, Yuchao and Yu, Xin (2023). Deep idempotent network for efficient single image blind deblurring. IEEE Transactions on Circuits and Systems for Video Technology, 33 (1), 172-185. doi: 10.1109/tcsvt.2022.3202361

Deep idempotent network for efficient single image blind deblurring

2023

Conference Publication

Diverse 3D Hand Gesture Prediction from Body Dynamics by Bilateral Hand Disentanglement

Qi, Xingqun, Liu, Chen, Sun, Muyi, Li, Lincheng, Fan, Changjie and Yu, Xin (2023). Diverse 3D Hand Gesture Prediction from Body Dynamics by Bilateral Hand Disentanglement. 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, Canada, 17-24 June 2023. Piscataway, NJ United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/cvpr52729.2023.00448

Diverse 3D Hand Gesture Prediction from Body Dynamics by Bilateral Hand Disentanglement

2023

Conference Publication

Autonomous stabilization of retinal videos for streamlining assessment of spontaneous venous pulsations

Sheng, Hongwei, Yu, Xin, Wang, Feiyu, Khan, MD Wahiduzzaman, Weng, Hexuan, Shariflou, Sahar and Golzan, S. Mojtaba (2023). Autonomous stabilization of retinal videos for streamlining assessment of spontaneous venous pulsations. 45th Annual International Conference of the IEEE-Engineering-in-Medicine-and-Biology-Society (EMBC), Sydney, NSW, Australia, 24-27 July 2023. Piscataway, NJ, United States: IEEE. doi: 10.1109/embc40787.2023.10341088

Autonomous stabilization of retinal videos for streamlining assessment of spontaneous venous pulsations

2023

Conference Publication

FlowFace: Semantic Flow-Guided Shape-Aware Face Swapping

Zeng, Hao, Zhang, Wei, Fan, Changjie, Lv, Tangjie, Wang, Suzhen, Zhang, Zhimeng, Ma, Bowen, Li, Lincheng, Ding, Yu and Yu, Xin (2023). FlowFace: Semantic Flow-Guided Shape-Aware Face Swapping. Thirty-Seventh AAAI Conference on Artificial Intelligence, Washington, DC United States, 7–14 February 2023. Washington, DC United States: Association for the Advancement of Artificial Intelligence. doi: 10.1609/aaai.v37i3.25444

FlowFace: Semantic Flow-Guided Shape-Aware Face Swapping

2023

Conference Publication

NeFII: Inverse Rendering for Reflectance Decomposition with Near-Field Indirect Illumination

Wu, Haoqian, Hu, Zhipeng, Li, Lincheng, Zhang, Yongqiang, Fan, Changjie and Yu, Xin (2023). NeFII: Inverse Rendering for Reflectance Decomposition with Near-Field Indirect Illumination. 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, Canada, 17-24 June 2023. Piscataway, NJ United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/cvpr52729.2023.00418

NeFII: Inverse Rendering for Reflectance Decomposition with Near-Field Indirect Illumination

2023

Conference Publication

Object-goal visual navigation via effective exploration of relations among historical navigation states

Du, Heming, Li, Lincheng, Huang, Zi and Yu, Xin (2023). Object-goal visual navigation via effective exploration of relations among historical navigation states. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, BC, Canada, 17-24 June 2023. Piscataway, NJ, United States: IEEE. doi: 10.1109/cvpr52729.2023.00252

Object-goal visual navigation via effective exploration of relations among historical navigation states

2023

Conference Publication

Exploring active 3D object detection from a generalization perspective

Luo, Yadan, Chen, Zhuoxiao, Wang, Zijian, Yu, Xin, Huang, Zi and Baktashmotlagh, Mahsa (2023). Exploring active 3D object detection from a generalization perspective. 11th International Conference on Learning Representations (ICLR), Kigali, Rwanda, 1 - 5 May 2023. New York, NY, United States: Cornell Tech. doi: 10.48550/arXiv.2301.09249

Exploring active 3D object detection from a generalization perspective

2023

Conference Publication

A divide-and-conquer solution to 3D human motion estimation from raw MoCap data

Tang, Jilin, Li, Lincheng, Hou, Jie, Xin, Haoran and Yu, Xin (2023). A divide-and-conquer solution to 3D human motion estimation from raw MoCap data. 30th IEEE Conference Virtual Reality and 3D User Interfaces (IEEE VR), Shanghai, China, 25-29 March 2023. Piscataway, NJ United States: IEEE. doi: 10.1109/vrw58643.2023.00226

A divide-and-conquer solution to 3D human motion estimation from raw MoCap data

2023

Journal Article

Accurate 3-DoF camera geo-localization via ground-to-satellite image matching

Shi, Yujiao, Yu, Xin, Liu, Liu, Campbell, Dylan, Koniusz, Piotr and Li, Hongdong (2023). Accurate 3-DoF camera geo-localization via ground-to-satellite image matching. IEEE transactions on pattern analysis and machine intelligence, 45 (3), 2682-2697. doi: 10.1109/TPAMI.2022.3189702

Accurate 3-DoF camera geo-localization via ground-to-satellite image matching

Current funding

2025 - 2026

Creation of an interactive online seaweed production map to support policy-making for the Indonesian seaweed industry

KONEKSI Environment and Climate Change Extension Support

Open grant
2024 - 2027

AI-Empowered and Video-Based uplift of Paralympic classification systems (AQIRP project administered by Follow Me AI)

Follow Me AI Pty Ltd

Open grant
2023 - 2028

Breaking the Communication Barrier for the Australian Deaf Community: Vision Based Australian Sign Language Translation and Production

Google Asia Pacific Pte Ltd

Open grant
2023 - 2028

ARC Research Hub to Advance Timber for Australia's Future Built Environment

ARC Industrial Transformation Research Hubs

Open grant
2023 - 2027

Analytics for the Australian Grains Industry (AAGI)

Grains Research & Development Corporation

Open grant

Past funding

2024

Breaking the Communication Barrier for the Australian Deaf Community: Vision Based Australian Sign Language Translation and Production

Google Inc

Open grant
2023 - 2024

Developing applications of satellite imagery for modelling environmental and social impacts of climate change on seaweed farming in Indonesia (KONEKSI Grant administered by Griffith University)

Griffith University

Open grant
2023 - 2025

Advancing Human Perception: Countering Evolving Malicious Fake Visual Data

ARC Discovery Early Career Researcher Award

Open grant
2023 - 2025

Two-way Auslan: Automatic Machine Translation of Australian Sign Language (ARC Discovery Project administered by ANU)

The Australian National University

Open grant

Availability

Associate Professor Xin Yu is:: Available for supervision

Looking for a supervisor? Read our advice on how to choose a supervisor.

Supervision history

Current supervision

Doctor Philosophy

Understanding Human Intention and Performance

Principal Advisor

Other advisors: Associate Professor Sen Wang
Doctor Philosophy

Integrating Deep Learning and Remote Sensing for Precision Agriculture in Staple Crops

Principal Advisor

Other advisors: Dr Miao Xu
Doctor Philosophy

Understanding Human Movements and Sport Performance Analysis

Principal Advisor

Other advisors: Dr Miao Xu
Doctor Philosophy

Combating evolving deceptive fake visual information through deepfake detection

Principal Advisor

Other advisors: Dr Miao Xu
Doctor Philosophy

Human Posture Recognition Applied to Physical Activity

Principal Advisor

Other advisors: Professor Sean Tweedy
Doctor Philosophy

Towards Comprehensive Australian Sign Language Understanding: Datasets, Methodologies, and Systems

Principal Advisor

Other advisors: Professor Helen Huang, Dr Heming Du
Doctor Philosophy

Human Understanding in Sports

Principal Advisor

Other advisors: Associate Professor Sen Wang, Dr Heming Du
Doctor Philosophy

Compressed Video Restoration

Principal Advisor

Other advisors: Dr Miao Xu, Dr Heming Du
Doctor Philosophy

Understanding Human Intention and Performance

Principal Advisor

Other advisors: Dr Heming Du, Dr Miao Xu
Doctor Philosophy

Towards Data-Driven Analysis of Handheld Fundus Videos

Principal Advisor

Other advisors: Associate Professor Mahsa Baktashmotlagh, Dr Heming Du
Doctor Philosophy

Effective Visual Data Compression

Associate Advisor

Other advisors: Associate Professor Sen Wang, Dr Heming Du
Doctor Philosophy

Pose Estimation for Human with Disabilities

Associate Advisor

Other advisors: Dr Heming Du
Doctor Philosophy

Multimodal foundation model design and analysis

Associate Advisor

Other advisors: Dr Miao Xu, Dr Heming Du
Doctor Philosophy

Enhancing Robustness and Generalizability in Computational Models

Associate Advisor

Other advisors: Associate Professor Mahsa Baktashmotlagh
Doctor Philosophy

The Unlabeled Truth: Rethinking Medical Imaging Supervision for Foundation Models in the Wild

Associate Advisor

Other advisors: Associate Professor Mahsa Baktashmotlagh, Dr Heming Du
Doctor Philosophy

Towards knowledge discovery from imperfect and evolving data

Associate Advisor

Other advisors: Dr Miao Xu
Doctor Philosophy

Remote Sensing Analysis in computer vision

Associate Advisor

Other advisors: Professor Helen Huang
Doctor Philosophy

Data driven approaches for smart farming

Associate Advisor

Other advisors: Professor Helen Huang

Completed supervision

2026

Doctor Philosophy

Object-Centric Audio-Visual Alignment for Sounding Source Segmentation

Principal Advisor

Other advisors: Associate Professor Sen Wang

Enquiries

For media enquiries about Associate Professor Xin Yu's areas of expertise, story ideas and help finding experts, contact our Media team:

communications@uq.edu.au

External profiles

Personal links

Update my profile

Xin Yu

Overview

Background

Availability

Research impacts

Works

MM-WLAuslan: multi-view multi-modal word-level Australian Sign Language recognition dataset

Calligraphy Font generation via explicitly modeling location-aware glyph component deformations

DMMG: Dual min-max games for self-supervised skeleton-based action recognition

Learning efficient unsupervised satellite image-based building damage detection

Context-based masking for&nbsp;spontaneous venous pulsations detection

A new perspective of&nbsp;weakly supervised 3D instance segmentation via&nbsp;bounding boxes

Toward a&nbsp;unified framework for&nbsp;RGB and&nbsp;RGB-D visual navigation

Towards reliable and efficient vegetation segmentation for Australian wheat data analysis

Audio-visual segmentation by exploring cross-modal mutual semantics

DyGait: exploiting dynamic representations for high-performance gait recognition

Gait recognition with mask-based regularization

Deep idempotent network for efficient single image blind deblurring

Diverse 3D Hand Gesture Prediction from Body Dynamics by Bilateral Hand Disentanglement

Autonomous stabilization of retinal videos for streamlining assessment of spontaneous venous pulsations

FlowFace: Semantic Flow-Guided Shape-Aware Face Swapping

NeFII: Inverse Rendering for Reflectance Decomposition with Near-Field Indirect Illumination

Object-goal visual navigation via effective exploration of relations among historical navigation states

Exploring active 3D object detection from a generalization perspective

A divide-and-conquer solution to 3D human motion estimation from raw MoCap data

Accurate 3-DoF camera geo-localization via ground-to-satellite image matching

Funding

Current funding

Past funding

Supervision

Availability

Supervision history

Current supervision

Understanding Human Intention and Performance

Integrating Deep Learning and Remote Sensing for Precision Agriculture in Staple Crops

Understanding Human Movements and Sport Performance Analysis

Combating evolving deceptive fake visual information through deepfake detection

Human Posture Recognition Applied to Physical Activity

Towards Comprehensive Australian Sign Language Understanding: Datasets, Methodologies, and Systems

Human Understanding in Sports

Compressed Video Restoration

Understanding Human Intention and Performance

Towards Data-Driven Analysis of Handheld Fundus Videos

Effective Visual Data Compression

Pose Estimation for Human with Disabilities

Multimodal foundation model design and analysis

Enhancing Robustness and Generalizability in Computational Models

The Unlabeled Truth: Rethinking Medical Imaging Supervision for Foundation Models in the Wild

Towards knowledge discovery from imperfect and evolving data

Remote Sensing Analysis in computer vision

Data driven approaches for smart farming

Completed supervision

Object-Centric Audio-Visual Alignment for Sounding Source Segmentation

Media

Enquiries

Context-based masking for spontaneous venous pulsations detection

A new perspective of weakly supervised 3D instance segmentation via bounding boxes

Toward a unified framework for RGB and RGB-D visual navigation