Skip to menu Skip to content Skip to footer
Dr Xin Yu
Dr

Xin Yu

Email: 

Overview

Background

My name is Xin Yu, a Senior Lecturer at the University of Queensland. I am an Australian Research Council Discovery Early Career Researcher Award 2023-2025 (DECRA) recipient and an awardee of the prestigious Google Research Scholar Program in 2021. I am also a Google Visiting Faculty. Previously, I was a research fellow at the Australian National University (ANU). I received my PhD degree from the Australian National Unversity under the supervision of Prof. Richard Hartley, Prof. Fatih Porikli and Dr. Basura Fernando. I also received a PhD degree from Tsinghua University supervised by Prof. Li Zhang. I am interested in Computer Vision and Machine Learning topics.

My research topics includes various computer vision and machine learning tasks, especially in efficient low-level image processing, image retrieval and localization, action recognition, 3D pose estimation, visual navigation and sign language recognition and translation.

Availability

Dr Xin Yu is:
Available for supervision

Research impacts

One of my research papers has been awarded "Best Paper Honorable Mention" award in the premium computer vision conference WACV 2020, and one paper has been nominated for the Best Paper Award in CVPR 2020.

I was awarded the Outstanding Reviewer Award in ECCV 2020, CVPR 2021 and ICCV 2021. CVPR, ICCV and ECCV are internationally world-leading computer vision and machine learning conferences. My research interests include deep learning techniques, image processing, and computer vision tasks. I am a program committee member of top-tier computer vision and machine learning conferences, such as CVPR, ICCV, ECCV, ICML, ICLR and NeurIPS, and a reviewer of prestigious journals, such as TPAMI, IJCV and TIP.

I am happy to supervise self-motivated PhD and MPhil students. If you are an undergraduate student and willing to conduct your honour project, please drop me an email.

Works

Search Professor Xin Yu’s works on UQ eSpace

165 works between 2011 and 2025

61 - 80 of 165 works

2023

Conference Publication

Autonomous stabilization of retinal videos for streamlining assessment of spontaneous venous pulsations

Sheng, Hongwei, Yu, Xin, Wang, Feiyu, Khan, MD Wahiduzzaman, Weng, Hexuan, Shariflou, Sahar and Golzan, S. Mojtaba (2023). Autonomous stabilization of retinal videos for streamlining assessment of spontaneous venous pulsations. 45th Annual International Conference of the IEEE-Engineering-in-Medicine-and-Biology-Society (EMBC), Sydney, NSW, Australia, 24-27 July 2023. Piscataway, NJ, United States: IEEE. doi: 10.1109/embc40787.2023.10341088

Autonomous stabilization of retinal videos for streamlining assessment of spontaneous venous pulsations

2023

Conference Publication

FlowFace: Semantic Flow-Guided Shape-Aware Face Swapping

Zeng, Hao, Zhang, Wei, Fan, Changjie, Lv, Tangjie, Wang, Suzhen, Zhang, Zhimeng, Ma, Bowen, Li, Lincheng, Ding, Yu and Yu, Xin (2023). FlowFace: Semantic Flow-Guided Shape-Aware Face Swapping. Thirty-Seventh AAAI Conference on Artificial Intelligence, Washington, DC United States, 7–14 February 2023. Washington, DC United States: Association for the Advancement of Artificial Intelligence. doi: 10.1609/aaai.v37i3.25444

FlowFace: Semantic Flow-Guided Shape-Aware Face Swapping

2023

Conference Publication

Object-goal visual navigation via effective exploration of relations among historical navigation states

Du, Heming, Li, Lincheng, Huang, Zi and Yu, Xin (2023). Object-goal visual navigation via effective exploration of relations among historical navigation states. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, BC, Canada, 17-24 June 2023. Piscataway, NJ, United States: IEEE. doi: 10.1109/cvpr52729.2023.00252

Object-goal visual navigation via effective exploration of relations among historical navigation states

2023

Conference Publication

NeFII: Inverse Rendering for Reflectance Decomposition with Near-Field Indirect Illumination

Wu, Haoqian, Hu, Zhipeng, Li, Lincheng, Zhang, Yongqiang, Fan, Changjie and Yu, Xin (2023). NeFII: Inverse Rendering for Reflectance Decomposition with Near-Field Indirect Illumination. 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, Canada, 17-24 June 2023. Piscataway, NJ United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/cvpr52729.2023.00418

NeFII: Inverse Rendering for Reflectance Decomposition with Near-Field Indirect Illumination

2023

Conference Publication

Exploring active 3D object detection from a generalization perspective

Luo, Yadan, Chen, Zhuoxiao, Wang, Zijian, Yu, Xin, Huang, Zi and Baktashmotlagh, Mahsa (2023). Exploring active 3D object detection from a generalization perspective. 11th International Conference on Learning Representations (ICLR), Kigali, Rwanda, 1 - 5 May 2023. New York, NY, United States: Cornell Tech. doi: 10.48550/arXiv.2301.09249

Exploring active 3D object detection from a generalization perspective

2023

Conference Publication

A divide-and-conquer solution to 3D human motion estimation from raw MoCap data

Tang, Jilin, Li, Lincheng, Hou, Jie, Xin, Haoran and Yu, Xin (2023). A divide-and-conquer solution to 3D human motion estimation from raw MoCap data. 30th IEEE Conference Virtual Reality and 3D User Interfaces (IEEE VR), Shanghai, China, 25-29 March 2023. Piscataway, NJ United States: IEEE. doi: 10.1109/vrw58643.2023.00226

A divide-and-conquer solution to 3D human motion estimation from raw MoCap data

2023

Journal Article

Accurate 3-DoF camera geo-localization via ground-to-satellite image matching

Shi, Yujiao, Yu, Xin, Liu, Liu, Campbell, Dylan, Koniusz, Piotr and Li, Hongdong (2023). Accurate 3-DoF camera geo-localization via ground-to-satellite image matching. IEEE transactions on pattern analysis and machine intelligence, 45 (3), 2682-2697. doi: 10.1109/TPAMI.2022.3189702

Accurate 3-DoF camera geo-localization via ground-to-satellite image matching

2023

Conference Publication

Sign Spotting via Multi-modal Fusion and Testing Time Transferring

Fu, Hongyu, Liu, Chen, Qi, Xingqun, Lin, Beibei, Li, Lincheng, Zhang, Li and Yu, Xin (2023). Sign Spotting via Multi-modal Fusion and Testing Time Transferring. European Conference on Computer Vision (ECCV 2022), Tel Aviv, Israel, 23–27 October 2022. Cham, Switzerland: Springer. doi: 10.1007/978-3-031-25085-9_16

Sign Spotting via Multi-modal Fusion and Testing Time Transferring

2023

Conference Publication

Proactive deepfake defence via identity watermarking

Zhao, Yuan, Liu, Bo, Ding, Ming, Liu, Baoping, Zhu, Tianqing and Yu, Xin (2023). Proactive deepfake defence via identity watermarking. 23rd IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI United States, 3-7 January 2023. Piscataway, NJ United States: IEEE. doi: 10.1109/wacv56688.2023.00458

Proactive deepfake defence via identity watermarking

2023

Conference Publication

TI2Net: Temporal identity inconsistency network for deepfake detection

Liu, Baoping, Liu, Bo, Ding, Ming, Zhu, Tianqing and Yu, Xin (2023). TI2Net: Temporal identity inconsistency network for deepfake detection. 23rd IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI United States, 3-7 January 2023. Piscataway, NJ United States: IEEE. doi: 10.1109/wacv56688.2023.00467

TI2Net: Temporal identity inconsistency network for deepfake detection

2023

Conference Publication

Weakly-supervised Point Cloud Instance Segmentation with Geometric Priors

Du, Heming, Yu, Xin, Hussain, Farookh, Armin, Mohammad Ali, Petersson, Lars and Li, Weihao (2023). Weakly-supervised Point Cloud Instance Segmentation with Geometric Priors. 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI United States, 2-7 January 2023. Piscataway, NJ United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/wacv56688.2023.00425

Weakly-supervised Point Cloud Instance Segmentation with Geometric Priors

2023

Conference Publication

Auslan-Daily: Australian sign language translation for daily communication and news

Shen, Xin, Yuan, Shaozu, Sheng, Hongwei, Du, Heming and Yu, Xin (2023). Auslan-Daily: Australian sign language translation for daily communication and news. 37th Conference on Neural Information Processing Systems (NeurIPS 2023), New Orleans, LA, United States, 10-16 December 2023. San Diego, CA, United States: Neural Information Processing Systems Foundation.

Auslan-Daily: Australian sign language translation for daily communication and news

2023

Conference Publication

StyleTalk: one-shot talking head generation with controllable speaking styles

Ma, Yifeng, Wang, Suzhen, Hu, Zhipeng, Fan, Changjie, Lv, Tangjie, Ding, Yu, Deng, Zhidong and Yu, Xin (2023). StyleTalk: one-shot talking head generation with controllable speaking styles. 37th AAAI Conference on Artificial Intelligence (AAAI) / 35th Conference on Innovative Applications of Artificial Intelligence / 13th Symposium on Educational Advances in Artificial Intelligence, Washington, DC, United States, 7-14 February 2023. Palo Alto, CA, United States: Association for the Advancement of Artificial Intelligence. doi: 10.1609/aaai.v37i2.25280

StyleTalk: one-shot talking head generation with controllable speaking styles

2023

Journal Article

Cyclic self-training with proposal weight modulation for cross-supervised object detection

Xu, Yunqiu, Zhou, Chunluan, Yu, Xin and Yang, Yi (2023). Cyclic self-training with proposal weight modulation for cross-supervised object detection. IEEE Transactions on Image Processing, 32, 1992-2002. doi: 10.1109/TIP.2023.3261752

Cyclic self-training with proposal weight modulation for cross-supervised object detection

2023

Conference Publication

Sim2RealVS: A new benchmark for video stabilization with a strong baseline

Rao, Qi, Yu, Xin, Navasardyan, Shant and Shi, Humphrey (2023). Sim2RealVS: A new benchmark for video stabilization with a strong baseline. 23rd IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI United States, 3-7 January 2023. Piscataway, NJ United States: IEEE. doi: 10.1109/wacv56688.2023.00537

Sim2RealVS: A new benchmark for video stabilization with a strong baseline

2023

Conference Publication

Meta knowledge condensation for federated learning

Liu, Ping, Yu, Xin and Zhou, Joey Tianyi (2023). Meta knowledge condensation for federated learning. 11th International Conference on Learning Representations, ICLR 2023, Kigali, Rwanda, 1-5 May 2023. International Conference on Learning Representations, ICLR.

Meta knowledge condensation for federated learning

2023

Journal Article

Boosting model inversion attacks with adversarial examples

Zhou, Shuai, Zhu, Tianqing, Ye, Dayong, Yu, Xin and Zhou, Wanlei (2023). Boosting model inversion attacks with adversarial examples. IEEE Transactions on Dependable and Secure Computing, 21 (3), 1-18. doi: 10.1109/TDSC.2023.3285015

Boosting model inversion attacks with adversarial examples

2023

Conference Publication

GaitStrip: Gait Recognition via Effective Strip-Based Feature Representations and Multi-level Framework

Wang, Ming, Lin, Beibei, Guo, Xianda, Li, Lincheng, Zhu, Zheng, Sun, Jiande, Zhang, Shunli, Liu, Yu and Yu, Xin (2023). GaitStrip: Gait Recognition via Effective Strip-Based Feature Representations and Multi-level Framework. 16th Asian Conference on Computer Vision, Macao, China, 4–8 December 2022. Cham, Switzerland: Springer. doi: 10.1007/978-3-031-26316-3_42

GaitStrip: Gait Recognition via Effective Strip-Based Feature Representations and Multi-level Framework

2023

Journal Article

HairStyle editing via parametric controllable strokes

Song, Xinhui, Liu, Chen, Zheng, Youyi, Feng, Zunlei, Li, Lincheng, Zhou, Kun and Yu, Xin (2023). HairStyle editing via parametric controllable strokes. IEEE Transactions on Visualization and Computer Graphics, 30 (7), 1-14. doi: 10.1109/TVCG.2023.3241894

HairStyle editing via parametric controllable strokes

2023

Conference Publication

CVLNet: Cross-view Semantic Correspondence Learning for Video-Based Camera Localization

Shi, Yujiao, Yu, Xin, Wang, Shan and Li, Hongdong (2023). CVLNet: Cross-view Semantic Correspondence Learning for Video-Based Camera Localization. 16th Asian Conference on Computer Vision, Macao, China, 4–8 December 2022. Cham, Switzerland: Springer. doi: 10.1007/978-3-031-26319-4_8

CVLNet: Cross-view Semantic Correspondence Learning for Video-Based Camera Localization

Funding

Current funding

  • 2025 - 2026
    Creation of an interactive online seaweed production map to support policy-making for the Indonesian seaweed industry
    KONEKSI Environment and Climate Change Extension Support
    Open grant
  • 2024 - 2027
    AI-Empowered and Video-Based uplift of Paralympic classification systems (AQIRP project administered by Follow Me AI)
    Follow Me AI Pty Ltd
    Open grant
  • 2023 - 2028
    Breaking the Communication Barrier for the Australian Deaf Community: Vision Based Australian Sign Language Translation and Production
    Google Asia Pacific Pte Ltd
    Open grant
  • 2023 - 2026
    Advancing Human Perception: Countering Evolving Malicious Fake Visual Data
    ARC Discovery Early Career Researcher Award
    Open grant
  • 2023 - 2027
    Analytics for the Australian Grains Industry (AAGI)
    Grains Research & Development Corporation
    Open grant

Past funding

  • 2024
    Breaking the Communication Barrier for the Australian Deaf Community: Vision Based Australian Sign Language Translation and Production
    Google Inc
    Open grant
  • 2023 - 2024
    Developing applications of satellite imagery for modelling environmental and social impacts of climate change on seaweed farming in Indonesia (KONEKSI Grant administered by Griffith University)
    Griffith University
    Open grant
  • 2023 - 2025
    Two-way Auslan: Automatic Machine Translation of Australian Sign Language (ARC Discovery Project administered by ANU)
    The Australian National University
    Open grant

Supervision

Availability

Dr Xin Yu is:
Available for supervision

Looking for a supervisor? Read our advice on how to choose a supervisor.

Supervision history

Current supervision

  • Doctor Philosophy

    Multimodal foundation model design and analysis

    Principal Advisor

    Other advisors: Dr Miao Xu, Dr Heming Du

  • Doctor Philosophy

    Human Posture Recognition Applied to Physical Activity

    Principal Advisor

    Other advisors: Professor Sean Tweedy

  • Doctor Philosophy

    Understanding Human Intention and Performance

    Principal Advisor

    Other advisors: Associate Professor Sen Wang

  • Doctor Philosophy

    Understanding Human Movements and Sport Performance Analysis

    Principal Advisor

    Other advisors: Dr Miao Xu

  • Doctor Philosophy

    Effective Visual Data Compression

    Principal Advisor

    Other advisors: Associate Professor Sen Wang, Dr Heming Du

  • Doctor Philosophy

    Automatic Retinal Health Monitoring through Multi-modal Medical Imaging

    Principal Advisor

    Other advisors: Associate Professor Mahsa Baktashmotlagh

  • Doctor Philosophy

    Combating evolving deceptive fake visual information through deepfake detection

    Principal Advisor

    Other advisors: Dr Miao Xu

  • Doctor Philosophy

    Two way Auslan Translation

    Principal Advisor

    Other advisors: Associate Professor Mahsa Baktashmotlagh, Dr Heming Du

  • Doctor Philosophy

    Integrating Deep Learning and Remote Sensing for Precision Agriculture in Staple Crops

    Principal Advisor

    Other advisors: Dr Miao Xu

  • Doctor Philosophy

    Understanding Human Intention and Performance

    Principal Advisor

    Other advisors: Dr Heming Du, Dr Miao Xu

  • Doctor Philosophy

    Human Understanding in Sports

    Principal Advisor

    Other advisors: Associate Professor Sen Wang, Dr Heming Du

  • Doctor Philosophy

    Pose Estimation for Human with Disabilities

    Principal Advisor

    Other advisors: Professor Brian Lovell

  • Doctor Philosophy

    Two way Auslan Translation

    Principal Advisor

    Other advisors: Professor Helen Huang, Dr Heming Du

  • Doctor Philosophy

    Compressed Video Restoration

    Principal Advisor

    Other advisors: Dr Miao Xu, Dr Heming Du

  • Doctor Philosophy

    Object-Centric Audio-Visual Alignment for Sounding Source Segmentation

    Principal Advisor

    Other advisors: Associate Professor Sen Wang

  • Doctor Philosophy

    Remote Sensing Analysis in computer vision

    Associate Advisor

    Other advisors: Professor Helen Huang

  • Doctor Philosophy

    Enhancing Robustness and Generalizability in Computational Models

    Associate Advisor

    Other advisors: Associate Professor Mahsa Baktashmotlagh

  • Doctor Philosophy

    Data driven approaches for smart farming

    Associate Advisor

    Other advisors: Professor Helen Huang

  • Doctor Philosophy

    Towards knowledge discovery from imperfect and evolving data

    Associate Advisor

    Other advisors: Dr Miao Xu

Media

Enquiries

For media enquiries about Dr Xin Yu's areas of expertise, story ideas and help finding experts, contact our Media team:

communications@uq.edu.au