Skip to menu Skip to content Skip to footer
Dr Xin Yu
Dr

Xin Yu

Email: 

Overview

Background

My name is Xin Yu, a Senior Lecturer at the University of Queensland. I am an Australian Research Council Discovery Early Career Researcher Award 2023-2025 (DECRA) recipient and an awardee of the prestigious Google Research Scholar Program in 2021. I am also a Google Visiting Faculty. Previously, I was a research fellow at the Australian National University (ANU). I received my PhD degree from the Australian National Unversity under the supervision of Prof. Richard Hartley, Prof. Fatih Porikli and Dr. Basura Fernando. I also received a PhD degree from Tsinghua University supervised by Prof. Li Zhang. I am interested in Computer Vision and Machine Learning topics.

My research topics includes various computer vision and machine learning tasks, especially in efficient low-level image processing, image retrieval and localization, action recognition, 3D pose estimation, visual navigation and sign language recognition and translation.

Availability

Dr Xin Yu is:
Available for supervision

Research impacts

One of my research papers has been awarded "Best Paper Honorable Mention" award in the premium computer vision conference WACV 2020, and one paper has been nominated for the Best Paper Award in CVPR 2020.

I was awarded the Outstanding Reviewer Award in ECCV 2020, CVPR 2021 and ICCV 2021. CVPR, ICCV and ECCV are internationally world-leading computer vision and machine learning conferences. My research interests include deep learning techniques, image processing, and computer vision tasks. I am a program committee member of top-tier computer vision and machine learning conferences, such as CVPR, ICCV, ECCV, ICML, ICLR and NeurIPS, and a reviewer of prestigious journals, such as TPAMI, IJCV and TIP.

I am happy to supervise self-motivated PhD and MPhil students. If you are an undergraduate student and willing to conduct your honour project, please drop me an email.

Works

Search Professor Xin Yu’s works on UQ eSpace

152 works between 2011 and 2025

21 - 40 of 152 works

2024

Conference Publication

Benchmarking audio visual segmentation for long-untrimmed videos

Liu, Chen, Li, Peike Patrick, Yu, Qingtao, Sheng, Hongwei, Wang, Dadong, Li, Lincheng and Yu, Xin (2024). Benchmarking audio visual segmentation for long-untrimmed videos. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, United States, 16-22 June 2024. Washington, DC, United States: IEEE Computer Society. doi: 10.1109/CVPR52733.2024.02143

Benchmarking audio visual segmentation for long-untrimmed videos

2024

Conference Publication

Text-guided 3D face synthesis - from generation to editing

Wu, Yunjie, Meng, Yapeng, Hu, Zhipeng, Li, Lincheng, Wu, Haoqian, Zhou, Kun, Xu, Weiwei and Yu, Xin (2024). Text-guided 3D face synthesis - from generation to editing. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, United States, 16-22 June 2024. Washington, DC, United States: IEEE Computer Society. doi: 10.1109/CVPR52733.2024.00126

Text-guided 3D face synthesis - from generation to editing

2024

Journal Article

MarkerNet: A divide-and-conquer solution to motion capture solving from raw markers

Hu, Zhipeng, Tang, Jilin, Li, Lincheng, Hou, Jie, Xin, Haoran, Yu, Xin and Bu, Jiajun (2024). MarkerNet: A divide-and-conquer solution to motion capture solving from raw markers. Computer Animation and Virtual Worlds, 35 (1) e2228, 1-19. doi: 10.1002/cav.2228

MarkerNet: A divide-and-conquer solution to motion capture solving from raw markers

2024

Journal Article

EmotionGesture: audio-driven diverse emotional co-speech 3D gesture generation

Qi, Xingqun, Liu, Chen, Li, Lincheng, Hou, Jie, Xin, Haoran and Yu, Xin (2024). EmotionGesture: audio-driven diverse emotional co-speech 3D gesture generation. IEEE Transactions on Multimedia, 26, 10420-10430. doi: 10.1109/tmm.2024.3407692

EmotionGesture: audio-driven diverse emotional co-speech 3D gesture generation

2024

Journal Article

CBARF: cascaded bundle-adjusting neural radiance fields from imperfect camera poses

Fu, Hongyu, Yu, Xin, Li, Lincheng and Zhang, Li (2024). CBARF: cascaded bundle-adjusting neural radiance fields from imperfect camera poses. IEEE Transactions on Multimedia, 26, 9304-9315. doi: 10.1109/tmm.2024.3388929

CBARF: cascaded bundle-adjusting neural radiance fields from imperfect camera poses

2024

Conference Publication

MMOOC: a multimodal misinformation dataset for out-of-context news analysis

Xu, Qingzheng, Du, Heming, Chen, Huiqiang, Liu, Bo and Yu, Xin (2024). MMOOC: a multimodal misinformation dataset for out-of-context news analysis. 29th Australasian Conference, ACISP 2024, Sydney, NSW, Australia, 15–17 July 2024. Heidelberg, Germany: Springer. doi: 10.1007/978-981-97-5101-3_24

MMOOC: a multimodal misinformation dataset for out-of-context news analysis

2024

Journal Article

BAVS: Bootstrapping Audio-Visual Segmentation by Integrating Foundation Knowledge

Liu, Chen, Li, Peike, Zhang, Hu, Li, Lincheng, Huang, Zi, Wang, Dadong and Yu, Xin (2024). BAVS: Bootstrapping Audio-Visual Segmentation by Integrating Foundation Knowledge. IEEE Transactions on Multimedia, 26, 10015-10028. doi: 10.1109/tmm.2024.3405622

BAVS: Bootstrapping Audio-Visual Segmentation by Integrating Foundation Knowledge

2024

Conference Publication

EfficientDreamer: high-fidelity and stable 3D creation via orthogonal-view diffusion priors

Hu, Zhipeng, Zhao, Minda, Zhao, Chaoyi, Liang, Xinyue, Li, Lincheng, Zhao, Zeng, Fan, Changjie, Zhou, Xiaowei and Yu, Xin (2024). EfficientDreamer: high-fidelity and stable 3D creation via orthogonal-view diffusion priors. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, United States, 16-22 June 2024. Washington, DC, United States: IEEE Computer Society. doi: 10.1109/CVPR52733.2024.00473

EfficientDreamer: high-fidelity and stable 3D creation via orthogonal-view diffusion priors

2024

Conference Publication

Pupil-fMRI correlation-based Explainable AI to classify Alzheimer’s Disease

Liu, Xiaochen, Xu, William, Hike, David, Xie, Zeping, Liu, Andy, Choi, Sangcheon, Zhu, Biyue, Ran, Chongzhao, Jiang, Yuanyuan and Yu, Xin (2024). Pupil-fMRI correlation-based Explainable AI to classify Alzheimer’s Disease. 2024 ISMRM & ISMRT Annual Meeting, Singapore, 4-9 May 2024. Concord, CA United States: ISMRM. doi: 10.58530/2024/1124

Pupil-fMRI correlation-based Explainable AI to classify Alzheimer’s Disease

2024

Journal Article

CMGNet: Collaborative multi-modal graph network for video captioning

Rao, Qi, Yu, Xin, Li, Guang and Zhu, Linchao (2024). CMGNet: Collaborative multi-modal graph network for video captioning. Computer Vision and Image Understanding, 238 103864, 1-10. doi: 10.1016/j.cviu.2023.103864

CMGNet: Collaborative multi-modal graph network for video captioning

2024

Journal Article

StyleTalk++: A unified framework for controlling the speaking styles of talking heads

Wang, Suzhen, Ma, Yifeng, Ding, Yu, Hu, Zhipeng, Fan, Changjie, Lv, Tangjie, Deng, Zhidong and Yu, Xin (2024). StyleTalk++: A unified framework for controlling the speaking styles of talking heads. IEEE Transactions on Pattern Analysis and Machine Intelligence, 46 (6), 4331-4347. doi: 10.1109/tpami.2024.3357808

StyleTalk++: A unified framework for controlling the speaking styles of talking heads

2024

Conference Publication

AS-NeRF: learning auxiliary sampling for generalizable novel view synthesis from sparse views

Tang, Jilin, Li, Lincheng, Qi, Xingqun, Chen, Yingfeng, Fan, Changjie and Yu, Xin (2024). AS-NeRF: learning auxiliary sampling for generalizable novel view synthesis from sparse views. 2024 IEEE International Conference on Multimedia and Expo (ICME), Niagara Falls, ON, Canada, 15-19 July 2024. Washington, DC, United States: IEEE Computer Society. doi: 10.1109/ICME57554.2024.10688126

AS-NeRF: learning auxiliary sampling for generalizable novel view synthesis from sparse views

2023

Journal Article

Calligraphy Font generation via explicitly modeling location-aware glyph component deformations

Zhao, Minda, Qi, Xingqun, Hu, Zhipeng, Li, Lincheng, Zhang, Yongqiang, Huang, Zi and Yu, Xin (2023). Calligraphy Font generation via explicitly modeling location-aware glyph component deformations. IEEE Transactions on Multimedia, 26, 5939-5950. doi: 10.1109/tmm.2023.3342690

Calligraphy Font generation via explicitly modeling location-aware glyph component deformations

2023

Journal Article

DMMG: Dual min-max games for self-supervised skeleton-based action recognition

Guan, Shannan, Yu, Xin, Huang, Wei, Fang, Gengfa and Lu, Haiyan (2023). DMMG: Dual min-max games for self-supervised skeleton-based action recognition. IEEE Transactions on Image Processing, 33, 395-407. doi: 10.1109/tip.2023.3338410

DMMG: Dual min-max games for self-supervised skeleton-based action recognition

2023

Conference Publication

Learning efficient unsupervised satellite image-based building damage detection

Zhang, Yiyun, Wang, Zijian, Luo, Yadan, Yu, Xin and Huang, Zi (2023). Learning efficient unsupervised satellite image-based building damage detection. 2023 IEEE International Conference on Data Mining (ICDM), Shanghai, China, 1-4 December 2023. Piscataway, NJ, United States: IEEE. doi: 10.1109/icdm58522.2023.00206

Learning efficient unsupervised satellite image-based building damage detection

2023

Conference Publication

A new perspective of weakly supervised 3D instance segmentation via bounding boxes

Yu, Qingtao, Du, Heming and Yu, Xin (2023). A new perspective of weakly supervised 3D instance segmentation via bounding boxes. 36th Australasian Joint Conference on Artificial Intelligence, AJCAI 2023, Brisbane, QLD Australia, 28 November –1 December 2023. Singapore: Springer. doi: 10.1007/978-981-99-8388-9_9

A new perspective of weakly supervised 3D instance segmentation via bounding boxes

2023

Conference Publication

Toward a unified framework for RGB and RGB-D visual navigation

Du, Heming, Huang, Zi, Chapman, Scott and Yu, Xin (2023). Toward a unified framework for RGB and RGB-D visual navigation. 36th Australasian Joint Conference on Artificial Intelligence, AJCAI 2023, Brisbane, QLD Australia, 28 November –1 December 2023. Singapore: Springer. doi: 10.1007/978-981-99-8391-9_29

Toward a unified framework for RGB and RGB-D visual navigation

2023

Conference Publication

Context-based masking for spontaneous venous pulsations detection

Sheng, Hongwei, Yu, Xin, Li, Xue and Golzan, Mojtaba (2023). Context-based masking for spontaneous venous pulsations detection. 36th Australasian Joint Conference on Artificial Intelligence, AJCAI 2023, Brisbane, QLD Australia, 28 November –1 December 2023. Singapore: Springer. doi: 10.1007/978-981-99-8388-9_42

Context-based masking for spontaneous venous pulsations detection

2023

Conference Publication

Towards reliable and efficient vegetation segmentation for Australian wheat data analysis

Yuan, Bowen, Wang, Zijian and Yu, Xin (2023). Towards reliable and efficient vegetation segmentation for Australian wheat data analysis. 34th Australasian Database Conference (ADC), Melbourne, NSW Australia, 1-3 November 2023. Cham, Switzerland: Springer Cham. doi: 10.1007/978-3-031-47843-7_9

Towards reliable and efficient vegetation segmentation for Australian wheat data analysis

2023

Conference Publication

Audio-visual segmentation by exploring cross-modal mutual semantics

Liu, Chen, Li, Peike Patrick, Qi, Xingqun, Zhang, Hu, Li, Lincheng, Wang, Dadong and Yu, Xin (2023). Audio-visual segmentation by exploring cross-modal mutual semantics. MM '23: The 31st ACM International Conference on Multimedia, Ottawa, ON Canada, 29 October - 3 November 2023. New York, NY United States: Association for Computing Machinery. doi: 10.1145/3581783.3612373

Audio-visual segmentation by exploring cross-modal mutual semantics

Supervision

Availability

Dr Xin Yu is:
Available for supervision

Before you email them, read our advice on how to contact a supervisor.

Supervision history

Current supervision

  • Doctor Philosophy

    Enhancing Building Fire Safety by Utilising Machine Learning Techniques

    Principal Advisor

    Other advisors: Professor Brian Lovell

  • Doctor Philosophy

    Human Posture Recognition Applied to Physical Activity

    Principal Advisor

    Other advisors: Professor Sean Tweedy

  • Doctor Philosophy

    Two way Auslan Translation

    Principal Advisor

    Other advisors: Professor Helen Huang

  • Doctor Philosophy

    Two way Auslan Translation

    Principal Advisor

    Other advisors: Associate Professor Mahsa Baktashmotlagh

  • Doctor Philosophy

    Towards Efficient Pest Detection in Agriculture

    Principal Advisor

    Other advisors: Associate Professor Sen Wang

  • Doctor Philosophy

    Multimodal foundation model design and analysis

    Principal Advisor

    Other advisors: Dr Miao Xu

  • Doctor Philosophy

    The prediction, diagnosis, and severity estimation models for plant disease

    Principal Advisor

    Other advisors: Associate Professor Sen Wang

  • Doctor Philosophy

    Advancing Human Perception: Countering Evolving Malicious Fake Visual Data

    Principal Advisor

    Other advisors: Dr Miao Xu

  • Doctor Philosophy

    Automatic Retinal Health Monitoring through Multi-modal Medical Imaging

    Principal Advisor

    Other advisors: Associate Professor Mahsa Baktashmotlagh

  • Doctor Philosophy

    Understanding Human Intention and Performance

    Principal Advisor

    Other advisors: Associate Professor Sen Wang

  • Doctor Philosophy

    Combating evolving deceptive fake visual information through deepfake detection

    Principal Advisor

    Other advisors: Associate Professor Sen Wang

  • Doctor Philosophy

    Understanding Human Intention and Performance

    Principal Advisor

    Other advisors: Dr Miao Xu

  • Doctor Philosophy

    Data driven approaches for smart farming

    Associate Advisor

    Other advisors: Professor Helen Huang

  • Doctor Philosophy

    Enhancing Robustness and Generalizability in Computational Models

    Associate Advisor

    Other advisors: Associate Professor Mahsa Baktashmotlagh

  • Doctor Philosophy

    Remote Sensing Analysis in computer vision

    Associate Advisor

    Other advisors: Professor Helen Huang

Media

Enquiries

For media enquiries about Dr Xin Yu's areas of expertise, story ideas and help finding experts, contact our Media team:

communications@uq.edu.au