Overview
Background
My name is Xin Yu, a Senior Lecturer at the University of Queensland. I am an Australian Research Council Discovery Early Career Researcher Award 2023-2025 (DECRA) recipient and an awardee of the prestigious Google Research Scholar Program in 2021. I am also a Google Visiting Faculty. Previously, I was a research fellow at the Australian National University (ANU). I received my PhD degree from the Australian National Unversity under the supervision of Prof. Richard Hartley, Prof. Fatih Porikli and Dr. Basura Fernando. I also received a PhD degree from Tsinghua University supervised by Prof. Li Zhang. I am interested in Computer Vision and Machine Learning topics.
My research topics includes various computer vision and machine learning tasks, especially in efficient low-level image processing, image retrieval and localization, action recognition, 3D pose estimation, visual navigation and sign language recognition and translation.
Availability
- Dr Xin Yu is:
- Available for supervision
Research impacts
One of my research papers has been awarded "Best Paper Honorable Mention" award in the premium computer vision conference WACV 2020, and one paper has been nominated for the Best Paper Award in CVPR 2020.
I was awarded the Outstanding Reviewer Award in ECCV 2020, CVPR 2021 and ICCV 2021. CVPR, ICCV and ECCV are internationally world-leading computer vision and machine learning conferences. My research interests include deep learning techniques, image processing, and computer vision tasks. I am a program committee member of top-tier computer vision and machine learning conferences, such as CVPR, ICCV, ECCV, ICML, ICLR and NeurIPS, and a reviewer of prestigious journals, such as TPAMI, IJCV and TIP.
I am happy to supervise self-motivated PhD and MPhil students. If you are an undergraduate student and willing to conduct your honour project, please drop me an email.
Works
Search Professor Xin Yu’s works on UQ eSpace
2024
Conference Publication
AS-NeRF: learning auxiliary sampling for generalizable novel view synthesis from sparse views
Tang, Jilin, Li, Lincheng, Qi, Xingqun, Chen, Yingfeng, Fan, Changjie and Yu, Xin (2024). AS-NeRF: learning auxiliary sampling for generalizable novel view synthesis from sparse views. 2024 IEEE International Conference on Multimedia and Expo (ICME), Niagara Falls, ON, Canada, 15-19 July 2024. Washington, DC, United States: IEEE Computer Society. doi: 10.1109/ICME57554.2024.10688126
2024
Conference Publication
TPR: Topology-preserving reservoirs for generalized zero-shot learning
Chen, Hui, Liu, Yanbin, Ma, Yongqiang, Zheng, Nanning and Yu, Xin (2024). TPR: Topology-preserving reservoirs for generalized zero-shot learning. NIPS '24: 38th International Conference on Neural Information Processing Systems, Vancouver, BC, Canada, 10-15 December 2024. Maryland Heights, MO, United States: Morgan Kaufmann Publishers.
2024
Conference Publication
Benchmarking audio visual segmentation for long-untrimmed videos
Liu, Chen, Li, Peike Patrick, Yu, Qingtao, Sheng, Hongwei, Wang, Dadong, Li, Lincheng and Yu, Xin (2024). Benchmarking audio visual segmentation for long-untrimmed videos. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, United States, 16-22 June 2024. Washington, DC, United States: IEEE Computer Society. doi: 10.1109/CVPR52733.2024.02143
2024
Conference Publication
Text-guided 3D face synthesis - from generation to editing
Wu, Yunjie, Meng, Yapeng, Hu, Zhipeng, Li, Lincheng, Wu, Haoqian, Zhou, Kun, Xu, Weiwei and Yu, Xin (2024). Text-guided 3D face synthesis - from generation to editing. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, United States, 16-22 June 2024. Washington, DC, United States: IEEE Computer Society. doi: 10.1109/CVPR52733.2024.00126
2024
Conference Publication
MM-WLAuslan: multi-view multi-modal word-level Australian Sign Language recognition dataset
Shen, Xin, Du, Heming, Sheng, Hongwei, Wang, Shuyun, Chen, Hui, Chen, Huiqiang, Wu, Zhuojie, Du, Xiaobiao, Ying, Jiaying, Lu, Ruihan, Xu, Qingzheng and Yu, Xin (2024). MM-WLAuslan: multi-view multi-modal word-level Australian Sign Language recognition dataset. NeurIPS 2024, Vancouver, BC, Canada, 10 - 15 December 2024. Maryland Heights, MO, United States: Morgan Kaufmann Publishers.
2024
Journal Article
StyleTalk++: A unified framework for controlling the speaking styles of talking heads
Wang, Suzhen, Ma, Yifeng, Ding, Yu, Hu, Zhipeng, Fan, Changjie, Lv, Tangjie, Deng, Zhidong and Yu, Xin (2024). StyleTalk++: A unified framework for controlling the speaking styles of talking heads. IEEE Transactions on Pattern Analysis and Machine Intelligence, 46 (6), 4331-4347. doi: 10.1109/tpami.2024.3357808
2024
Conference Publication
An empirical analysis on spatial reasoning capabilities of large multimodal models
Shiri, Fatemeh, Guo, Xiao-Yu, Far, Mona Golestan, Yu, Xin, Haffari, Gholamreza and Li, Yuan-Fang (2024). An empirical analysis on spatial reasoning capabilities of large multimodal models. 2024 Conference on Empirical Methods in Natural Language Processing, Miami, FL, United States, 12-16 November 2024. Kerrville, TX, United States: Association for Computational Linguistics (ACL). doi: 10.18653/v1/2024.emnlp-main.1195
2024
Journal Article
CMGNet: Collaborative multi-modal graph network for video captioning
Rao, Qi, Yu, Xin, Li, Guang and Zhu, Linchao (2024). CMGNet: Collaborative multi-modal graph network for video captioning. Computer Vision and Image Understanding, 238 103864, 1-10. doi: 10.1016/j.cviu.2023.103864
2023
Journal Article
Calligraphy Font generation via explicitly modeling location-aware glyph component deformations
Zhao, Minda, Qi, Xingqun, Hu, Zhipeng, Li, Lincheng, Zhang, Yongqiang, Huang, Zi and Yu, Xin (2023). Calligraphy Font generation via explicitly modeling location-aware glyph component deformations. IEEE Transactions on Multimedia, 26, 5939-5950. doi: 10.1109/tmm.2023.3342690
2023
Journal Article
DMMG: Dual min-max games for self-supervised skeleton-based action recognition
Guan, Shannan, Yu, Xin, Huang, Wei, Fang, Gengfa and Lu, Haiyan (2023). DMMG: Dual min-max games for self-supervised skeleton-based action recognition. IEEE Transactions on Image Processing, 33, 395-407. doi: 10.1109/tip.2023.3338410
2023
Conference Publication
Learning efficient unsupervised satellite image-based building damage detection
Zhang, Yiyun, Wang, Zijian, Luo, Yadan, Yu, Xin and Huang, Zi (2023). Learning efficient unsupervised satellite image-based building damage detection. 2023 IEEE International Conference on Data Mining (ICDM), Shanghai, China, 1-4 December 2023. Piscataway, NJ, United States: IEEE. doi: 10.1109/icdm58522.2023.00206
2023
Conference Publication
Context-based masking for spontaneous venous pulsations detection
Sheng, Hongwei, Yu, Xin, Li, Xue and Golzan, Mojtaba (2023). Context-based masking for spontaneous venous pulsations detection. 36th Australasian Joint Conference on Artificial Intelligence, AJCAI 2023, Brisbane, QLD Australia, 28 November –1 December 2023. Singapore: Springer. doi: 10.1007/978-981-99-8388-9_42
2023
Conference Publication
A new perspective of weakly supervised 3D instance segmentation via bounding boxes
Yu, Qingtao, Du, Heming and Yu, Xin (2023). A new perspective of weakly supervised 3D instance segmentation via bounding boxes. 36th Australasian Joint Conference on Artificial Intelligence, AJCAI 2023, Brisbane, QLD Australia, 28 November –1 December 2023. Singapore: Springer. doi: 10.1007/978-981-99-8388-9_9
2023
Conference Publication
Toward a unified framework for RGB and RGB-D visual navigation
Du, Heming, Huang, Zi, Chapman, Scott and Yu, Xin (2023). Toward a unified framework for RGB and RGB-D visual navigation. 36th Australasian Joint Conference on Artificial Intelligence, AJCAI 2023, Brisbane, QLD Australia, 28 November –1 December 2023. Singapore: Springer. doi: 10.1007/978-981-99-8391-9_29
2023
Conference Publication
Towards reliable and efficient vegetation segmentation for Australian wheat data analysis
Yuan, Bowen, Wang, Zijian and Yu, Xin (2023). Towards reliable and efficient vegetation segmentation for Australian wheat data analysis. 34th Australasian Database Conference (ADC), Melbourne, NSW Australia, 1-3 November 2023. Cham, Switzerland: Springer Cham. doi: 10.1007/978-3-031-47843-7_9
2023
Conference Publication
Audio-visual segmentation by exploring cross-modal mutual semantics
Liu, Chen, Li, Peike Patrick, Qi, Xingqun, Zhang, Hu, Li, Lincheng, Wang, Dadong and Yu, Xin (2023). Audio-visual segmentation by exploring cross-modal mutual semantics. MM '23: The 31st ACM International Conference on Multimedia, Ottawa, ON Canada, 29 October - 3 November 2023. New York, NY United States: Association for Computing Machinery. doi: 10.1145/3581783.3612373
2023
Conference Publication
DyGait: exploiting dynamic representations for high-performance gait recognition
Wang, Ming, Guo, Xianda, Lin, Beibei, Yang, Tian, Zhu, Zheng, Li, Lincheng, Zhang, Shunli and Yu, Xin (2023). DyGait: exploiting dynamic representations for high-performance gait recognition. IEEE/CVF International Conference on Computer Vision (ICCV), Paris, France, 2-6 October 2023. Piscataway, NJ, United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/iccv51070.2023.01235
2023
Conference Publication
Gait recognition with mask-based regularization
Shen, Chuanfu, Lin, Beibei, Zhang, Shunli, Yu, Xin, Huang, George Q. and Yu, Shiqi (2023). Gait recognition with mask-based regularization. IEEE International Joint Conference on Biometrics (IJCB), Ljubljana, Slovenia, 25-28 September 2023. New York, NY, United States: IEEE. doi: 10.1109/ijcb57857.2023.10449112
2023
Journal Article
Deep idempotent network for efficient single image blind deblurring
Mao, Yuxin, Wan, Zhexiong, Dai, Yuchao and Yu, Xin (2023). Deep idempotent network for efficient single image blind deblurring. IEEE Transactions on Circuits and Systems for Video Technology, 33 (1), 172-185. doi: 10.1109/tcsvt.2022.3202361
2023
Conference Publication
Diverse 3D Hand Gesture Prediction from Body Dynamics by Bilateral Hand Disentanglement
Qi, Xingqun, Liu, Chen, Sun, Muyi, Li, Lincheng, Fan, Changjie and Yu, Xin (2023). Diverse 3D Hand Gesture Prediction from Body Dynamics by Bilateral Hand Disentanglement. 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, Canada, 17-24 June 2023. Piscataway, NJ United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/cvpr52729.2023.00448
Funding
Current funding
Past funding
Supervision
Availability
- Dr Xin Yu is:
- Available for supervision
Looking for a supervisor? Read our advice on how to choose a supervisor.
Supervision history
Current supervision
-
Doctor Philosophy
Automatic Retinal Health Monitoring through Multi-modal Medical Imaging
Principal Advisor
Other advisors: Associate Professor Mahsa Baktashmotlagh
-
Doctor Philosophy
Two way Auslan Translation
Principal Advisor
Other advisors: Professor Helen Huang, Dr Heming Du
-
Doctor Philosophy
Combating evolving deceptive fake visual information through deepfake detection
Principal Advisor
Other advisors: Dr Miao Xu
-
Doctor Philosophy
Two way Auslan Translation
Principal Advisor
Other advisors: Associate Professor Mahsa Baktashmotlagh, Dr Heming Du
-
Doctor Philosophy
Human Posture Recognition Applied to Physical Activity
Principal Advisor
Other advisors: Professor Sean Tweedy
-
Doctor Philosophy
Integrating Deep Learning and Remote Sensing for Precision Agriculture in Staple Crops
Principal Advisor
Other advisors: Dr Miao Xu
-
Doctor Philosophy
Human Understanding in Sports
Principal Advisor
Other advisors: Associate Professor Sen Wang, Dr Heming Du
-
Doctor Philosophy
Pose Estimation for Human with Disabilities
Principal Advisor
Other advisors: Professor Brian Lovell
-
Doctor Philosophy
Compressed Video Restoration
Principal Advisor
Other advisors: Dr Miao Xu, Dr Heming Du
-
Doctor Philosophy
Effective Visual Data Compression
Principal Advisor
Other advisors: Associate Professor Sen Wang, Dr Heming Du
-
Doctor Philosophy
Object-Centric Audio-Visual Alignment for Sounding Source Segmentation
Principal Advisor
Other advisors: Associate Professor Sen Wang
-
Doctor Philosophy
Understanding Human Intention and Performance
Principal Advisor
Other advisors: Dr Heming Du, Dr Miao Xu
-
Doctor Philosophy
Multimodal foundation model design and analysis
Principal Advisor
Other advisors: Dr Miao Xu, Dr Heming Du
-
Doctor Philosophy
Understanding Human Intention and Performance
Principal Advisor
Other advisors: Associate Professor Sen Wang
-
Doctor Philosophy
Understanding Human Movements and Sport Performance Analysis
Principal Advisor
Other advisors: Dr Miao Xu
-
Doctor Philosophy
Towards knowledge discovery from imperfect and evolving data
Associate Advisor
Other advisors: Dr Miao Xu
-
Doctor Philosophy
Remote Sensing Analysis in computer vision
Associate Advisor
Other advisors: Professor Helen Huang
-
Doctor Philosophy
Enhancing Robustness and Generalizability in Computational Models
Associate Advisor
Other advisors: Associate Professor Mahsa Baktashmotlagh
-
Doctor Philosophy
Data driven approaches for smart farming
Associate Advisor
Other advisors: Professor Helen Huang
Media
Enquiries
For media enquiries about Dr Xin Yu's areas of expertise, story ideas and help finding experts, contact our Media team: