Overview
Background
My name is Xin Yu, a Senior Lecturer at the University of Queensland. I am an Australian Research Council Discovery Early Career Researcher Award 2023-2025 (DECRA) recipient and an awardee of the prestigious Google Research Scholar Program in 2021. I am also a Google Visiting Faculty. Previously, I was a research fellow at the Australian National University (ANU). I received my PhD degree from the Australian National Unversity under the supervision of Prof. Richard Hartley, Prof. Fatih Porikli and Dr. Basura Fernando. I also received a PhD degree from Tsinghua University supervised by Prof. Li Zhang. I am interested in Computer Vision and Machine Learning topics.
My research topics includes various computer vision and machine learning tasks, especially in efficient low-level image processing, image retrieval and localization, action recognition, 3D pose estimation, visual navigation and sign language recognition and translation.
Availability
- Dr Xin Yu is:
- Available for supervision
Research impacts
One of my research papers has been awarded "Best Paper Honorable Mention" award in the premium computer vision conference WACV 2020, and one paper has been nominated for the Best Paper Award in CVPR 2020.
I was awarded the Outstanding Reviewer Award in ECCV 2020, CVPR 2021 and ICCV 2021. CVPR, ICCV and ECCV are internationally world-leading computer vision and machine learning conferences. My research interests include deep learning techniques, image processing, and computer vision tasks. I am a program committee member of top-tier computer vision and machine learning conferences, such as CVPR, ICCV, ECCV, ICML, ICLR and NeurIPS, and a reviewer of prestigious journals, such as TPAMI, IJCV and TIP.
I am happy to supervise self-motivated PhD and MPhil students. If you are an undergraduate student and willing to conduct your honour project, please drop me an email.
Works
Search Professor Xin Yu’s works on UQ eSpace
2024
Conference Publication
Snap and diagnose: an advanced multimodal retrieval system for identifying plant diseases in the wild
Wei, Tianqi, Chen, Zhi and Yu, Xin (2024). Snap and diagnose: an advanced multimodal retrieval system for identifying plant diseases in the wild. MMASIA ’24, Auckland, New Zealand, 3-6 December 2024. New York, United States: ACM. doi: 10.1145/3696409.3700293
2024
Journal Article
M3 A: A multimodal misinformation dataset for media authenticity analysis
Xu, Qingzheng, Chen, Huiqiang, Du, Heming, Zhang, Hu, Łukasik, Szymon, Zhu, Tianqing and Yu, Xin (2024). M3 A: A multimodal misinformation dataset for media authenticity analysis. Computer Vision and Image Understanding, 249 104205. doi: 10.1016/j.cviu.2024.104205
2024
Book Chapter
OpenSight: A Simple Open-Vocabulary Framework for LiDAR-Based Object Detection
Zhang, Hu, Xu, Jianhua, Tang, Tao, Sun, Haiyang, Yu, Xin, Huang, Zi and Yu, Kaicheng (2024). OpenSight: A Simple Open-Vocabulary Framework for LiDAR-Based Object Detection. Lecture Notes in Computer Science. (pp. 1-19) Cham: Springer Nature Switzerland. doi: 10.1007/978-3-031-72907-2_1
2024
Conference Publication
Benchmarking in-the-wild multimodal disease recognition and a versatile baseline
Wei, Tianqi, Chen, Zhi, Huang, Zi and Yu, Xin (2024). Benchmarking in-the-wild multimodal disease recognition and a versatile baseline. MM '24: The 32nd ACM International Conference on Multimedia, Melbourne, VIC, Australia, 28 October-1 November 2024. New York, United States: Association for Computing Machinery. doi: 10.1145/3664647.3680599
2024
Journal Article
Ethics-aware face recognition aided by synthetic face images
Du, Xiaobiao, Yu, Xin, Liu, Jinhui, Dai, Beifen and Xu, Feng (2024). Ethics-aware face recognition aided by synthetic face images. Neurocomputing, 600 128129, 128129. doi: 10.1016/j.neucom.2024.128129
2024
Conference Publication
Machine Unlearning via Null Space Calibration
Chen, Huiqiang, Zhu, Tianqing, Yu, Xin and Zhou, Wanlei (2024). Machine Unlearning via Null Space Calibration. 33rd International Joint Conference on Artificial Intelligence (IJCAI), Jeju, South Korea, 3-9 August 2024. California: International Joint Conferences on Artificial Intelligence Organization. doi: 10.24963/ijcai.2024/40
2024
Conference Publication
Learning transferable compound expressions from Masked AutoEncoder pretraining
Qiu, Feng, Du, Heming, Zhang, Wei, Liu, Chen, Li, Lincheng, Guo, Tianchen and Yu, Xin (2024). Learning transferable compound expressions from Masked AutoEncoder pretraining. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Seattle, WA, United States, 17-18 June 2024. Piscataway, NJ, United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/cvprw63382.2024.00476
2024
Conference Publication
An effective ensemble learning framework for affective behaviour analysis
Zhang, Wei, Qiu, Feng, Liu, Chen, Li, Lincheng, Du, Heming, Guo, Tianchen and Yu, Xin (2024). An effective ensemble learning framework for affective behaviour analysis. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Seattle, WA, United States, 17-18 June 2024. Piscataway, NJ, United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/cvprw63382.2024.00479
2024
Conference Publication
Language-guided multi-modal emotional mimicry intensity estimation
Qiu, Feng, Zhang, Wei, Liu, Chen, Li, Lincheng, Du, Heming, Guo, Tianchen and Yu, Xin (2024). Language-guided multi-modal emotional mimicry intensity estimation. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Seattle, WA, United States, 17-18 June 2024. Piscataway, NJ, United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/cvprw63382.2024.00477
2024
Journal Article
Proactive image manipulation detection via deep semi-fragile watermark
Zhao, Yuan, Liu, Bo, Zhu, Tianqing, Ding, Ming, Yu, Xin and Zhou, Wanlei (2024). Proactive image manipulation detection via deep semi-fragile watermark. Neurocomputing, 585 127593. doi: 10.1016/j.neucom.2024.127593
2024
Conference Publication
DiPEx: Dispersing Prompt Expansion for class-agnostic object detection
Lim, Jia Syuen, Chen, Zhuoxiao, Baktashmotlagh, Mahsa, Chen, Zhi, Yu, Xin, Huang, Zi and Luo, Yadan (2024). DiPEx: Dispersing Prompt Expansion for class-agnostic object detection. 38th International Conference on Neural Information Processing Systems, Vancouver, BC Canada, 10-15 December 2024. New York, NY USA: Association for Computing Machinery.
2024
Journal Article
BAVS: Bootstrapping audio-visual segmentation by integrating foundation knowledge
Liu, Chen, Li, Peike, Zhang, Hu, Li, Lincheng, Huang, Zi, Wang, Dadong and Yu, Xin (2024). BAVS: Bootstrapping audio-visual segmentation by integrating foundation knowledge. IEEE Transactions on Multimedia, 26, 10015-10028. doi: 10.1109/tmm.2024.3405622
2024
Journal Article
AI empowered Auslan learning for parents of deaf children and children of deaf adults
Sheng, Hongwei, Shen, Xin, Du, Heming, Zhang, Hu, Huang, Zi and Yu, Xin (2024). AI empowered Auslan learning for parents of deaf children and children of deaf adults. AI and Ethics, 4 (4), 1-11. doi: 10.1007/s43681-024-00457-y
2024
Journal Article
Detecting facial action units from global-local fine-grained expressions
Zhang, Wei, Li, Lincheng, Ding, Yu, Chen, Wei, Deng, Zhigang and Yu, Xin (2024). Detecting facial action units from global-local fine-grained expressions. IEEE Transactions on Circuits and Systems for Video Technology, 34 (2), 983-994. doi: 10.1109/tcsvt.2023.3288903
2024
Conference Publication
When 3D bounding-box meets SAM: point cloud instance segmentation with weak-and-noisy supervision
Yu, Qingtao, Du, Heming, Liu, Chen and Yu, Xin (2024). When 3D bounding-box meets SAM: point cloud instance segmentation with weak-and-noisy supervision. 2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI, United States, 3-8 January 2024. Piscataway, NJ, United States: IEEE. doi: 10.1109/wacv57701.2024.00368
2024
Journal Article
StyleTalk++: A unified framework for controlling the speaking styles of talking heads
Wang, Suzhen, Ma, Yifeng, Ding, Yu, Hu, Zhipeng, Fan, Changjie, Lv, Tangjie, Deng, Zhidong and Yu, Xin (2024). StyleTalk++: A unified framework for controlling the speaking styles of talking heads. IEEE Transactions on Pattern Analysis and Machine Intelligence, 46 (6), 4331-4347. doi: 10.1109/tpami.2024.3357808
2024
Conference Publication
An empirical analysis on spatial reasoning capabilities of large multimodal models
Shiri, Fatemeh, Guo, Xiao-Yu, Far, Mona Golestan, Yu, Xin, Haffari, Gholamreza and Li, Yuan-Fang (2024). An empirical analysis on spatial reasoning capabilities of large multimodal models. 2024 Conference on Empirical Methods in Natural Language Processing, Miami, FL, United States, 12-16 November 2024. Kerrville, TX, United States: Association for Computational Linguistics (ACL). doi: 10.18653/v1/2024.emnlp-main.1195
2024
Journal Article
CMGNet: Collaborative multi-modal graph network for video captioning
Rao, Qi, Yu, Xin, Li, Guang and Zhu, Linchao (2024). CMGNet: Collaborative multi-modal graph network for video captioning. Computer Vision and Image Understanding, 238 103864, 1-10. doi: 10.1016/j.cviu.2023.103864
2024
Journal Article
MarkerNet: A divide-and-conquer solution to motion capture solving from raw markers
Hu, Zhipeng, Tang, Jilin, Li, Lincheng, Hou, Jie, Xin, Haoran, Yu, Xin and Bu, Jiajun (2024). MarkerNet: A divide-and-conquer solution to motion capture solving from raw markers. Computer Animation and Virtual Worlds, 35 (1) e2228, 1-19. doi: 10.1002/cav.2228
2024
Journal Article
EmotionGesture: audio-driven diverse emotional co-speech 3D gesture generation
Qi, Xingqun, Liu, Chen, Li, Lincheng, Hou, Jie, Xin, Haoran and Yu, Xin (2024). EmotionGesture: audio-driven diverse emotional co-speech 3D gesture generation. IEEE Transactions on Multimedia, 26, 10420-10430. doi: 10.1109/tmm.2024.3407692
Funding
Current funding
Past funding
Supervision
Availability
- Dr Xin Yu is:
- Available for supervision
Looking for a supervisor? Read our advice on how to choose a supervisor.
Supervision history
Current supervision
-
Doctor Philosophy
Human Posture Recognition Applied to Physical Activity
Principal Advisor
Other advisors: Professor Sean Tweedy
-
Doctor Philosophy
Two way Auslan Translation
Principal Advisor
Other advisors: Associate Professor Mahsa Baktashmotlagh, Dr Heming Du
-
Doctor Philosophy
Human Understanding in Sports
Principal Advisor
Other advisors: Associate Professor Sen Wang, Dr Heming Du
-
Doctor Philosophy
Pose Estimation for Human with Disabilities
Principal Advisor
Other advisors: Professor Brian Lovell
-
Doctor Philosophy
Two way Auslan Translation
Principal Advisor
Other advisors: Professor Helen Huang, Dr Heming Du
-
Doctor Philosophy
Compressed Video Restoration
Principal Advisor
Other advisors: Dr Miao Xu, Dr Heming Du
-
Doctor Philosophy
Object-Centric Audio-Visual Alignment for Sounding Source Segmentation
Principal Advisor
Other advisors: Associate Professor Sen Wang
-
Doctor Philosophy
Understanding Human Intention and Performance
Principal Advisor
Other advisors: Dr Heming Du, Dr Miao Xu
-
Doctor Philosophy
Integrating Deep Learning and Remote Sensing for Precision Agriculture in Staple Crops
Principal Advisor
Other advisors: Dr Miao Xu
-
Doctor Philosophy
Multimodal foundation model design and analysis
Principal Advisor
Other advisors: Dr Miao Xu, Dr Heming Du
-
Doctor Philosophy
Understanding Human Intention and Performance
Principal Advisor
Other advisors: Associate Professor Sen Wang
-
Doctor Philosophy
Understanding Human Movements and Sport Performance Analysis
Principal Advisor
Other advisors: Dr Miao Xu
-
Doctor Philosophy
Effective Visual Data Compression
Principal Advisor
Other advisors: Associate Professor Sen Wang, Dr Heming Du
-
Doctor Philosophy
Automatic Retinal Health Monitoring through Multi-modal Medical Imaging
Principal Advisor
Other advisors: Associate Professor Mahsa Baktashmotlagh
-
Doctor Philosophy
Combating evolving deceptive fake visual information through deepfake detection
Principal Advisor
Other advisors: Dr Miao Xu
-
Doctor Philosophy
Remote Sensing Analysis in computer vision
Associate Advisor
Other advisors: Professor Helen Huang
-
Doctor Philosophy
Towards knowledge discovery from imperfect and evolving data
Associate Advisor
Other advisors: Dr Miao Xu
-
Doctor Philosophy
Data driven approaches for smart farming
Associate Advisor
Other advisors: Professor Helen Huang
-
Doctor Philosophy
Enhancing Robustness and Generalizability in Computational Models
Associate Advisor
Other advisors: Associate Professor Mahsa Baktashmotlagh
Media
Enquiries
For media enquiries about Dr Xin Yu's areas of expertise, story ideas and help finding experts, contact our Media team: