Skip to menu Skip to content Skip to footer
Dr Xin Yu
Dr

Xin Yu

Email: 

Overview

Background

My name is Xin Yu, a Senior Lecturer at the University of Queensland. I am an Australian Research Council Discovery Early Career Researcher Award 2023-2025 (DECRA) recipient and an awardee of the prestigious Google Research Scholar Program in 2021. Previously, I was a research fellow at the Australian National University (ANU). I received my PhD degree from the Australian National Unversity under the supervision of Prof. Richard Hartley, Prof. Fatih Porikli and Dr. Basura Fernando. I also received a PhD degree from Tsinghua University supervised by Prof. Li Zhang. I am interested in Computer Vision and Machine Learning topics.

My research topics includes various computer vision and machine learning tasks, especially in efficient low-level image processing, image retrieval and localization, action recognition, 3D pose estimation, visual navigation and sign language recognition and translation.

Availability

Dr Xin Yu is:
Available for supervision

Research impacts

One of my research papers has been awarded "Best Paper Honorable Mention" award in the premium computer vision conference WACV 2020, and one paper has been nominated for the Best Paper Award in CVPR 2020.

I was awarded the Outstanding Reviewer Award in ECCV 2020, CVPR 2021 and ICCV 2021. CVPR, ICCV and ECCV are internationally world-leading computer vision and machine learning conferences. My research interests include deep learning techniques, image processing, and computer vision tasks. I am a program committee member of top-tier computer vision and machine learning conferences, such as CVPR, ICCV, ECCV, ICML, ICLR and NeurIPS, and a reviewer of prestigious journals, such as TPAMI, IJCV and TIP.

I am happy to supervise self-motivated PhD and MPhil students. If you are an undergraduate student and willing to conduct your honour project, please drop me an email.

Works

Search Professor Xin Yu’s works on UQ eSpace

142 works between 2011 and 2024

1 - 20 of 142 works

2024

Journal Article

M3 A: A multimodal misinformation dataset for media authenticity analysis

Xu, Qingzheng, Chen, Huiqiang, Du, Heming, Zhang, Hu, Łukasik, Szymon, Zhu, Tianqing and Yu, Xin (2024). M3 A: A multimodal misinformation dataset for media authenticity analysis. Computer Vision and Image Understanding, 249 104205. doi: 10.1016/j.cviu.2024.104205

M3 A: A multimodal misinformation dataset for media authenticity analysis

2024

Book Chapter

Vision-Based Abnormal Action Dataset for Recognising Body Motion Disorders

Ying, Jiaying, Shen, Xin and Yu, Xin (2024). Vision-Based Abnormal Action Dataset for Recognising Body Motion Disorders. Lecture Notes in Computer Science. (pp. 443-455) Singapore: Springer Nature Singapore. doi: 10.1007/978-981-96-0351-0_33

Vision-Based Abnormal Action Dataset for Recognising Body Motion Disorders

2024

Book Chapter

OpenSight: A Simple Open-Vocabulary Framework for LiDAR-Based Object Detection

Zhang, Hu, Xu, Jianhua, Tang, Tao, Sun, Haiyang, Yu, Xin, Huang, Zi and Yu, Kaicheng (2024). OpenSight: A Simple Open-Vocabulary Framework for LiDAR-Based Object Detection. Lecture Notes in Computer Science. (pp. 1-19) Cham: Springer Nature Switzerland. doi: 10.1007/978-3-031-72907-2_1

OpenSight: A Simple Open-Vocabulary Framework for LiDAR-Based Object Detection

2024

Conference Publication

Benchmarking In-the-Wild Multimodal Disease Recognition and A Versatile Baseline

Wei, Tianqi, Chen, Zhi, Huang, Zi and Yu, Xin (2024). Benchmarking In-the-Wild Multimodal Disease Recognition and A Versatile Baseline. New York, NY, USA: ACM. doi: 10.1145/3664647.3680599

Benchmarking In-the-Wild Multimodal Disease Recognition and A Versatile Baseline

2024

Journal Article

Ethics-aware face recognition aided by synthetic face images

Du, Xiaobiao, Yu, Xin, Liu, Jinhui, Dai, Beifen and Xu, Feng (2024). Ethics-aware face recognition aided by synthetic face images. Neurocomputing, 600 128129, 128129. doi: 10.1016/j.neucom.2024.128129

Ethics-aware face recognition aided by synthetic face images

2024

Journal Article

Restoring consciousness with pharmacologic therapy: mechanisms, targets, and future directions

Barra, Megan E., Solt, Ken, Yu, Xin and Edlow, Brian L. (2024). Restoring consciousness with pharmacologic therapy: mechanisms, targets, and future directions. Neurotherapeutics, 21 (4) e00374, e00374. doi: 10.1016/j.neurot.2024.e00374

Restoring consciousness with pharmacologic therapy: mechanisms, targets, and future directions

2024

Conference Publication

An effective ensemble learning framework for affective behaviour analysis

Zhang, Wei, Qiu, Feng, Liu, Chen, Li, Lincheng, Du, Heming, Guo, Tianchen and Yu, Xin (2024). An effective ensemble learning framework for affective behaviour analysis. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Seattle, WA, United States, 17-18 June 2024. Piscataway, NJ, United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/cvprw63382.2024.00479

An effective ensemble learning framework for affective behaviour analysis

2024

Conference Publication

Learning transferable compound expressions from Masked AutoEncoder pretraining

Qiu, Feng, Du, Heming, Zhang, Wei, Liu, Chen, Li, Lincheng, Guo, Tianchen and Yu, Xin (2024). Learning transferable compound expressions from Masked AutoEncoder pretraining. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Seattle, WA, United States, 17-18 June 2024. Piscataway, NJ, United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/cvprw63382.2024.00476

Learning transferable compound expressions from Masked AutoEncoder pretraining

2024

Conference Publication

Language-guided multi-modal emotional mimicry intensity estimation

Qiu, Feng, Zhang, Wei, Liu, Chen, Li, Lincheng, Du, Heming, Guo, Tianchen and Yu, Xin (2024). Language-guided multi-modal emotional mimicry intensity estimation. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Seattle, WA, United States, 17-18 June 2024. Piscataway, NJ, United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/cvprw63382.2024.00477

Language-guided multi-modal emotional mimicry intensity estimation

2024

Journal Article

Proactive image manipulation detection via deep semi-fragile watermark

Zhao, Yuan, Liu, Bo, Zhu, Tianqing, Ding, Ming, Yu, Xin and Zhou, Wanlei (2024). Proactive image manipulation detection via deep semi-fragile watermark. Neurocomputing, 585 127593. doi: 10.1016/j.neucom.2024.127593

Proactive image manipulation detection via deep semi-fragile watermark

2024

Journal Article

AI empowered Auslan learning for parents of deaf children and children of deaf adults

Sheng, Hongwei, Shen, Xin, Du, Heming, Zhang, Hu, Huang, Zi and Yu, Xin (2024). AI empowered Auslan learning for parents of deaf children and children of deaf adults. AI and Ethics, 4 (4), 1-11. doi: 10.1007/s43681-024-00457-y

AI empowered Auslan learning for parents of deaf children and children of deaf adults

2024

Journal Article

Detecting facial action units from global-local fine-grained expressions

Zhang, Wei, Li, Lincheng, Ding, Yu, Chen, Wei, Deng, Zhigang and Yu, Xin (2024). Detecting facial action units from global-local fine-grained expressions. IEEE Transactions on Circuits and Systems for Video Technology, 34 (2), 983-994. doi: 10.1109/tcsvt.2023.3288903

Detecting facial action units from global-local fine-grained expressions

2024

Conference Publication

When 3D bounding-box meets SAM: point cloud instance segmentation with weak-and-noisy supervision

Yu, Qingtao, Du, Heming, Liu, Chen and Yu, Xin (2024). When 3D bounding-box meets SAM: point cloud instance segmentation with weak-and-noisy supervision. 2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI, United States, 3-8 January 2024. Piscataway, NJ, United States: IEEE. doi: 10.1109/wacv57701.2024.00368

When 3D bounding-box meets SAM: point cloud instance segmentation with weak-and-noisy supervision

2024

Journal Article

StyleTalk++: A Unified Framework for Controlling the Speaking Styles of Talking Heads

Wang, Suzhen, Ma, Yifeng, Ding, Yu, Hu, Zhipeng, Fan, Changjie, Lv, Tangjie, Deng, Zhidong and Yu, Xin (2024). StyleTalk++: A Unified Framework for Controlling the Speaking Styles of Talking Heads. IEEE Transactions on Pattern Analysis and Machine Intelligence, PP (6), 1-17. doi: 10.1109/tpami.2024.3357808

StyleTalk++: A Unified Framework for Controlling the Speaking Styles of Talking Heads

2024

Conference Publication

AS-NeRF: learning auxiliary sampling for generalizable novel view synthesis from sparse views

Tang, Jilin, Li, Lincheng, Qi, Xingqun, Chen, Yingfeng, Fan, Changjie and Yu, Xin (2024). AS-NeRF: learning auxiliary sampling for generalizable novel view synthesis from sparse views. 2024 IEEE International Conference on Multimedia and Expo (ICME), Niagara Falls, ON, Canada, 15-19 July 2024. Washington, DC, United States: IEEE Computer Society. doi: 10.1109/ICME57554.2024.10688126

AS-NeRF: learning auxiliary sampling for generalizable novel view synthesis from sparse views

2024

Conference Publication

Benchmarking audio visual segmentation for long-untrimmed videos

Liu, Chen, Li, Peike Patrick, Yu, Qingtao, Sheng, Hongwei, Wang, Dadong, Li, Lincheng and Yu, Xin (2024). Benchmarking audio visual segmentation for long-untrimmed videos. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, United States, 16-22 June 2024. Washington, DC, United States: IEEE Computer Society. doi: 10.1109/CVPR52733.2024.02143

Benchmarking audio visual segmentation for long-untrimmed videos

2024

Journal Article

MarkerNet: A divide-and-conquer solution to motion capture solving from raw markers

Hu, Zhipeng, Tang, Jilin, Li, Lincheng, Hou, Jie, Xin, Haoran, Yu, Xin and Bu, Jiajun (2024). MarkerNet: A divide-and-conquer solution to motion capture solving from raw markers. Computer Animation and Virtual Worlds, 35 (1) e2228. doi: 10.1002/cav.2228

MarkerNet: A divide-and-conquer solution to motion capture solving from raw markers

2024

Journal Article

EmotionGesture: Audio-Driven Diverse Emotional Co-Speech 3D Gesture Generation

Qi, Xingqun, Liu, Chen, Li, Lincheng, Hou, Jie, Xin, Haoran and Yu, Xin (2024). EmotionGesture: Audio-Driven Diverse Emotional Co-Speech 3D Gesture Generation. IEEE Transactions on Multimedia, 26, 1-11. doi: 10.1109/tmm.2024.3407692

EmotionGesture: Audio-Driven Diverse Emotional Co-Speech 3D Gesture Generation

2024

Journal Article

CBARF: Cascaded Bundle-Adjusting Neural Radiance Fields From Imperfect Camera Poses

Fu, Hongyu, Yu, Xin, Li, Lincheng and Zhang, Li (2024). CBARF: Cascaded Bundle-Adjusting Neural Radiance Fields From Imperfect Camera Poses. IEEE Transactions on Multimedia, 26, 1-12. doi: 10.1109/tmm.2024.3388929

CBARF: Cascaded Bundle-Adjusting Neural Radiance Fields From Imperfect Camera Poses

2024

Conference Publication

MMOOC: a multimodal misinformation dataset for out-of-context news analysis

Xu, Qingzheng, Du, Heming, Chen, Huiqiang, Liu, Bo and Yu, Xin (2024). MMOOC: a multimodal misinformation dataset for out-of-context news analysis. 29th Australasian Conference, ACISP 2024, Sydney, NSW, Australia, 15–17 July 2024. Heidelberg, Germany: Springer. doi: 10.1007/978-981-97-5101-3_24

MMOOC: a multimodal misinformation dataset for out-of-context news analysis

Funding

Current funding

  • 2024 - 2027
    AI-Empowered and Video-Based uplift of Paralympic classification systems (AQIRP project administered by Follow Me AI)
    Follow Me AI Pty Ltd
    Open grant
  • 2023 - 2028
    Breaking the Communication Barrier for the Australian Deaf Community: Vision Based Australian Sign Language Translation and Production
    Google Asia Pacific Pte Ltd
    Open grant
  • 2023 - 2026
    Advancing Human Perception: Countering Evolving Malicious Fake Visual Data
    ARC Discovery Early Career Researcher Award
    Open grant
  • 2023 - 2027
    Analytics for the Australian Grains Industry (AAGI)
    Grains Research & Development Corporation
    Open grant
  • 2023 - 2025
    Two-way Auslan: Automatic Machine Translation of Australian Sign Language (ARC Discovery Project administered by ANU)
    The Australian National University
    Open grant

Past funding

  • 2024
    Breaking the Communication Barrier for the Australian Deaf Community: Vision Based Australian Sign Language Translation and Production
    Google Inc
    Open grant
  • 2023 - 2024
    Developing applications of satellite imagery for modelling environmental and social impacts of climate change on seaweed farming in Indonesia (KONEKSI Grant administered by Griffith University)
    Griffith University
    Open grant

Supervision

Availability

Dr Xin Yu is:
Available for supervision

Before you email them, read our advice on how to contact a supervisor.

Supervision history

Current supervision

Media

Enquiries

For media enquiries about Dr Xin Yu's areas of expertise, story ideas and help finding experts, contact our Media team:

communications@uq.edu.au