Expert publications - About - The University of Queensland

All (156) Journal Article (57) Book Chapter (2) Conference Publication (97)

2023

Conference Publication

Sim2RealVS: A new benchmark for video stabilization with a strong baseline

Rao, Qi, Yu, Xin, Navasardyan, Shant and Shi, Humphrey (2023). Sim2RealVS: A new benchmark for video stabilization with a strong baseline. 23rd IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI United States, 3-7 January 2023. Piscataway, NJ United States: IEEE. doi: 10.1109/wacv56688.2023.00537

2023

Conference Publication

Meta knowledge condensation for federated learning

Liu, Ping, Yu, Xin and Zhou, Joey Tianyi (2023). Meta knowledge condensation for federated learning. 11th International Conference on Learning Representations, ICLR 2023, Kigali, Rwanda, 1-5 May 2023. International Conference on Learning Representations, ICLR.

2023

Conference Publication

Auslan-Daily: Australian sign language translation for daily communication and news

Shen, Xin, Yuan, Shaozu, Sheng, Hongwei, Du, Heming and Yu, Xin (2023). Auslan-Daily: Australian sign language translation for daily communication and news. 37th Conference on Neural Information Processing Systems (NeurIPS 2023), New Orleans, LA, United States, 10-16 December 2023. San Diego, CA, United States: Neural Information Processing Systems Foundation.

2023

Conference Publication

CVLNet: Cross-view Semantic Correspondence Learning for Video-Based Camera Localization

Shi, Yujiao, Yu, Xin, Wang, Shan and Li, Hongdong (2023). CVLNet: Cross-view Semantic Correspondence Learning for Video-Based Camera Localization. 16th Asian Conference on Computer Vision, Macao, China, 4–8 December 2022. Cham, Switzerland: Springer. doi: 10.1007/978-3-031-26319-4_8

2023

Conference Publication

RVD: a handheld device-based fundus video dataset for retinal vessel segmentation

Khan, Md Wahiduzzaman, Sheng, Hongwei, Zhang, Hu, Du, Heming, Wang, Sen, Coroneo, Minas Theodore, Hajati, Farshid, Shariflou, Sahar, Kalloniatis, Michael, Phu, Jack, Agar, Ashish, Huang, Zi, Golzan, Mojtaba and Yu, Xin (2023). RVD: a handheld device-based fundus video dataset for retinal vessel segmentation. 37th Conference on Neural Information Processing Systems (NeurIPS 2023) Track on Datasets and Benchmarks, New Orleans, LA, United States, 10-16 December 2023. Maryland Heights, MO, United States: Morgan Kaufmann Publishers.

2022

Conference Publication

MHR-Net: Multiple-Hypothesis Reconstruction of Non-Rigid Shapes from 2D Views

Zeng, Haitian, Yu, Xin, Miao, Jiaxu and Yang, Yi (2022). MHR-Net: Multiple-Hypothesis Reconstruction of Non-Rigid Shapes from 2D Views. European Conference on Computer Vision (ECCV 2022), Tel Aviv, Israel, 23-27 October 2022. Cham, Switzerland: Springer. doi: 10.1007/978-3-031-20086-1_1

2022

Conference Publication

Instance as identity: a generic online paradigm for video instance segmentation

Zhu, Feng, Yang, Zongxin, Yu, Xin, Yang, Yi and Wei, Yunchao (2022). Instance as identity: a generic online paradigm for video instance segmentation. Computer Vision – ECCV 2022 17th European Conference, Tel Aviv, Israel, 23–27 October 2022. Cham, Switzerland: Springer. doi: 10.1007/978-3-031-19818-2_30

2022

Conference Publication

Learning Implicit Body Representations from Double Diffusion Based Neural Radiance Fields

Yao, Guangming, Wu, Hongzhi, Yuan, Yi, Li, Lincheng, Zhou, Kun and Yu, Xin (2022). Learning Implicit Body Representations from Double Diffusion Based Neural Radiance Fields. Thirty-First International Joint Conference on Artificial Intelligence IJCAI-ECAI 2022, Vienna, Austria, 23-29 July 2022. Los Angeles, CA United States: International Joint Conferences on Artificial Intelligence Organization. doi: 10.24963/ijcai.2022/218

2022

Conference Publication

One-shot talking face generation from single-speaker audio-visual correlation learning

Wang, Suzhen, Li, Lincheng, Ding, Yu and Yu, Xin (2022). One-shot talking face generation from single-speaker audio-visual correlation learning. 36th AAAI Conference on Artificial Intelligence / 34th Conference on Innovative Applications of Artificial Intelligence / 12th Symposium on Educational Advances in Artificial Intelligence, Online, 22 February – 1 March 2022. Palo Alto, CA United States: Association for the Advancement of Artificial Intelligence. doi: 10.1609/aaai.v36i3.20154

2022

Conference Publication

Monocular camera-based point-goal navigation by learning depth channel and cross-modality pyramid fusion

Tang, Tianqi, Du, Heming, Yu, Xin and Yang, Yi (2022). Monocular camera-based point-goal navigation by learning depth channel and cross-modality pyramid fusion. 36th AAAI Conference on Artificial Intelligence / 34th Conference on Innovative Applications of Artificial Intelligence / 12th Symposium on Educational Advances in Artificial Intelligence, Online, 22 February – 1 March 2022. Palo Alto, CA United States: Association for the Advancement of Artificial Intelligence. doi: 10.1609/aaai.v36i5.20480

2022

Conference Publication

Batch Multi-Fidelity Active Learning with Budget Constraints

Li, Shibo, Phillips, Jeff M., Yu, Xin, Kirby, Robert M. and Zhe, Shandian (2022). Batch Multi-Fidelity Active Learning with Budget Constraints. 36th Conference on Neural Information Processing Systems (NeurIPS 2022), Online, 28 November - 9 December 2022. Maryland Heights, MO United States: Morgan Kaufmann Publishers.

2021

Conference Publication

End-to-end multi-instance robotic reaching from monocular vision

Zhuang, Zheyu, Yu, Xin and Mahony, Robert (2021). End-to-end multi-instance robotic reaching from monocular vision. IEEE International Conference on Robotics and Automation (ICRA), Xi'an, China, 30 May - 5 June 2021. Washington, DC United States: IEEE Computer Society. doi: 10.1109/ICRA48506.2021.9561518

2021

Conference Publication

Audio2Head: Audio-driven One-shot Talking-head Generation with Natural Head Motion

Wang, Suzhen, Li, Lincheng, Ding, Yu, Fan, Changjie and Yu, Xin (2021). Audio2Head: Audio-driven One-shot Talking-head Generation with Natural Head Motion. Thirtieth International Joint Conference on Artificial Intelligence, Montreal, Canada, 19-27 August 2021. Los Angeles, CA United States: International Joint Conferences on Artificial Intelligence Organization. doi: 10.24963/ijcai.2021/152

2021

Conference Publication

The IKEA ASM dataset: understanding people assembling furniture through actions, objects and pose

Ben-Shabat, Yizhak, Yu, Xin, Saleh, Fatemeh, Campbell, Dylan, Rodriguez-Opazo, Cristian, Li, Hongdong and Gould, Stephen (2021). The IKEA ASM dataset: understanding people assembling furniture through actions, objects and pose. IEEE Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI United States, 5-9 January 2021. Piscataway, NJ United States: IEEE Computer Society. doi: 10.1109/WACV48630.2021.00089

2021

Conference Publication

Auto-navigator: decoupled neural architecture search for visual navigation

Tang, Tianqi, Yu, Xin, Dong, Xuanyi and Yang, Yi (2021). Auto-navigator: decoupled neural architecture search for visual navigation. IEEE Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI United States, 5-9 January 2021. Piscataway, NJ United States: IEEE. doi: 10.1109/WACV48630.2021.00379

2021

Conference Publication

Modeling the probabilistic distribution of unlabeled data for one-shot medical image segmentation

Ding, Yuhang, Yu, Xin and Yang, Yi (2021). Modeling the probabilistic distribution of unlabeled data for one-shot medical image segmentation. 35th AAAI Conference on Artificial Intelligence / 33rd Conference on Innovative Applications of Artificial Intelligence / 11th Symposium on Educational Advances in Artificial Intelligence, Online, 2–9 February 2021. Palo Alto, CA United States: Association for the Advancement of Artificial Intelligence. doi: 10.1609/aaai.v35i2.16212

2021

Conference Publication

Write-a-speaker: Text-based emotional and rhythmic talking-head generation

Li, Lincheng, Wang, Suzhen, Zhang, Zhimeng, Ding, Yu, Zheng, Yixing, Yu, Xin and Fan, Changjie (2021). Write-a-speaker: Text-based emotional and rhythmic talking-head generation. 35th AAAI Conference on Artificial Intelligence / 33rd Conference on Innovative Applications of Artificial Intelligence / 11th Symposium on Educational Advances in Artificial Intelligence, Online, 2–9 February 2021. Palo Alto, CA United States: Association for the Advancement of Artificial Intelligence. doi: 10.1609/aaai.v35i3.16286

2021

Conference Publication

PSTNET: point spatio-temporal convolution on point cloud sequences

Fan, Hehe, Yu, Xin, Ding, Yuhang, Yang, Yi and Kankanhalli, Mohan (2021). PSTNET: point spatio-temporal convolution on point cloud sequences. 9th International Conference on Learning Representations, Virtual, 3-7 May 2021. International Conference on Learning Representations, ICLR.

2021

Conference Publication

VTNET: visual transformer network for object goal navigation

Du, Heming, Yu, Xin and Zheng, Liang (2021). VTNET: visual transformer network for object goal navigation. 9th International Conference on Learning Representations, Virtual, 3-7 May 2021. Appleton WI USA: International Conference on Learning Representations.

2021

Conference Publication

Leaping from 2D Detection to Efficient 6DoF Object Pose Estimation

Liu, Jinhui, Zou, Zhikang, Ye, Xiaoqing, Tan, Xiao, Ding, Errui, Xu, Feng and Yu, Xin (2021). Leaping from 2D Detection to Efficient 6DoF Object Pose Estimation. European Conference on Computer Vision ECCV 2020, Glasgow, United Kingdom, 23–28 August 2020. Cham, Switzerland: Springer. doi: 10.1007/978-3-030-66096-3_47
