2023

Conference Publication

Auslan-Daily: Australian sign language translation for daily communication and news

Shen, Xin, Yuan, Shaozu, Sheng, Hongwei, Du, Heming and Yu, Xin (2023). Auslan-Daily: Australian sign language translation for daily communication and news. 37th Conference on Neural Information Processing Systems (NeurIPS 2023), New Orleans, LA, United States, 10-16 December 2023. San Diego, CA, United States: Neural Information Processing Systems Foundation.

2022

Conference Publication

MHR-Net: Multiple-Hypothesis Reconstruction of Non-Rigid Shapes from 2D Views

Zeng, Haitian, Yu, Xin, Miao, Jiaxu and Yang, Yi (2022). MHR-Net: Multiple-Hypothesis Reconstruction of Non-Rigid Shapes from 2D Views. European Conference on Computer Vision (ECCV 2022), Tel Aviv, Israel, 23-27 October 2022. Cham, Switzerland: Springer. doi: 10.1007/978-3-031-20086-1_1

2022

Conference Publication

Instance as identity: a generic online paradigm for video instance segmentation

Zhu, Feng, Yang, Zongxin, Yu, Xin, Yang, Yi and Wei, Yunchao (2022). Instance as identity: a generic online paradigm for video instance segmentation. Computer Vision – ECCV 2022 17th European Conference, Tel Aviv, Israel, 23–27 October 2022. Cham, Switzerland: Springer. doi: 10.1007/978-3-031-19818-2_30

2022

Conference Publication

Learning Implicit Body Representations from Double Diffusion Based Neural Radiance Fields

Yao, Guangming, Wu, Hongzhi, Yuan, Yi, Li, Lincheng, Zhou, Kun and Yu, Xin (2022). Learning Implicit Body Representations from Double Diffusion Based Neural Radiance Fields. Thirty-First International Joint Conference on Artificial Intelligence IJCAI-ECAI 2022, Vienna, Austria, 23-29 July 2022. Los Angeles, CA United States: International Joint Conferences on Artificial Intelligence Organization. doi: 10.24963/ijcai.2022/218

2022

Conference Publication

One-shot talking face generation from single-speaker audio-visual correlation learning

Wang, Suzhen, Li, Lincheng, Ding, Yu and Yu, Xin (2022). One-shot talking face generation from single-speaker audio-visual correlation learning. 36th AAAI Conference on Artificial Intelligence / 34th Conference on Innovative Applications of Artificial Intelligence / 12th Symposium on Educational Advances in Artificial Intelligence, Online, 22 February – 1 March 2022. Palo Alto, CA United States: Association for the Advancement of Artificial Intelligence. doi: 10.1609/aaai.v36i3.20154

2022

Conference Publication

Monocular camera-based point-goal navigation by learning depth channel and cross-modality pyramid fusion

Tang, Tianqi, Du, Heming, Yu, Xin and Yang, Yi (2022). Monocular camera-based point-goal navigation by learning depth channel and cross-modality pyramid fusion. 36th AAAI Conference on Artificial Intelligence / 34th Conference on Innovative Applications of Artificial Intelligence / 12th Symposium on Educational Advances in Artificial Intelligence, Online, 22 February – 1 March 2022. Palo Alto, CA United States: Association for the Advancement of Artificial Intelligence. doi: 10.1609/aaai.v36i5.20480

2022

Conference Publication

Batch Multi-Fidelity Active Learning with Budget Constraints

Li, Shibo, Phillips, Jeff M., Yu, Xin, Kirby, Robert M. and Zhe, Shandian (2022). Batch Multi-Fidelity Active Learning with Budget Constraints. 36th Conference on Neural Information Processing Systems (NeurIPS 2022), Online, 28 November - 9 December 2022. Maryland Heights, MO United States: Morgan Kaufmann Publishers.

2021

Conference Publication

End-to-end multi-instance robotic reaching from monocular vision

Zhuang, Zheyu, Yu, Xin and Mahony, Robert (2021). End-to-end multi-instance robotic reaching from monocular vision. IEEE International Conference on Robotics and Automation (ICRA), Xi'an, China, 30 May - 5 June 2021. Washington, DC United States: IEEE Computer Society. doi: 10.1109/ICRA48506.2021.9561518

2021

Conference Publication

Audio2Head: Audio-driven One-shot Talking-head Generation with Natural Head Motion

Wang, Suzhen, Li, Lincheng, Ding, Yu, Fan, Changjie and Yu, Xin (2021). Audio2Head: Audio-driven One-shot Talking-head Generation with Natural Head Motion. Thirtieth International Joint Conference on Artificial Intelligence, Montreal, Canada, 19-27 August 2021. Los Angeles, CA United States: International Joint Conferences on Artificial Intelligence Organization. doi: 10.24963/ijcai.2021/152

2021

Conference Publication

The IKEA ASM dataset: understanding people assembling furniture through actions, objects and pose

Ben-Shabat, Yizhak, Yu, Xin, Saleh, Fatemeh, Campbell, Dylan, Rodriguez-Opazo, Cristian, Li, Hongdong and Gould, Stephen (2021). The IKEA ASM dataset: understanding people assembling furniture through actions, objects and pose. IEEE Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI United States, 5-9 January 2021. Piscataway, NJ United States: IEEE Computer Society. doi: 10.1109/WACV48630.2021.00089

2021

Conference Publication

Auto-navigator: decoupled neural architecture search for visual navigation

Tang, Tianqi, Yu, Xin, Dong, Xuanyi and Yang, Yi (2021). Auto-navigator: decoupled neural architecture search for visual navigation. IEEE Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI United States, 5-9 January 2021. Piscataway, NJ United States: IEEE. doi: 10.1109/WACV48630.2021.00379

2021

Conference Publication

Write-a-speaker: Text-based emotional and rhythmic talking-head generation

Li, Lincheng, Wang, Suzhen, Zhang, Zhimeng, Ding, Yu, Zheng, Yixing, Yu, Xin and Fan, Changjie (2021). Write-a-speaker: Text-based emotional and rhythmic talking-head generation. 35th AAAI Conference on Artificial Intelligence / 33rd Conference on Innovative Applications of Artificial Intelligence / 11th Symposium on Educational Advances in Artificial Intelligence, Online, 2–9 February 2021. Palo Alto, CA United States: Association for the Advancement of Artificial Intelligence. doi: 10.1609/aaai.v35i3.16286

2021

Conference Publication

Modeling the probabilistic distribution of unlabeled data for one-shot medical image segmentation

Ding, Yuhang, Yu, Xin and Yang, Yi (2021). Modeling the probabilistic distribution of unlabeled data for one-shot medical image segmentation. 35th AAAI Conference on Artificial Intelligence / 33rd Conference on Innovative Applications of Artificial Intelligence / 11th Symposium on Educational Advances in Artificial Intelligence, Online, 2–9 February 2021. Palo Alto, CA United States: Association for the Advancement of Artificial Intelligence. doi: 10.1609/aaai.v35i2.16212

2021

Conference Publication

VTNET: visual transformer network for object goal navigation

Du, Heming, Yu, Xin and Zheng, Liang (2021). VTNET: visual transformer network for object goal navigation. 9th International Conference on Learning Representations, Virtual, 3-7 May 2021. Appleton, WI, United States: International Conference on Learning Representations.

2021

Conference Publication

PSTNET: point spatio-temporal convolution on point cloud sequences

Fan, Hehe, Yu, Xin, Ding, Yuhang, Yang, Yi and Kankanhalli, Mohan (2021). PSTNET: point spatio-temporal convolution on point cloud sequences. 9th International Conference on Learning Representations, Virtual, 3-7 May 2021. International Conference on Learning Representations, ICLR.

2021

Conference Publication

Leaping from 2D Detection to Efficient 6DoF Object Pose Estimation

Liu, Jinhui, Zou, Zhikang, Ye, Xiaoqing, Tan, Xiao, Ding, Errui, Xu, Feng and Yu, Xin (2021). Leaping from 2D Detection to Efficient 6DoF Object Pose Estimation. European Conference on Computer Vision ECCV 2020, Glasgow, United Kingdom, 23–28 August 2020. Cham, Switzerland: Springer. doi: 10.1007/978-3-030-66096-3_47

2021

Conference Publication

DSC-PoseNet: learning 6DoF object pose estimation via dual-scale consistency

Yang, Zongxin, Yu, Xin and Yang, Yi (2021). DSC-PoseNet: learning 6DoF object pose estimation via dual-scale consistency. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Virtual, 19-25 June 2021. Washington, DC, United States: IEEE Computer Society. doi: 10.1109/CVPR46437.2021.00390

2021

Conference Publication

RFNet: region-aware fusion network for incomplete multi-modal brain tumor segmentation

Ding, Yuhang, Yu, Xin and Yang, Yi (2021). RFNet: region-aware fusion network for incomplete multi-modal brain tumor segmentation. 18th IEEE/CVF International Conference on Computer Vision (ICCV), Virtual, 11-17 October 2021. New York, NY, United States: IEEE. doi: 10.1109/ICCV48922.2021.00394

2021

Conference Publication

Self-supervised visibility learning for novel view synthesis

Shi, Yujiao, Li, Hongdong and Yu, Xin (2021). Self-supervised visibility learning for novel view synthesis. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Virtual, 19-25 June 2021. Washington, DC, United States: IEEE Computer Society. doi: 10.1109/CVPR46437.2021.00955

2021

Conference Publication

RGB-D saliency detection via cascaded mutual information minimization

Zhang, Jing, Fan, Deng-Ping, Dai, Yuchao, Yu, Xin, Zhong, Yiran, Barnes, Nick and Shao, Ling (2021). RGB-D saliency detection via cascaded mutual information minimization. 18th IEEE/CVF International Conference on Computer Vision (ICCV), Virtual, 11-17 October 2021. New York, NY, United States: IEEE. doi: 10.1109/ICCV48922.2021.00430
