Skip to menu Skip to content Skip to footer

2022

Journal Article

Geometry-guided street-view panorama synthesis from satellite imagery

Shi, Yujiao, Campbell, Dylan, Yu, Xin and Li, Hongdong (2022). Geometry-guided street-view panorama synthesis from satellite imagery. IEEE Transactions On Pattern Analysis and Machine Intelligence, 44 (12), 10009-10022. doi: 10.1109/TPAMI.2022.3140750

Geometry-guided street-view panorama synthesis from satellite imagery

2022

Journal Article

Deep hierarchical representation of point cloud videos via spatio-temporal decomposition

Fan, Hehe, Yu, Xin, Yang, Yi and Kankanhalli, Mohan (2022). Deep hierarchical representation of point cloud videos via spatio-temporal decomposition. IEEE Transactions On Pattern Analysis and Machine Intelligence, 44 (12), 9918-9930. doi: 10.1109/TPAMI.2021.3135117

Deep hierarchical representation of point cloud videos via spatio-temporal decomposition

2022

Journal Article

Single image based 3D human pose estimation via uncertainty learning

Han, Chuchu, Yu, Xin, Gao, Changxin, Sang, Nong and Yang, Yi (2022). Single image based 3D human pose estimation via uncertainty learning. Pattern Recognition, 132 108934, 108934. doi: 10.1016/j.patcog.2022.108934

Single image based 3D human pose estimation via uncertainty learning

2022

Conference Publication

MHR-Net: Multiple-Hypothesis Reconstruction of Non-Rigid Shapes from 2D Views

Zeng, Haitian, Yu, Xin, Miao, Jiaxu and Yang, Yi (2022). MHR-Net: Multiple-Hypothesis Reconstruction of Non-Rigid Shapes from 2D Views. European Conference on Computer Vision (ECCV 2022), Tel Aviv, Israel, 23-27 October 2022. Cham, Switzerland: Springer. doi: 10.1007/978-3-031-20086-1_1

MHR-Net: Multiple-Hypothesis Reconstruction of Non-Rigid Shapes from 2D Views

2022

Conference Publication

Instance as identity: a generic online paradigm for video instance segmentation

Zhu, Feng, Yang, Zongxin, Yu, Xin, Yang, Yi and Wei, Yunchao (2022). Instance as identity: a generic online paradigm for video instance segmentation. Computer Vision – ECCV 2022 17th European Conference, Tel Aviv, Israel, 23–27 October 2022. Cham, Switzerland: Springer. doi: 10.1007/978-3-031-19818-2_30

Instance as identity: a generic online paradigm for video instance segmentation

2022

Journal Article

Recursive copy and paste GAN: face hallucination from shaded thumbnails

Zhang, Yang, Tsang, Ivor W., Luo, Yawei, Hu, Changhui, Lu, Xiaobo and Yu, Xin (2022). Recursive copy and paste GAN: face hallucination from shaded thumbnails. IEEE Transactions On Pattern Analysis and Machine Intelligence, 44 (8), 4321-4338. doi: 10.1109/TPAMI.2021.3061312

Recursive copy and paste GAN: face hallucination from shaded thumbnails

2022

Conference Publication

Learning Implicit Body Representations from Double Diffusion Based Neural Radiance Fields

Yao, Guangming, Wu, Hongzhi, Yuan, Yi, Li, Lincheng, Zhou, Kun and Yu, Xin (2022). Learning Implicit Body Representations from Double Diffusion Based Neural Radiance Fields. Thirty-First International Joint Conference on Artificial Intelligence IJCAI-ECAI 2022, Vienna, Austria, 23-29 July 2022. Los Angeles, CA United States: International Joint Conferences on Artificial Intelligence Organization. doi: 10.24963/ijcai.2022/218

Learning Implicit Body Representations from Double Diffusion Based Neural Radiance Fields

2022

Conference Publication

One-shot talking face generation from single-speaker audio-visual correlation learning

Wang, Suzhen, Li, Lincheng, Ding, Yu and Yu, Xin (2022). One-shot talking face generation from single-speaker audio-visual correlation learning. 36th AAAI Conference on Artificial Intelligence / 34th Conference on Innovative Applications of Artificial Intelligence / 12th Symposium on Educational Advances in Artificial Intelligence, Online, 22 February –1 March 2022. Palo Alto, CA United States: ASSOC. doi: 10.1609/aaai.v36i3.20154

One-shot talking face generation from single-speaker audio-visual correlation learning

2022

Conference Publication

Monocular camera-based point-goal navigation by learning depth channel and cross-modality pyramid fusion

Tang, Tianqi, Du, Heming, Yu, Xin and Yang, Yi (2022). Monocular camera-based point-goal navigation by learning depth channel and cross-modality pyramid fusion. 36th AAAI Conference on Artificial Intelligence / 34th Conference on Innovative Applications of Artificial Intelligence / 12th Symposium on Educational Advances in Artificial Intelligence, Online, 22 February –1 March 2022. Palo Alto, CA United States: Association for the Advancement of Artificial Intelligence. doi: 10.1609/aaai.v36i5.20480

Monocular camera-based point-goal navigation by learning depth channel and cross-modality pyramid fusion

2022

Journal Article

High frame rate video reconstruction based on an event camera

Pan, Liyuan, Hartley, Richard, Scheerlinck, Cedric, Liu, Miaomiao, Yu, Xin and Dai, Yuchao (2022). High frame rate video reconstruction based on an event camera. IEEE Transactions On Pattern Analysis and Machine Intelligence, 44 (5), 2519-2533. doi: 10.1109/TPAMI.2020.3036667

High frame rate video reconstruction based on an event camera

2022

Journal Article

Single-image deraining via recurrent residual multiscale networks

Zheng, Yupei, Yu, Xin, Liu, Miaomiao and Zhang, Shunli (2022). Single-image deraining via recurrent residual multiscale networks. IEEE Transactions On Neural Networks and Learning Systems, 33 (3), 1310-1323. doi: 10.1109/TNNLS.2020.3041752

Single-image deraining via recurrent residual multiscale networks

2022

Journal Article

Weakly supervised RGB-D salient object detection with prediction consistency training and active scribble boosting

Xu, Yunqiu, Yu, Xin, Zhang, Jing, Zhu, Linchao and Wang, Dadong (2022). Weakly supervised RGB-D salient object detection with prediction consistency training and active scribble boosting. IEEE Transactions On Image Processing, 31, 2148-2161. doi: 10.1109/TIP.2022.3151999

Weakly supervised RGB-D salient object detection with prediction consistency training and active scribble boosting

2022

Journal Article

Pro-UIGAN: progressive face hallucination from occluded thumbnails

Zhang, Yang, Yu, Xin, Lu, Xiaobo and Liu, Ping (2022). Pro-UIGAN: progressive face hallucination from occluded thumbnails. IEEE Transactions On Image Processing, 31, 3236-3250. doi: 10.1109/TIP.2022.3167280

Pro-UIGAN: progressive face hallucination from occluded thumbnails

2022

Conference Publication

Batch Multi-Fidelity Active Learning with Budget Constraints

Li, Shibo, Phillips, Jeff M., Yu, Xin, Kirby, Robert M. and Zhe, Shandian (2022). Batch Multi-Fidelity Active Learning with Budget Constraints. 36th Conference on Neural Information Processing Systems (NeurIPS 2022), Online, 28 November - 9 December 2022. Maryland Heights, MO United States: Morgan Kaufmann Publishers.

Batch Multi-Fidelity Active Learning with Budget Constraints

2022

Journal Article

Understanding atomic hand-object interaction with human intention

Fan, Hehe, Zhuo, Tao, Yu, Xin, Yang, Yi and Kankanhalli, Mohan (2022). Understanding atomic hand-object interaction with human intention. IEEE Transactions On Circuits and Systems for Video Technology, 32 (1), 275-285. doi: 10.1109/TCSVT.2021.3058688

Understanding atomic hand-object interaction with human intention

2021

Conference Publication

End-to-end multi-instance robotic reaching from monocular vision

Zhuang, Zheyu, Yu, Xin and Mahony, Robert (2021). End-to-end multi-instance robotic reaching from monocular vision. IEEE International Conference on Robotics and Automation (ICRA), Xian, China, 30 May - 5 June 2021. Washington, DC United States: IEEE Computer Society. doi: 10.1109/ICRA48506.2021.9561518

End-to-end multi-instance robotic reaching from monocular vision

2021

Conference Publication

Audio2Head: Audio-driven One-shot Talking-head Generation with Natural Head Motion

Wang, Suzhen, Li, Lincheng, Ding, Yu, Fan, Changjie and Yu, Xin (2021). Audio2Head: Audio-driven One-shot Talking-head Generation with Natural Head Motion. Thirtieth International Joint Conference on Artificial Intelligence, Montreal, Canada, 19-27 August 2021. Los Angeles, CA United States: International Joint Conferences on Artificial Intelligence Organization. doi: 10.24963/ijcai.2021/152

Audio2Head: Audio-driven One-shot Talking-head Generation with Natural Head Motion

2021

Conference Publication

The IKEA ASM dataset: understanding people assembling furniture through actions, objects and pose

Ben-Shabat, Yizhak, Yu, Xin, Saleh, Fatemeh, Campbell, Dylan, Rodriguez-Opazo, Cristian, Li, Hongdong and Gould, Stephen (2021). The IKEA ASM dataset: understanding people assembling furniture through actions, objects and pose. IEEE Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI United States, 5-9 January 2021. Piscataway, NJ United States: IEEE Computer Society. doi: 10.1109/WACV48630.2021.00089

The IKEA ASM dataset: understanding people assembling furniture through actions, objects and pose

2021

Conference Publication

Auto-navigator: decoupled neural architecture search for visual navigation

Tang, Tianqi, Yu, Xin, Dong, Xuanyi and Yang, Yi (2021). Auto-navigator: decoupled neural architecture search for visual navigation. IEEE Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI United States, 5-9 January 2021. Piscataway, NJ United States: IEEE. doi: 10.1109/WACV48630.2021.00379

Auto-navigator: decoupled neural architecture search for visual navigation

2021

Conference Publication

Modeling the probabilistic distribution of unlabeled data for one-shot medical image segmentation

Ding, Yuhang, Yu, Xin and Yang, Yi (2021). Modeling the probabilistic distribution of unlabeled data for one-shot medical image segmentation. 35th AAAI Conference on Artificial Intelligence / 33rd Conference on Innovative Applications of Artificial Intelligence / 11th Symposium on Educational Advances in Artificial Intelligence, Online, 2–9 February 2021. Palo Alto, CA United States: Association for the Advancement of Artificial Intelligence. doi: 10.1609/aaai.v35i2.16212

Modeling the probabilistic distribution of unlabeled data for one-shot medical image segmentation