2023 | Conference Publication | Sim2RealVS: A new benchmark for video stabilization with a strong baseline | Rao, Qi, Yu, Xin, Navasardyan, Shant and Shi, Humphrey (2023). Sim2RealVS: A new benchmark for video stabilization with a strong baseline. 23rd IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI United States, 3-7 January 2023. Piscataway, NJ United States: IEEE. doi: 10.1109/wacv56688.2023.00537 |
2023 | Conference Publication | Meta knowledge condensation for federated learning | Liu, Ping, Yu, Xin and Zhou, Joey Tianyi (2023). Meta knowledge condensation for federated learning. 11th International Conference on Learning Representations, ICLR 2023, Kigali, Rwanda, 1-5 May 2023. International Conference on Learning Representations, ICLR. |
2023 | Conference Publication | Auslan-Daily: Australian sign language translation for daily communication and news | Shen, Xin, Yuan, Shaozu, Sheng, Hongwei, Du, Heming and Yu, Xin (2023). Auslan-Daily: Australian sign language translation for daily communication and news. 37th Conference on Neural Information Processing Systems (NeurIPS 2023), New Orleans, LA, United States, 10-16 December 2023. San Diego, CA, United States: Neural Information Processing Systems Foundation. |
2023 | Conference Publication | CVLNet: Cross-view Semantic Correspondence Learning for Video-Based Camera Localization | Shi, Yujiao, Yu, Xin, Wang, Shan and Li, Hongdong (2023). CVLNet: Cross-view Semantic Correspondence Learning for Video-Based Camera Localization. 16th Asian Conference on Computer Vision, Macao, China, 4–8 December 2022. Cham, Switzerland: Springer. doi: 10.1007/978-3-031-26319-4_8 |
2023 | Conference Publication | RVD: a handheld device-based fundus video dataset for retinal vessel segmentation | Khan, Md Wahiduzzaman, Sheng, Hongwei, Zhang, Hu, Du, Heming, Wang, Sen, Coroneo, Minas Theodore, Hajati, Farshid, Shariflou, Sahar, Kalloniatis, Michael, Phu, Jack, Agar, Ashish, Huang, Zi, Golzan, Mojtaba and Yu, Xin (2023). RVD: a handheld device-based fundus video dataset for retinal vessel segmentation. 37th Conference on Neural Information Processing Systems (NeurIPS 2023) Track on Datasets and Benchmarks, New Orleans, LA, United States, 10-16 December 2023. Maryland Heights, MO, United States: Morgan Kaufmann Publishers. |
2022 | Conference Publication | MHR-Net: Multiple-Hypothesis Reconstruction of Non-Rigid Shapes from 2D Views | Zeng, Haitian, Yu, Xin, Miao, Jiaxu and Yang, Yi (2022). MHR-Net: Multiple-Hypothesis Reconstruction of Non-Rigid Shapes from 2D Views. European Conference on Computer Vision (ECCV 2022), Tel Aviv, Israel, 23-27 October 2022. Cham, Switzerland: Springer. doi: 10.1007/978-3-031-20086-1_1 |
2022 | Conference Publication | Instance as identity: a generic online paradigm for video instance segmentation | Zhu, Feng, Yang, Zongxin, Yu, Xin, Yang, Yi and Wei, Yunchao (2022). Instance as identity: a generic online paradigm for video instance segmentation. Computer Vision – ECCV 2022 17th European Conference, Tel Aviv, Israel, 23–27 October 2022. Cham, Switzerland: Springer. doi: 10.1007/978-3-031-19818-2_30 |
2022 | Conference Publication | Learning Implicit Body Representations from Double Diffusion Based Neural Radiance Fields | Yao, Guangming, Wu, Hongzhi, Yuan, Yi, Li, Lincheng, Zhou, Kun and Yu, Xin (2022). Learning Implicit Body Representations from Double Diffusion Based Neural Radiance Fields. Thirty-First International Joint Conference on Artificial Intelligence IJCAI-ECAI 2022, Vienna, Austria, 23-29 July 2022. Los Angeles, CA United States: International Joint Conferences on Artificial Intelligence Organization. doi: 10.24963/ijcai.2022/218 |
2022 | Conference Publication | One-shot talking face generation from single-speaker audio-visual correlation learning | Wang, Suzhen, Li, Lincheng, Ding, Yu and Yu, Xin (2022). One-shot talking face generation from single-speaker audio-visual correlation learning. 36th AAAI Conference on Artificial Intelligence / 34th Conference on Innovative Applications of Artificial Intelligence / 12th Symposium on Educational Advances in Artificial Intelligence, Online, 22 February – 1 March 2022. Palo Alto, CA United States: Association for the Advancement of Artificial Intelligence. doi: 10.1609/aaai.v36i3.20154 |
2022 | Conference Publication | Monocular camera-based point-goal navigation by learning depth channel and cross-modality pyramid fusion | Tang, Tianqi, Du, Heming, Yu, Xin and Yang, Yi (2022). Monocular camera-based point-goal navigation by learning depth channel and cross-modality pyramid fusion. 36th AAAI Conference on Artificial Intelligence / 34th Conference on Innovative Applications of Artificial Intelligence / 12th Symposium on Educational Advances in Artificial Intelligence, Online, 22 February – 1 March 2022. Palo Alto, CA United States: Association for the Advancement of Artificial Intelligence. doi: 10.1609/aaai.v36i5.20480 |
2022 | Conference Publication | Batch Multi-Fidelity Active Learning with Budget Constraints | Li, Shibo, Phillips, Jeff M., Yu, Xin, Kirby, Robert M. and Zhe, Shandian (2022). Batch Multi-Fidelity Active Learning with Budget Constraints. 36th Conference on Neural Information Processing Systems (NeurIPS 2022), Online, 28 November - 9 December 2022. Maryland Heights, MO United States: Morgan Kaufmann Publishers. |
2021 | Conference Publication | End-to-end multi-instance robotic reaching from monocular vision | Zhuang, Zheyu, Yu, Xin and Mahony, Robert (2021). End-to-end multi-instance robotic reaching from monocular vision. IEEE International Conference on Robotics and Automation (ICRA), Xi'an, China, 30 May - 5 June 2021. Washington, DC United States: IEEE Computer Society. doi: 10.1109/ICRA48506.2021.9561518 |
2021 | Conference Publication | Audio2Head: Audio-driven One-shot Talking-head Generation with Natural Head Motion | Wang, Suzhen, Li, Lincheng, Ding, Yu, Fan, Changjie and Yu, Xin (2021). Audio2Head: Audio-driven One-shot Talking-head Generation with Natural Head Motion. Thirtieth International Joint Conference on Artificial Intelligence, Montreal, Canada, 19-27 August 2021. Los Angeles, CA United States: International Joint Conferences on Artificial Intelligence Organization. doi: 10.24963/ijcai.2021/152 |
2021 | Conference Publication | The IKEA ASM dataset: understanding people assembling furniture through actions, objects and pose | Ben-Shabat, Yizhak, Yu, Xin, Saleh, Fatemeh, Campbell, Dylan, Rodriguez-Opazo, Cristian, Li, Hongdong and Gould, Stephen (2021). The IKEA ASM dataset: understanding people assembling furniture through actions, objects and pose. IEEE Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI United States, 5-9 January 2021. Piscataway, NJ United States: IEEE Computer Society. doi: 10.1109/WACV48630.2021.00089 |
2021 | Conference Publication | Auto-navigator: decoupled neural architecture search for visual navigation | Tang, Tianqi, Yu, Xin, Dong, Xuanyi and Yang, Yi (2021). Auto-navigator: decoupled neural architecture search for visual navigation. IEEE Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI United States, 5-9 January 2021. Piscataway, NJ United States: IEEE. doi: 10.1109/WACV48630.2021.00379 |
2021 | Conference Publication | Modeling the probabilistic distribution of unlabeled data for one-shot medical image segmentation | Ding, Yuhang, Yu, Xin and Yang, Yi (2021). Modeling the probabilistic distribution of unlabeled data for one-shot medical image segmentation. 35th AAAI Conference on Artificial Intelligence / 33rd Conference on Innovative Applications of Artificial Intelligence / 11th Symposium on Educational Advances in Artificial Intelligence, Online, 2–9 February 2021. Palo Alto, CA United States: Association for the Advancement of Artificial Intelligence. doi: 10.1609/aaai.v35i2.16212 |
2021 | Conference Publication | Write-a-speaker: Text-based emotional and rhythmic talking-head generation | Li, Lincheng, Wang, Suzhen, Zhang, Zhimeng, Ding, Yu, Zheng, Yixing, Yu, Xin and Fan, Changjie (2021). Write-a-speaker: Text-based emotional and rhythmic talking-head generation. 35th AAAI Conference on Artificial Intelligence / 33rd Conference on Innovative Applications of Artificial Intelligence / 11th Symposium on Educational Advances in Artificial Intelligence, Online, 2–9 February 2021. Palo Alto, CA United States: Association for the Advancement of Artificial Intelligence. doi: 10.1609/aaai.v35i3.16286 |
2021 | Conference Publication | PSTNET: point spatio-temporal convolution on point cloud sequences | Fan, Hehe, Yu, Xin, Ding, Yuhang, Yang, Yi and Kankanhalli, Mohan (2021). PSTNET: point spatio-temporal convolution on point cloud sequences. 9th International Conference on Learning Representations, Virtual, 3-7 May 2021. International Conference on Learning Representations, ICLR. |
2021 | Conference Publication | VTNET: visual transformer network for object goal navigation | Du, Heming, Yu, Xin and Zheng, Liang (2021). VTNET: visual transformer network for object goal navigation. 9th International Conference on Learning Representations, Virtual, 3-7 May 2021. Appleton WI USA: International Conference on Learning Representations. |
2021 | Conference Publication | Leaping from 2D Detection to Efficient 6DoF Object Pose Estimation | Liu, Jinhui, Zou, Zhikang, Ye, Xiaoqing, Tan, Xiao, Ding, Errui, Xu, Feng and Yu, Xin (2021). Leaping from 2D Detection to Efficient 6DoF Object Pose Estimation. European Conference on Computer Vision ECCV 2020, Glasgow, United Kingdom, 23–28 August 2020. Cham, Switzerland: Springer. doi: 10.1007/978-3-030-66096-3_47 |