|
2024 Conference Publication Language-guided multi-modal emotional mimicry intensity estimationQiu, Feng, Zhang, Wei, Liu, Chen, Li, Lincheng, Du, Heming, Guo, Tianchen and Yu, Xin (2024). Language-guided multi-modal emotional mimicry intensity estimation. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Seattle, WA, United States, 17-18 June 2024. Piscataway, NJ, United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/cvprw63382.2024.00477 |
|
2024 Conference Publication Learning transferable compound expressions from Masked AutoEncoder pretrainingQiu, Feng, Du, Heming, Zhang, Wei, Liu, Chen, Li, Lincheng, Guo, Tianchen and Yu, Xin (2024). Learning transferable compound expressions from Masked AutoEncoder pretraining. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Seattle, WA, United States, 17-18 June 2024. Piscataway, NJ, United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/cvprw63382.2024.00476 |
|
2024 Conference Publication DiPEx: Dispersing Prompt Expansion for class-agnostic object detectionLim, Jia Syuen, Chen, Zhuoxiao, Baktashmotlagh, Mahsa, Chen, Zhi, Yu, Xin, Huang, Zi and Luo, Yadan (2024). DiPEx: Dispersing Prompt Expansion for class-agnostic object detection. 38th International Conference on Neural Information Processing Systems, Vancouver, BC Canada, 10-15 December 2024. New York, NY USA: Association for Computing Machinery. |
|
2024 Conference Publication When 3D bounding-box meets SAM: point cloud instance segmentation with weak-and-noisy supervisionYu, Qingtao, Du, Heming, Liu, Chen and Yu, Xin (2024). When 3D bounding-box meets SAM: point cloud instance segmentation with weak-and-noisy supervision. 2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI, United States, 3-8 January 2024. Piscataway, NJ, United States: IEEE. doi: 10.1109/wacv57701.2024.00368 |
|
2024 Conference Publication Text-guided 3D face synthesis - from generation to editingWu, Yunjie, Meng, Yapeng, Hu, Zhipeng, Li, Lincheng, Wu, Haoqian, Zhou, Kun, Xu, Weiwei and Yu, Xin (2024). Text-guided 3D face synthesis - from generation to editing. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, United States, 16-22 June 2024. Washington, DC, United States: IEEE Computer Society. doi: 10.1109/CVPR52733.2024.00126 |
|
2024 Conference Publication MM-WLAuslan: multi-view multi-modal word-level Australian Sign Language recognition datasetShen, Xin, Du, Heming, Sheng, Hongwei, Wang, Shuyun, Chen, Hui, Chen, Huiqiang, Wu, Zhuojie, Du, Xiaobiao, Ying, Jiaying, Lu, Ruihan, Xu, Qingzheng and Yu, Xin (2024). MM-WLAuslan: multi-view multi-modal word-level Australian Sign Language recognition dataset. NeurIPS 2024, Vancouver, BC, Canada, 10 - 15 December 2024. Maryland Heights, MO, United States: Morgan Kaufmann Publishers. |
|
2024 Conference Publication MMOOC: a multimodal misinformation dataset for out-of-context news analysisXu, Qingzheng, Du, Heming, Chen, Huiqiang, Liu, Bo and Yu, Xin (2024). MMOOC: a multimodal misinformation dataset for out-of-context news analysis. 29th Australasian Conference, ACISP 2024, Sydney, NSW, Australia, 15–17 July 2024. Heidelberg, Germany: Springer. doi: 10.1007/978-981-97-5101-3_24 |
|
2024 Conference Publication TPR: Topology-preserving reservoirs for generalized zero-shot learningChen, Hui, Liu, Yanbin, Ma, Yongqiang, Zheng, Nanning and Yu, Xin (2024). TPR: Topology-preserving reservoirs for generalized zero-shot learning. NIPS '24: 38th International Conference on Neural Information Processing Systems, Vancouver, BC, Canada, 10-15 December 2024. Maryland Heights, MO, United States: Morgan Kaufmann Publishers. |
|
2024 Conference Publication An empirical analysis on spatial reasoning capabilities of large multimodal modelsShiri, Fatemeh, Guo, Xiao-Yu, Far, Mona Golestan, Yu, Xin, Haffari, Gholamreza and Li, Yuan-Fang (2024). An empirical analysis on spatial reasoning capabilities of large multimodal models. 2024 Conference on Empirical Methods in Natural Language Processing, Miami, FL, United States, 12-16 November 2024. Kerrville, TX, United States: Association for Computational Linguistics (ACL). doi: 10.18653/v1/2024.emnlp-main.1195 |
|
2024 Conference Publication EfficientDreamer: high-fidelity and stable 3D creation via orthogonal-view diffusion priorsHu, Zhipeng, Zhao, Minda, Zhao, Chaoyi, Liang, Xinyue, Li, Lincheng, Zhao, Zeng, Fan, Changjie, Zhou, Xiaowei and Yu, Xin (2024). EfficientDreamer: high-fidelity and stable 3D creation via orthogonal-view diffusion priors. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, United States, 16-22 June 2024. Washington, DC, United States: IEEE Computer Society. doi: 10.1109/CVPR52733.2024.00473 |
|
2024 Conference Publication AS-NeRF: learning auxiliary sampling for generalizable novel view synthesis from sparse viewsTang, Jilin, Li, Lincheng, Qi, Xingqun, Chen, Yingfeng, Fan, Changjie and Yu, Xin (2024). AS-NeRF: learning auxiliary sampling for generalizable novel view synthesis from sparse views. 2024 IEEE International Conference on Multimedia and Expo (ICME), Niagara Falls, ON, Canada, 15-19 July 2024. Washington, DC, United States: IEEE Computer Society. doi: 10.1109/ICME57554.2024.10688126 |
|
2024 Conference Publication Benchmarking audio visual segmentation for long-untrimmed videosLiu, Chen, Li, Peike Patrick, Yu, Qingtao, Sheng, Hongwei, Wang, Dadong, Li, Lincheng and Yu, Xin (2024). Benchmarking audio visual segmentation for long-untrimmed videos. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, United States, 16-22 June 2024. Washington, DC, United States: IEEE Computer Society. doi: 10.1109/CVPR52733.2024.02143 |
|
2023 Conference Publication Learning efficient unsupervised satellite image-based building damage detectionZhang, Yiyun, Wang, Zijian, Luo, Yadan, Yu, Xin and Huang, Zi (2023). Learning efficient unsupervised satellite image-based building damage detection. 2023 IEEE International Conference on Data Mining (ICDM), Shanghai, China, 1-4 December 2023. Piscataway, NJ, United States: IEEE. doi: 10.1109/icdm58522.2023.00206 |
|
2023 Conference Publication A new perspective of weakly supervised 3D instance segmentation via bounding boxesYu, Qingtao, Du, Heming and Yu, Xin (2023). A new perspective of weakly supervised 3D instance segmentation via bounding boxes. 36th Australasian Joint Conference on Artificial Intelligence, AJCAI 2023, Brisbane, QLD Australia, 28 November –1 December 2023. Singapore: Springer. doi: 10.1007/978-981-99-8388-9_9 |
|
2023 Conference Publication Context-based masking for spontaneous venous pulsations detectionSheng, Hongwei, Yu, Xin, Li, Xue and Golzan, Mojtaba (2023). Context-based masking for spontaneous venous pulsations detection. 36th Australasian Joint Conference on Artificial Intelligence, AJCAI 2023, Brisbane, QLD Australia, 28 November –1 December 2023. Singapore: Springer. doi: 10.1007/978-981-99-8388-9_42 |
|
2023 Conference Publication Toward a unified framework for RGB and RGB-D visual navigationDu, Heming, Huang, Zi, Chapman, Scott and Yu, Xin (2023). Toward a unified framework for RGB and RGB-D visual navigation. 36th Australasian Joint Conference on Artificial Intelligence, AJCAI 2023, Brisbane, QLD Australia, 28 November –1 December 2023. Singapore: Springer. doi: 10.1007/978-981-99-8391-9_29 |
|
2023 Conference Publication Towards reliable and efficient vegetation segmentation for Australian wheat data analysisYuan, Bowen, Wang, Zijian and Yu, Xin (2023). Towards reliable and efficient vegetation segmentation for Australian wheat data analysis. 34th Australasian Database Conference (ADC), Melbourne, NSW Australia, 1-3 November 2023. Cham, Switzerland: Springer Cham. doi: 10.1007/978-3-031-47843-7_9 |
|
2023 Conference Publication Audio-visual segmentation by exploring cross-modal mutual semanticsLiu, Chen, Li, Peike Patrick, Qi, Xingqun, Zhang, Hu, Li, Lincheng, Wang, Dadong and Yu, Xin (2023). Audio-visual segmentation by exploring cross-modal mutual semantics. MM '23: The 31st ACM International Conference on Multimedia, Ottawa, ON Canada, 29 October - 3 November 2023. New York, NY United States: Association for Computing Machinery. doi: 10.1145/3581783.3612373 |
|
2023 Conference Publication DyGait: exploiting dynamic representations for high-performance gait recognitionWang, Ming, Guo, Xianda, Lin, Beibei, Yang, Tian, Zhu, Zheng, Li, Lincheng, Zhang, Shunli and Yu, Xin (2023). DyGait: exploiting dynamic representations for high-performance gait recognition. IEEE/CVF International Conference on Computer Vision (ICCV), Paris, France, 2-6 October 2023. Piscataway, NJ, United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/iccv51070.2023.01235 |
|
2023 Conference Publication Gait recognition with mask-based regularizationShen, Chuanfu, Lin, Beibei, Zhang, Shunli, Yu, Xin, Huang, George Q. and Yu, Shiqi (2023). Gait recognition with mask-based regularization. IEEE International Joint Conference on Biometrics (IJCB), Ljubljana, Slovenia, 25-28 September 2023. New York, NY, United States: IEEE. doi: 10.1109/ijcb57857.2023.10449112 |