|
2025 Conference Publication Robust audio-visual segmentation via audio-guided visual convergent alignmentLiu, Chen, Li, Peike, Yang, Liying, Wang, Dadong, Li, Lincheng and Yu, Xin (2025). Robust audio-visual segmentation via audio-guided visual convergent alignment. 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN USA, 10-17 June 2025. Piscataway, NJ USA: Institute of Electrical and Electronics Engineers. doi: 10.1109/cvpr52734.2025.02693 |
|
2025 Conference Publication EasyCraft: a robust and efficient framework for automatic avatar craftingWang, Suzhen, Chen, Weijie, Zhang, Wei, Zhao, Minda, Li, Lincheng, Zhang, Rongsheng, Hu, Zhipeng and Yu, Xin (2025). EasyCraft: a robust and efficient framework for automatic avatar crafting. 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN USA, 10-17 June 2025. New York, NY USA: IEEE Computer Society. doi: 10.1109/CVPR52734.2025.00524 |
|
2025 Conference Publication Blind bitstream-corrupted video recovery via metadata-guided diffusion modelWang, Shuyun, Zhang, Hu, Shen, Xin, Wang, Dadong and Yu, Xin (2025). Blind bitstream-corrupted video recovery via metadata-guided diffusion model. 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN USA, 10-17 June 2025. New York, NY USA: IEEE Computer Society. doi: 10.1109/CVPR52734.2025.02139 |
|
2025 Conference Publication Dynamic derivation and elimination: audio visual segmentation with enhanced audio semanticsLiu, Chen, Yang, Liying, Li, Peike, Wang, Dadong, Li, Lincheng and Yu, Xin (2025). Dynamic derivation and elimination: audio visual segmentation with enhanced audio semantics. 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN USA, 10-17 June 2025. Piscataway, NJ USA: Institute of Electrical and Electronics Engineers. doi: 10.1109/cvpr52734.2025.00298 |
|
2025 Conference Publication M3GYM: a large-scale multimodal multi-view multi-person pose dataset for fitness activity understanding in real-world settingsXu, Qingzheng, Cao, Ru, Shen, Xin, Du, Heming, Wang, Sen and Yu, Xin (2025). M3GYM: a large-scale multimodal multi-view multi-person pose dataset for fitness activity understanding in real-world settings. 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN, United States, 10 - 17 June 2025. Washington, DC, United States: I E E E Computer Society. doi: 10.1109/cvpr52734.2025.01147 |
|
2025 Conference Publication Cross-view isolated sign language recognition challenge: design, results and future researchShen, Xin, Du, Heming, Xu, Miao, Liu, Miaomiao and Yu, Xin (2025). Cross-view isolated sign language recognition challenge: design, results and future research. WWW '25: The ACM Web Conference 2025, Sydney, NSW Australia, 28 April-2 May 2025. New York, NY USA: Association for Computing Machinery. doi: 10.1145/3701716.3717522 |
|
2025 Conference Publication MDAM 3: a misinformation detection and analysis framework for multitype multimodal mediaXu, Qingzheng, Du, Heming, Łukasik, Szymon, Zhu, Tianqing, Wang, Sen and Yu, Xin (2025). MDAM 3: a misinformation detection and analysis framework for multitype multimodal media. WWW '25: The ACM Web Conference 2025, Sydney, NSW Australia, 28 April-2 May 2025. New York, NY USA: Association for Computing Machinery. doi: 10.1145/3696410.3714498 |
|
2025 Conference Publication FlashVTG: Feature Layering and Adaptive Score Handling Network for Video Temporal GroundingCao, Zhuo, Zhang, Bingqing, Du, Heming, Yu, Xin, Li, Xue and Wang, Sen (2025). FlashVTG: Feature Layering and Adaptive Score Handling Network for Video Temporal Grounding. 2025 Winter Conference on Applications of Computer Vision-WACV, Tucson Az, Feb 28-Mar 04, 2025. LOS ALAMITOS: IEEE. doi: 10.1109/wacv61041.2025.00894 |
|
2025 Conference Publication TokenBinder: Text-Video Retrieval with One-to-Many Alignment ParadigmZhang, Bingqing, Cao, Zhuo, Du, Heming, Yu, Xin, Li, Xue, Liu, Jiajun and Wang, Sen (2025). TokenBinder: Text-Video Retrieval with One-to-Many Alignment Paradigm. IEEE. doi: 10.1109/wacv61041.2025.00485 |
|
2025 Conference Publication Affective behaviour analysis via progressive learningLiu, Chen, Zhang, Wei, Qiu, Feng, Li, Lincheng, Wang, Dadong and Yu, Xin (2025). Affective behaviour analysis via progressive learning. ECCV 2024 Workshops, Milan, Italy, 29 September - 4 October 2024. Heidelberg, Germany: Springer. doi: 10.1007/978-3-031-91581-9_26 |
|
2025 Conference Publication Vision-based abnormal action dataset for recognising body motion disordersYing, Jiaying, Shen, Xin and Yu, Xin (2025). Vision-based abnormal action dataset for recognising body motion disorders. 37th Australasian Joint Conference on Artificial Intelligence, AI 2024, Melbourne, VIC, Australia, 25 - 29 November 2024. Singapore, Singapore: Springer Nature Singapore. doi: 10.1007/978-981-96-0351-0_33 |
|
2025 Conference Publication Transferable attacks for semantic segmentationHe, Mengqi, Zhang, Jing and Yu, Xin (2025). Transferable attacks for semantic segmentation. 35th Australasian Database Conference, Gold Coast, QLD, Australia, 16-18 December 2024. Heidelberg, Germany: Springer. doi: 10.1007/978-981-96-1242-0_28 |
|
2025 Conference Publication Compound expression recognition via curriculum learningLiu, Chen, Qiu, Feng, Zhang, Wei, Li, Lincheng, Wang, Dadong and Yu, Xin (2025). Compound expression recognition via curriculum learning. ECCV 2024 Workshops, Milan, Italy, 29 September - 4 October 2024. Heidelberg, Germany: Springer. doi: 10.1007/978-3-031-91581-9_20 |
|
2024 Conference Publication CPT-VR: Improving Surface Rendering via Closest Point Transform with View-Reflection AppearanceHu, Zhipeng, Zhang, Yongqiang, Liu, Chen, Li, Lincheng, Peng, Sida, Zhou, Xiaowei, Fan, Changjie and Yu, Xin (2024). CPT-VR: Improving Surface Rendering via Closest Point Transform with View-Reflection Appearance. 18th European Conference on Computer Vision, ECCV 2024, Milan, Italy, 29 September –4 October 2024. Cham, Switzerland: Springer. doi: 10.1007/978-3-031-73464-9_14 |
|
2024 Conference Publication FreeAvatar: robust 3D facial animation transfer by learning an expression foundation modelQiu, Feng, Zhang, Wei, Liu, Chen, An, Rudong, Li, Lincheng, Ding, Yu, Fan, Changjie, Hu, Zhipeng and Yu, Xin (2024). FreeAvatar: robust 3D facial animation transfer by learning an expression foundation model. SA '24: SIGGRAPH Asia 2024, Tokyo, Japan, 3-6 December 2024. New York, NY, United States: ACM. doi: 10.1145/3680528.3687669 |
|
2024 Conference Publication Snap and diagnose: an advanced multimodal retrieval system for identifying plant diseases in the wildWei, Tianqi, Chen, Zhi and Yu, Xin (2024). Snap and diagnose: an advanced multimodal retrieval system for identifying plant diseases in the wild. MMASIA ’24, Auckland, New Zealand, 3-6 December 2024. New York, United States: ACM. doi: 10.1145/3696409.3700293 |
|
2024 Conference Publication Benchmarking in-the-wild multimodal disease recognition and a versatile baselineWei, Tianqi, Chen, Zhi, Huang, Zi and Yu, Xin (2024). Benchmarking in-the-wild multimodal disease recognition and a versatile baseline. MM '24: The 32nd ACM International Conference on Multimedia, Melbourne, VIC, Australia, 28 October-1 November 2024. New York, United States: Association for Computing Machinery. doi: 10.1145/3664647.3680599 |
|
2024 Conference Publication Machine Unlearning via Null Space CalibrationChen, Huiqiang, Zhu, Tianqing, Yu, Xin and Zhou, Wanlei (2024). Machine Unlearning via Null Space Calibration. 33rd International Joint Conference on Artificial Intelligence (IJCAI), Jeju, South Korea, 3-9 August 2024. California: International Joint Conferences on Artificial Intelligence Organization. doi: 10.24963/ijcai.2024/40 |
|
2024 Conference Publication Language-guided multi-modal emotional mimicry intensity estimationQiu, Feng, Zhang, Wei, Liu, Chen, Li, Lincheng, Du, Heming, Guo, Tianchen and Yu, Xin (2024). Language-guided multi-modal emotional mimicry intensity estimation. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Seattle, WA, United States, 17-18 June 2024. Piscataway, NJ, United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/cvprw63382.2024.00477 |
|
2024 Conference Publication Learning transferable compound expressions from Masked AutoEncoder pretrainingQiu, Feng, Du, Heming, Zhang, Wei, Liu, Chen, Li, Lincheng, Guo, Tianchen and Yu, Xin (2024). Learning transferable compound expressions from Masked AutoEncoder pretraining. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Seattle, WA, United States, 17-18 June 2024. Piscataway, NJ, United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/cvprw63382.2024.00476 |