|
2025 Conference Publication Transferable attacks for semantic segmentation. He, Mengqi, Zhang, Jing and Yu, Xin (2025). Transferable attacks for semantic segmentation. 35th Australasian Database Conference, Gold Coast, QLD, Australia, 16-18 December 2024. Heidelberg, Germany: Springer. doi: 10.1007/978-981-96-1242-0_28 |
|
2024 Conference Publication CPT-VR: Improving Surface Rendering via Closest Point Transform with View-Reflection Appearance. Hu, Zhipeng, Zhang, Yongqiang, Liu, Chen, Li, Lincheng, Peng, Sida, Zhou, Xiaowei, Fan, Changjie and Yu, Xin (2024). CPT-VR: Improving Surface Rendering via Closest Point Transform with View-Reflection Appearance. 18th European Conference on Computer Vision, ECCV 2024, Milan, Italy, 29 September-4 October 2024. Cham, Switzerland: Springer. doi: 10.1007/978-3-031-73464-9_14 |
|
2024 Conference Publication FreeAvatar: robust 3D facial animation transfer by learning an expression foundation model. Qiu, Feng, Zhang, Wei, Liu, Chen, An, Rudong, Li, Lincheng, Ding, Yu, Fan, Changjie, Hu, Zhipeng and Yu, Xin (2024). FreeAvatar: robust 3D facial animation transfer by learning an expression foundation model. SA '24: SIGGRAPH Asia 2024, Tokyo, Japan, 3-6 December 2024. New York, NY, United States: ACM. doi: 10.1145/3680528.3687669 |
|
2024 Conference Publication Snap and diagnose: an advanced multimodal retrieval system for identifying plant diseases in the wild. Wei, Tianqi, Chen, Zhi and Yu, Xin (2024). Snap and diagnose: an advanced multimodal retrieval system for identifying plant diseases in the wild. MMASIA ’24, Auckland, New Zealand, 3-6 December 2024. New York, United States: ACM. doi: 10.1145/3696409.3700293 |
|
2024 Journal Article M3A: A multimodal misinformation dataset for media authenticity analysis. Xu, Qingzheng, Chen, Huiqiang, Du, Heming, Zhang, Hu, Łukasik, Szymon, Zhu, Tianqing and Yu, Xin (2024). M3A: A multimodal misinformation dataset for media authenticity analysis. Computer Vision and Image Understanding, 249 104205. doi: 10.1016/j.cviu.2024.104205 |
|
2024 Book Chapter OpenSight: A Simple Open-Vocabulary Framework for LiDAR-Based Object Detection. Zhang, Hu, Xu, Jianhua, Tang, Tao, Sun, Haiyang, Yu, Xin, Huang, Zi and Yu, Kaicheng (2024). OpenSight: A Simple Open-Vocabulary Framework for LiDAR-Based Object Detection. Lecture Notes in Computer Science. (pp. 1-19) Cham: Springer Nature Switzerland. doi: 10.1007/978-3-031-72907-2_1 |
|
2024 Conference Publication Benchmarking in-the-wild multimodal disease recognition and a versatile baseline. Wei, Tianqi, Chen, Zhi, Huang, Zi and Yu, Xin (2024). Benchmarking in-the-wild multimodal disease recognition and a versatile baseline. MM '24: The 32nd ACM International Conference on Multimedia, Melbourne, VIC, Australia, 28 October-1 November 2024. New York, United States: Association for Computing Machinery. doi: 10.1145/3664647.3680599 |
|
2024 Journal Article Ethics-aware face recognition aided by synthetic face images. Du, Xiaobiao, Yu, Xin, Liu, Jinhui, Dai, Beifen and Xu, Feng (2024). Ethics-aware face recognition aided by synthetic face images. Neurocomputing, 600 128129. doi: 10.1016/j.neucom.2024.128129 |
|
2024 Conference Publication Machine Unlearning via Null Space Calibration. Chen, Huiqiang, Zhu, Tianqing, Yu, Xin and Zhou, Wanlei (2024). Machine Unlearning via Null Space Calibration. 33rd International Joint Conference on Artificial Intelligence (IJCAI), Jeju, South Korea, 3-9 August 2024. California: International Joint Conferences on Artificial Intelligence Organization. doi: 10.24963/ijcai.2024/40 |
|
2024 Conference Publication Learning transferable compound expressions from Masked AutoEncoder pretraining. Qiu, Feng, Du, Heming, Zhang, Wei, Liu, Chen, Li, Lincheng, Guo, Tianchen and Yu, Xin (2024). Learning transferable compound expressions from Masked AutoEncoder pretraining. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Seattle, WA, United States, 17-18 June 2024. Piscataway, NJ, United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/cvprw63382.2024.00476 |
|
2024 Conference Publication An effective ensemble learning framework for affective behaviour analysis. Zhang, Wei, Qiu, Feng, Liu, Chen, Li, Lincheng, Du, Heming, Guo, Tianchen and Yu, Xin (2024). An effective ensemble learning framework for affective behaviour analysis. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Seattle, WA, United States, 17-18 June 2024. Piscataway, NJ, United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/cvprw63382.2024.00479 |
|
2024 Conference Publication Language-guided multi-modal emotional mimicry intensity estimation. Qiu, Feng, Zhang, Wei, Liu, Chen, Li, Lincheng, Du, Heming, Guo, Tianchen and Yu, Xin (2024). Language-guided multi-modal emotional mimicry intensity estimation. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Seattle, WA, United States, 17-18 June 2024. Piscataway, NJ, United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/cvprw63382.2024.00477 |
|
2024 Journal Article Proactive image manipulation detection via deep semi-fragile watermark. Zhao, Yuan, Liu, Bo, Zhu, Tianqing, Ding, Ming, Yu, Xin and Zhou, Wanlei (2024). Proactive image manipulation detection via deep semi-fragile watermark. Neurocomputing, 585 127593. doi: 10.1016/j.neucom.2024.127593 |
|
2024 Conference Publication DiPEx: Dispersing Prompt Expansion for class-agnostic object detection. Lim, Jia Syuen, Chen, Zhuoxiao, Baktashmotlagh, Mahsa, Chen, Zhi, Yu, Xin, Huang, Zi and Luo, Yadan (2024). DiPEx: Dispersing Prompt Expansion for class-agnostic object detection. 38th Annual Conference on Neural Information Processing Systems (NeurIPS 2024), Vancouver, BC, Canada, 10-15 December 2024. San Mateo, CA, United States: Morgan Kaufmann Publishers. doi: 10.52202/079017-0781 |
|
2024 Journal Article BAVS: Bootstrapping audio-visual segmentation by integrating foundation knowledge. Liu, Chen, Li, Peike, Zhang, Hu, Li, Lincheng, Huang, Zi, Wang, Dadong and Yu, Xin (2024). BAVS: Bootstrapping audio-visual segmentation by integrating foundation knowledge. IEEE Transactions on Multimedia, 26, 10015-10028. doi: 10.1109/tmm.2024.3405622 |
|
2024 Journal Article AI empowered Auslan learning for parents of deaf children and children of deaf adults. Sheng, Hongwei, Shen, Xin, Du, Heming, Zhang, Hu, Huang, Zi and Yu, Xin (2024). AI empowered Auslan learning for parents of deaf children and children of deaf adults. AI and Ethics, 4 (4), 1-11. doi: 10.1007/s43681-024-00457-y |
|
2024 Journal Article Detecting facial action units from global-local fine-grained expressions. Zhang, Wei, Li, Lincheng, Ding, Yu, Chen, Wei, Deng, Zhigang and Yu, Xin (2024). Detecting facial action units from global-local fine-grained expressions. IEEE Transactions on Circuits and Systems for Video Technology, 34 (2), 983-994. doi: 10.1109/tcsvt.2023.3288903 |
|
2024 Conference Publication When 3D bounding-box meets SAM: point cloud instance segmentation with weak-and-noisy supervision. Yu, Qingtao, Du, Heming, Liu, Chen and Yu, Xin (2024). When 3D bounding-box meets SAM: point cloud instance segmentation with weak-and-noisy supervision. 2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI, United States, 3-8 January 2024. Piscataway, NJ, United States: IEEE. doi: 10.1109/wacv57701.2024.00368 |
|
2024 Journal Article StyleTalk++: A unified framework for controlling the speaking styles of talking heads. Wang, Suzhen, Ma, Yifeng, Ding, Yu, Hu, Zhipeng, Fan, Changjie, Lv, Tangjie, Deng, Zhidong and Yu, Xin (2024). StyleTalk++: A unified framework for controlling the speaking styles of talking heads. IEEE Transactions on Pattern Analysis and Machine Intelligence, 46 (6), 4331-4347. doi: 10.1109/tpami.2024.3357808 |
|
2024 Conference Publication An empirical analysis on spatial reasoning capabilities of large multimodal models. Shiri, Fatemeh, Guo, Xiao-Yu, Far, Mona Golestan, Yu, Xin, Haffari, Gholamreza and Li, Yuan-Fang (2024). An empirical analysis on spatial reasoning capabilities of large multimodal models. 2024 Conference on Empirical Methods in Natural Language Processing, Miami, FL, United States, 12-16 November 2024. Kerrville, TX, United States: Association for Computational Linguistics (ACL). doi: 10.18653/v1/2024.emnlp-main.1195 |