Skip to menu Skip to content Skip to footer

2025

Conference Publication

Vision-based abnormal action dataset for recognising body motion disorders

Ying, Jiaying, Shen, Xin and Yu, Xin (2025). Vision-based abnormal action dataset for recognising body motion disorders. 37th Australasian Joint Conference on Artificial Intelligence, AI 2024, Melbourne, VIC, Australia, 25 - 29 November 2024. Singapore, Singapore: Springer Nature Singapore. doi: 10.1007/978-981-96-0351-0_33

Vision-based abnormal action dataset for recognising body motion disorders

2025

Conference Publication

Compound expression recognition via curriculum learning

Liu, Chen, Qiu, Feng, Zhang, Wei, Li, Lincheng, Wang, Dadong and Yu, Xin (2025). Compound expression recognition via curriculum learning. ECCV 2024 Workshops, Milan, Italy, 29 September - 4 October 2024. Heidelberg, Germany: Springer. doi: 10.1007/978-3-031-91581-9_20

Compound expression recognition via curriculum learning

2025

Conference Publication

Who is Being Impersonated? Deepfake audio detection and impersonated identification via extraction of ID-specific features

Guo, Tianchen, Du, Heming, Huo, Huan, Liu, Bo and Yu, Xin (2025). Who is Being Impersonated? Deepfake audio detection and impersonated identification via extraction of ID-specific features. 24th International Conference on Algorithms and Architectures for Parallel Processing (ICA3PP 2024), Macau, China, 29-31 October 2024. Singapore: Springer. doi: 10.1007/978-981-96-1548-3_21

Who is Being Impersonated? Deepfake audio detection and impersonated identification via extraction of ID-specific features

2024

Conference Publication

CPT-VR: Improving Surface Rendering via Closest Point Transform with View-Reflection Appearance

Hu, Zhipeng, Zhang, Yongqiang, Liu, Chen, Li, Lincheng, Peng, Sida, Zhou, Xiaowei, Fan, Changjie and Yu, Xin (2024). CPT-VR: Improving Surface Rendering via Closest Point Transform with View-Reflection Appearance. 18th European Conference on Computer Vision, ECCV 2024, Milan, Italy, 29 September –4 October 2024. Cham, Switzerland: Springer. doi: 10.1007/978-3-031-73464-9_14

CPT-VR: Improving Surface Rendering via Closest Point Transform with View-Reflection Appearance

2024

Conference Publication

FreeAvatar: robust 3D facial animation transfer by learning an expression foundation model

Qiu, Feng, Zhang, Wei, Liu, Chen, An, Rudong, Li, Lincheng, Ding, Yu, Fan, Changjie, Hu, Zhipeng and Yu, Xin (2024). FreeAvatar: robust 3D facial animation transfer by learning an expression foundation model. SA '24: SIGGRAPH Asia 2024, Tokyo, Japan, 3-6 December 2024. New York, NY, United States: ACM. doi: 10.1145/3680528.3687669

FreeAvatar: robust 3D facial animation transfer by learning an expression foundation model

2024

Conference Publication

Snap and diagnose: an advanced multimodal retrieval system for identifying plant diseases in the wild

Wei, Tianqi, Chen, Zhi and Yu, Xin (2024). Snap and diagnose: an advanced multimodal retrieval system for identifying plant diseases in the wild. MMASIA ’24, Auckland, New Zealand, 3-6 December 2024. New York, United States: ACM. doi: 10.1145/3696409.3700293

Snap and diagnose: an advanced multimodal retrieval system for identifying plant diseases in the wild

2024

Journal Article

M3 A: A multimodal misinformation dataset for media authenticity analysis

Xu, Qingzheng, Chen, Huiqiang, Du, Heming, Zhang, Hu, Łukasik, Szymon, Zhu, Tianqing and Yu, Xin (2024). M3 A: A multimodal misinformation dataset for media authenticity analysis. Computer Vision and Image Understanding, 249 104205. doi: 10.1016/j.cviu.2024.104205

M3 A: A multimodal misinformation dataset for media authenticity analysis

2024

Book Chapter

OpenSight: A Simple Open-Vocabulary Framework for LiDAR-Based Object Detection

Zhang, Hu, Xu, Jianhua, Tang, Tao, Sun, Haiyang, Yu, Xin, Huang, Zi and Yu, Kaicheng (2024). OpenSight: A Simple Open-Vocabulary Framework for LiDAR-Based Object Detection. Lecture Notes in Computer Science. (pp. 1-19) Cham: Springer Nature Switzerland. doi: 10.1007/978-3-031-72907-2_1

OpenSight: A Simple Open-Vocabulary Framework for LiDAR-Based Object Detection

2024

Conference Publication

Benchmarking in-the-wild multimodal disease recognition and a versatile baseline

Wei, Tianqi, Chen, Zhi, Huang, Zi and Yu, Xin (2024). Benchmarking in-the-wild multimodal disease recognition and a versatile baseline. MM '24: The 32nd ACM International Conference on Multimedia, Melbourne, VIC, Australia, 28 October-1 November 2024. New York, United States: Association for Computing Machinery. doi: 10.1145/3664647.3680599

Benchmarking in-the-wild multimodal disease recognition and a versatile baseline

2024

Journal Article

Ethics-aware face recognition aided by synthetic face images

Du, Xiaobiao, Yu, Xin, Liu, Jinhui, Dai, Beifen and Xu, Feng (2024). Ethics-aware face recognition aided by synthetic face images. Neurocomputing, 600 128129, 128129. doi: 10.1016/j.neucom.2024.128129

Ethics-aware face recognition aided by synthetic face images

2024

Conference Publication

Machine Unlearning via Null Space Calibration

Chen, Huiqiang, Zhu, Tianqing, Yu, Xin and Zhou, Wanlei (2024). Machine Unlearning via Null Space Calibration. 33rd International Joint Conference on Artificial Intelligence (IJCAI), Jeju, South Korea, 3-9 August 2024. California: International Joint Conferences on Artificial Intelligence Organization. doi: 10.24963/ijcai.2024/40

Machine Unlearning via Null Space Calibration

2024

Conference Publication

Learning transferable compound expressions from Masked AutoEncoder pretraining

Qiu, Feng, Du, Heming, Zhang, Wei, Liu, Chen, Li, Lincheng, Guo, Tianchen and Yu, Xin (2024). Learning transferable compound expressions from Masked AutoEncoder pretraining. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Seattle, WA, United States, 17-18 June 2024. Piscataway, NJ, United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/cvprw63382.2024.00476

Learning transferable compound expressions from Masked AutoEncoder pretraining

2024

Conference Publication

An effective ensemble learning framework for affective behaviour analysis

Zhang, Wei, Qiu, Feng, Liu, Chen, Li, Lincheng, Du, Heming, Guo, Tianchen and Yu, Xin (2024). An effective ensemble learning framework for affective behaviour analysis. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Seattle, WA, United States, 17-18 June 2024. Piscataway, NJ, United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/cvprw63382.2024.00479

An effective ensemble learning framework for affective behaviour analysis

2024

Conference Publication

Language-guided multi-modal emotional mimicry intensity estimation

Qiu, Feng, Zhang, Wei, Liu, Chen, Li, Lincheng, Du, Heming, Guo, Tianchen and Yu, Xin (2024). Language-guided multi-modal emotional mimicry intensity estimation. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Seattle, WA, United States, 17-18 June 2024. Piscataway, NJ, United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/cvprw63382.2024.00477

Language-guided multi-modal emotional mimicry intensity estimation

2024

Journal Article

Proactive image manipulation detection via deep semi-fragile watermark

Zhao, Yuan, Liu, Bo, Zhu, Tianqing, Ding, Ming, Yu, Xin and Zhou, Wanlei (2024). Proactive image manipulation detection via deep semi-fragile watermark. Neurocomputing, 585 127593. doi: 10.1016/j.neucom.2024.127593

Proactive image manipulation detection via deep semi-fragile watermark

2024

Conference Publication

DiPEx: Dispersing Prompt Expansion for class-agnostic object detection

Lim, Jia Syuen, Chen, Zhuoxiao, Baktashmotlagh, Mahsa, Chen, Zhi, Yu, Xin, Huang, Zi and Luo, Yadan (2024). DiPEx: Dispersing Prompt Expansion for class-agnostic object detection. 38th Annual Conference on Neural Information Processing Systems (NeurIPS 2024), Vancouver, BC, Canada, 10-15 December 2024. San Mateo, CA, United States: Morgan Kaufmann Publishers. doi: 10.52202/079017-0781

DiPEx: Dispersing Prompt Expansion for class-agnostic object detection

2024

Journal Article

BAVS: Bootstrapping audio-visual segmentation by integrating foundation knowledge

Liu, Chen, Li, Peike, Zhang, Hu, Li, Lincheng, Huang, Zi, Wang, Dadong and Yu, Xin (2024). BAVS: Bootstrapping audio-visual segmentation by integrating foundation knowledge. IEEE Transactions on Multimedia, 26, 10015-10028. doi: 10.1109/tmm.2024.3405622

BAVS: Bootstrapping audio-visual segmentation by integrating foundation knowledge

2024

Journal Article

AI empowered Auslan learning for parents of deaf children and children of deaf adults

Sheng, Hongwei, Shen, Xin, Du, Heming, Zhang, Hu, Huang, Zi and Yu, Xin (2024). AI empowered Auslan learning for parents of deaf children and children of deaf adults. AI and Ethics, 4 (4), 1-11. doi: 10.1007/s43681-024-00457-y

AI empowered Auslan learning for parents of deaf children and children of deaf adults

2024

Journal Article

Detecting facial action units from global-local fine-grained expressions

Zhang, Wei, Li, Lincheng, Ding, Yu, Chen, Wei, Deng, Zhigang and Yu, Xin (2024). Detecting facial action units from global-local fine-grained expressions. IEEE Transactions on Circuits and Systems for Video Technology, 34 (2), 983-994. doi: 10.1109/tcsvt.2023.3288903

Detecting facial action units from global-local fine-grained expressions

2024

Conference Publication

When 3D bounding-box meets SAM: point cloud instance segmentation with weak-and-noisy supervision

Yu, Qingtao, Du, Heming, Liu, Chen and Yu, Xin (2024). When 3D bounding-box meets SAM: point cloud instance segmentation with weak-and-noisy supervision. 2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI, United States, 3-8 January 2024. Piscataway, NJ, United States: IEEE. doi: 10.1109/wacv57701.2024.00368

When 3D bounding-box meets SAM: point cloud instance segmentation with weak-and-noisy supervision