Skip to menu Skip to content Skip to footer

2025

Conference Publication

FlashVTG: Feature Layering and Adaptive Score Handling Network for Video Temporal Grounding

Cao, Zhuo, Zhang, Bingqing, Du, Heming, Yu, Xin, Li, Xue and Wang, Sen (2025). FlashVTG: Feature Layering and Adaptive Score Handling Network for Video Temporal Grounding. IEEE. doi: 10.1109/wacv61041.2025.00894

FlashVTG: Feature Layering and Adaptive Score Handling Network for Video Temporal Grounding

2025

Conference Publication

TokenBinder: Text-Video Retrieval with One-to-Many Alignment Paradigm

Zhang, Bingqing, Cao, Zhuo, Du, Heming, Yu, Xin, Li, Xue, Liu, Jiajun and Wang, Sen (2025). TokenBinder: Text-Video Retrieval with One-to-Many Alignment Paradigm. IEEE. doi: 10.1109/wacv61041.2025.00485

TokenBinder: Text-Video Retrieval with One-to-Many Alignment Paradigm

2025

Conference Publication

Transferable Attacks for Semantic Segmentation

He, Mengqi, Zhang, Jing and Yu, Xin (2025). Transferable Attacks for Semantic Segmentation. 35th Australasian Database Conference, Gold Coast Australia, Dec 16-18, 2024. SINGAPORE: Springer Science and Business Media Deutschland GmbH. doi: 10.1007/978-981-96-1242-0_28

Transferable Attacks for Semantic Segmentation

2025

Conference Publication

Vision-based abnormal action dataset for recognising body motion disorders

Ying, Jiaying, Shen, Xin and Yu, Xin (2025). Vision-based abnormal action dataset for recognising body motion disorders. 37th Australasian Joint Conference on Artificial Intelligence, AI 2024, Melbourne, VIC, Australia, 25 - 29 November 2024. Singapore, Singapore: Springer Nature Singapore. doi: 10.1007/978-981-96-0351-0_33

Vision-based abnormal action dataset for recognising body motion disorders

2024

Conference Publication

CPT-VR: Improving Surface Rendering via Closest Point Transform with View-Reflection Appearance

Hu, Zhipeng, Zhang, Yongqiang, Liu, Chen, Li, Lincheng, Peng, Sida, Zhou, Xiaowei, Fan, Changjie and Yu, Xin (2024). CPT-VR: Improving Surface Rendering via Closest Point Transform with View-Reflection Appearance. 18th European Conference on Computer Vision, ECCV 2024, Milan, Italy, 29 September –4 October 2024. Cham, Switzerland: Springer. doi: 10.1007/978-3-031-73464-9_14

CPT-VR: Improving Surface Rendering via Closest Point Transform with View-Reflection Appearance

2024

Conference Publication

FreeAvatar: robust 3D facial animation transfer by learning an expression foundation model

Qiu, Feng, Zhang, Wei, Liu, Chen, An, Rudong, Li, Lincheng, Ding, Yu, Fan, Changjie, Hu, Zhipeng and Yu, Xin (2024). FreeAvatar: robust 3D facial animation transfer by learning an expression foundation model. SA '24: SIGGRAPH Asia 2024, Tokyo, Japan, 3-6 December 2024. New York, NY, United States: ACM. doi: 10.1145/3680528.3687669

FreeAvatar: robust 3D facial animation transfer by learning an expression foundation model

2024

Conference Publication

Snap and diagnose: an advanced multimodal retrieval system for identifying plant diseases in the wild

Wei, Tianqi, Chen, Zhi and Yu, Xin (2024). Snap and diagnose: an advanced multimodal retrieval system for identifying plant diseases in the wild. MMASIA ’24, Auckland, New Zealand, 3-6 December 2024. New York, United States: ACM. doi: 10.1145/3696409.3700293

Snap and diagnose: an advanced multimodal retrieval system for identifying plant diseases in the wild

2024

Conference Publication

Benchmarking in-the-wild multimodal disease recognition and a versatile baseline

Wei, Tianqi, Chen, Zhi, Huang, Zi and Yu, Xin (2024). Benchmarking in-the-wild multimodal disease recognition and a versatile baseline. MM '24: The 32nd ACM International Conference on Multimedia, Melbourne, VIC, Australia, 28 October-1 November 2024. New York, United States: Association for Computing Machinery. doi: 10.1145/3664647.3680599

Benchmarking in-the-wild multimodal disease recognition and a versatile baseline

2024

Conference Publication

Recent update on the Tsinghua tabletop Kibble balance

Li, S., Ma, Y., Ma, K., Liu, W., Li, N., Liu, X., Peng, L., Zhao, W., Huang, S. and Yu, X. (2024). Recent update on the Tsinghua tabletop Kibble balance. Conference on Precision Electromagnetic Measurements (CPEM) / Joint NCSL-International Annual Workshop and Symposium (NCSLI), Denver, CO United States, 8-12 July 2024. Piscataway, NJ United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/cpem61406.2024.10645985

Recent update on the Tsinghua tabletop Kibble balance

2024

Conference Publication

Learning transferable compound expressions from Masked AutoEncoder pretraining

Qiu, Feng, Du, Heming, Zhang, Wei, Liu, Chen, Li, Lincheng, Guo, Tianchen and Yu, Xin (2024). Learning transferable compound expressions from Masked AutoEncoder pretraining. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Seattle, WA, United States, 17-18 June 2024. Piscataway, NJ, United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/cvprw63382.2024.00476

Learning transferable compound expressions from Masked AutoEncoder pretraining

2024

Conference Publication

An effective ensemble learning framework for affective behaviour analysis

Zhang, Wei, Qiu, Feng, Liu, Chen, Li, Lincheng, Du, Heming, Guo, Tianchen and Yu, Xin (2024). An effective ensemble learning framework for affective behaviour analysis. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Seattle, WA, United States, 17-18 June 2024. Piscataway, NJ, United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/cvprw63382.2024.00479

An effective ensemble learning framework for affective behaviour analysis

2024

Conference Publication

Language-guided multi-modal emotional mimicry intensity estimation

Qiu, Feng, Zhang, Wei, Liu, Chen, Li, Lincheng, Du, Heming, Guo, Tianchen and Yu, Xin (2024). Language-guided multi-modal emotional mimicry intensity estimation. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Seattle, WA, United States, 17-18 June 2024. Piscataway, NJ, United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/cvprw63382.2024.00477

Language-guided multi-modal emotional mimicry intensity estimation

2024

Conference Publication

When 3D bounding-box meets SAM: point cloud instance segmentation with weak-and-noisy supervision

Yu, Qingtao, Du, Heming, Liu, Chen and Yu, Xin (2024). When 3D bounding-box meets SAM: point cloud instance segmentation with weak-and-noisy supervision. 2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI, United States, 3-8 January 2024. Piscataway, NJ, United States: IEEE. doi: 10.1109/wacv57701.2024.00368

When 3D bounding-box meets SAM: point cloud instance segmentation with weak-and-noisy supervision

2024

Conference Publication

Benchmarking audio visual segmentation for long-untrimmed videos

Liu, Chen, Li, Peike Patrick, Yu, Qingtao, Sheng, Hongwei, Wang, Dadong, Li, Lincheng and Yu, Xin (2024). Benchmarking audio visual segmentation for long-untrimmed videos. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, United States, 16-22 June 2024. Washington, DC, United States: IEEE Computer Society. doi: 10.1109/CVPR52733.2024.02143

Benchmarking audio visual segmentation for long-untrimmed videos

2024

Conference Publication

MMOOC: a multimodal misinformation dataset for out-of-context news analysis

Xu, Qingzheng, Du, Heming, Chen, Huiqiang, Liu, Bo and Yu, Xin (2024). MMOOC: a multimodal misinformation dataset for out-of-context news analysis. 29th Australasian Conference, ACISP 2024, Sydney, NSW, Australia, 15–17 July 2024. Heidelberg, Germany: Springer. doi: 10.1007/978-981-97-5101-3_24

MMOOC: a multimodal misinformation dataset for out-of-context news analysis

2024

Conference Publication

EfficientDreamer: high-fidelity and stable 3D creation via orthogonal-view diffusion priors

Hu, Zhipeng, Zhao, Minda, Zhao, Chaoyi, Liang, Xinyue, Li, Lincheng, Zhao, Zeng, Fan, Changjie, Zhou, Xiaowei and Yu, Xin (2024). EfficientDreamer: high-fidelity and stable 3D creation via orthogonal-view diffusion priors. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, United States, 16-22 June 2024. Washington, DC, United States: IEEE Computer Society. doi: 10.1109/CVPR52733.2024.00473

EfficientDreamer: high-fidelity and stable 3D creation via orthogonal-view diffusion priors

2024

Conference Publication

Text-guided 3D face synthesis - from generation to editing

Wu, Yunjie, Meng, Yapeng, Hu, Zhipeng, Li, Lincheng, Wu, Haoqian, Zhou, Kun, Xu, Weiwei and Yu, Xin (2024). Text-guided 3D face synthesis - from generation to editing. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, United States, 16-22 June 2024. Washington, DC, United States: IEEE Computer Society. doi: 10.1109/CVPR52733.2024.00126

Text-guided 3D face synthesis - from generation to editing

2024

Conference Publication

AS-NeRF: learning auxiliary sampling for generalizable novel view synthesis from sparse views

Tang, Jilin, Li, Lincheng, Qi, Xingqun, Chen, Yingfeng, Fan, Changjie and Yu, Xin (2024). AS-NeRF: learning auxiliary sampling for generalizable novel view synthesis from sparse views. 2024 IEEE International Conference on Multimedia and Expo (ICME), Niagara Falls, ON, Canada, 15-19 July 2024. Washington, DC, United States: IEEE Computer Society. doi: 10.1109/ICME57554.2024.10688126

AS-NeRF: learning auxiliary sampling for generalizable novel view synthesis from sparse views

2024

Conference Publication

Pupil-fMRI correlation-based Explainable AI to classify Alzheimer’s Disease

Liu, Xiaochen, Xu, William, Hike, David, Xie, Zeping, Liu, Andy, Choi, Sangcheon, Zhu, Biyue, Ran, Chongzhao, Jiang, Yuanyuan and Yu, Xin (2024). Pupil-fMRI correlation-based Explainable AI to classify Alzheimer’s Disease. 2024 ISMRM & ISMRT Annual Meeting, Singapore, 4-9 May 2024. Concord, CA United States: ISMRM. doi: 10.58530/2024/1124

Pupil-fMRI correlation-based Explainable AI to classify Alzheimer’s Disease

2023

Conference Publication

Learning efficient unsupervised satellite image-based building damage detection

Zhang, Yiyun, Wang, Zijian, Luo, Yadan, Yu, Xin and Huang, Zi (2023). Learning efficient unsupervised satellite image-based building damage detection. 2023 IEEE International Conference on Data Mining (ICDM), Shanghai, China, 1-4 December 2023. Piscataway, NJ, United States: IEEE. doi: 10.1109/icdm58522.2023.00206

Learning efficient unsupervised satellite image-based building damage detection