2025 Conference Publication FlashVTG: Feature Layering and Adaptive Score Handling Network for Video Temporal GroundingCao, Zhuo, Zhang, Bingqing, Du, Heming, Yu, Xin, Li, Xue and Wang, Sen (2025). FlashVTG: Feature Layering and Adaptive Score Handling Network for Video Temporal Grounding. IEEE. doi: 10.1109/wacv61041.2025.00894 |
2025 Conference Publication TokenBinder: Text-Video Retrieval with One-to-Many Alignment ParadigmZhang, Bingqing, Cao, Zhuo, Du, Heming, Yu, Xin, Li, Xue, Liu, Jiajun and Wang, Sen (2025). TokenBinder: Text-Video Retrieval with One-to-Many Alignment Paradigm. IEEE. doi: 10.1109/wacv61041.2025.00485 |
2025 Conference Publication Transferable Attacks for Semantic SegmentationHe, Mengqi, Zhang, Jing and Yu, Xin (2025). Transferable Attacks for Semantic Segmentation. 35th Australasian Database Conference, Gold Coast Australia, Dec 16-18, 2024. SINGAPORE: Springer Science and Business Media Deutschland GmbH. doi: 10.1007/978-981-96-1242-0_28 |
2025 Conference Publication Vision-based abnormal action dataset for recognising body motion disordersYing, Jiaying, Shen, Xin and Yu, Xin (2025). Vision-based abnormal action dataset for recognising body motion disorders. 37th Australasian Joint Conference on Artificial Intelligence, AI 2024, Melbourne, VIC, Australia, 25 - 29 November 2024. Singapore, Singapore: Springer Nature Singapore. doi: 10.1007/978-981-96-0351-0_33 |
2024 Conference Publication CPT-VR: Improving Surface Rendering via Closest Point Transform with View-Reflection AppearanceHu, Zhipeng, Zhang, Yongqiang, Liu, Chen, Li, Lincheng, Peng, Sida, Zhou, Xiaowei, Fan, Changjie and Yu, Xin (2024). CPT-VR: Improving Surface Rendering via Closest Point Transform with View-Reflection Appearance. 18th European Conference on Computer Vision, ECCV 2024, Milan, Italy, 29 September –4 October 2024. Cham, Switzerland: Springer. doi: 10.1007/978-3-031-73464-9_14 |
2024 Conference Publication FreeAvatar: robust 3D facial animation transfer by learning an expression foundation modelQiu, Feng, Zhang, Wei, Liu, Chen, An, Rudong, Li, Lincheng, Ding, Yu, Fan, Changjie, Hu, Zhipeng and Yu, Xin (2024). FreeAvatar: robust 3D facial animation transfer by learning an expression foundation model. SA '24: SIGGRAPH Asia 2024, Tokyo, Japan, 3-6 December 2024. New York, NY, United States: ACM. doi: 10.1145/3680528.3687669 |
2024 Conference Publication Snap and diagnose: an advanced multimodal retrieval system for identifying plant diseases in the wildWei, Tianqi, Chen, Zhi and Yu, Xin (2024). Snap and diagnose: an advanced multimodal retrieval system for identifying plant diseases in the wild. MMASIA ’24, Auckland, New Zealand, 3-6 December 2024. New York, United States: ACM. doi: 10.1145/3696409.3700293 |
2024 Conference Publication Benchmarking in-the-wild multimodal disease recognition and a versatile baselineWei, Tianqi, Chen, Zhi, Huang, Zi and Yu, Xin (2024). Benchmarking in-the-wild multimodal disease recognition and a versatile baseline. MM '24: The 32nd ACM International Conference on Multimedia, Melbourne, VIC, Australia, 28 October-1 November 2024. New York, United States: Association for Computing Machinery. doi: 10.1145/3664647.3680599 |
2024 Conference Publication Recent update on the Tsinghua tabletop Kibble balanceLi, S., Ma, Y., Ma, K., Liu, W., Li, N., Liu, X., Peng, L., Zhao, W., Huang, S. and Yu, X. (2024). Recent update on the Tsinghua tabletop Kibble balance. Conference on Precision Electromagnetic Measurements (CPEM) / Joint NCSL-International Annual Workshop and Symposium (NCSLI), Denver, CO United States, 8-12 July 2024. Piscataway, NJ United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/cpem61406.2024.10645985 |
2024 Conference Publication Learning transferable compound expressions from Masked AutoEncoder pretrainingQiu, Feng, Du, Heming, Zhang, Wei, Liu, Chen, Li, Lincheng, Guo, Tianchen and Yu, Xin (2024). Learning transferable compound expressions from Masked AutoEncoder pretraining. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Seattle, WA, United States, 17-18 June 2024. Piscataway, NJ, United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/cvprw63382.2024.00476 |
2024 Conference Publication An effective ensemble learning framework for affective behaviour analysisZhang, Wei, Qiu, Feng, Liu, Chen, Li, Lincheng, Du, Heming, Guo, Tianchen and Yu, Xin (2024). An effective ensemble learning framework for affective behaviour analysis. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Seattle, WA, United States, 17-18 June 2024. Piscataway, NJ, United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/cvprw63382.2024.00479 |
2024 Conference Publication Language-guided multi-modal emotional mimicry intensity estimationQiu, Feng, Zhang, Wei, Liu, Chen, Li, Lincheng, Du, Heming, Guo, Tianchen and Yu, Xin (2024). Language-guided multi-modal emotional mimicry intensity estimation. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Seattle, WA, United States, 17-18 June 2024. Piscataway, NJ, United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/cvprw63382.2024.00477 |
2024 Conference Publication When 3D bounding-box meets SAM: point cloud instance segmentation with weak-and-noisy supervisionYu, Qingtao, Du, Heming, Liu, Chen and Yu, Xin (2024). When 3D bounding-box meets SAM: point cloud instance segmentation with weak-and-noisy supervision. 2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI, United States, 3-8 January 2024. Piscataway, NJ, United States: IEEE. doi: 10.1109/wacv57701.2024.00368 |
2024 Conference Publication Benchmarking audio visual segmentation for long-untrimmed videosLiu, Chen, Li, Peike Patrick, Yu, Qingtao, Sheng, Hongwei, Wang, Dadong, Li, Lincheng and Yu, Xin (2024). Benchmarking audio visual segmentation for long-untrimmed videos. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, United States, 16-22 June 2024. Washington, DC, United States: IEEE Computer Society. doi: 10.1109/CVPR52733.2024.02143 |
2024 Conference Publication MMOOC: a multimodal misinformation dataset for out-of-context news analysisXu, Qingzheng, Du, Heming, Chen, Huiqiang, Liu, Bo and Yu, Xin (2024). MMOOC: a multimodal misinformation dataset for out-of-context news analysis. 29th Australasian Conference, ACISP 2024, Sydney, NSW, Australia, 15–17 July 2024. Heidelberg, Germany: Springer. doi: 10.1007/978-981-97-5101-3_24 |
2024 Conference Publication EfficientDreamer: high-fidelity and stable 3D creation via orthogonal-view diffusion priorsHu, Zhipeng, Zhao, Minda, Zhao, Chaoyi, Liang, Xinyue, Li, Lincheng, Zhao, Zeng, Fan, Changjie, Zhou, Xiaowei and Yu, Xin (2024). EfficientDreamer: high-fidelity and stable 3D creation via orthogonal-view diffusion priors. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, United States, 16-22 June 2024. Washington, DC, United States: IEEE Computer Society. doi: 10.1109/CVPR52733.2024.00473 |
2024 Conference Publication Text-guided 3D face synthesis - from generation to editingWu, Yunjie, Meng, Yapeng, Hu, Zhipeng, Li, Lincheng, Wu, Haoqian, Zhou, Kun, Xu, Weiwei and Yu, Xin (2024). Text-guided 3D face synthesis - from generation to editing. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, United States, 16-22 June 2024. Washington, DC, United States: IEEE Computer Society. doi: 10.1109/CVPR52733.2024.00126 |
2024 Conference Publication AS-NeRF: learning auxiliary sampling for generalizable novel view synthesis from sparse viewsTang, Jilin, Li, Lincheng, Qi, Xingqun, Chen, Yingfeng, Fan, Changjie and Yu, Xin (2024). AS-NeRF: learning auxiliary sampling for generalizable novel view synthesis from sparse views. 2024 IEEE International Conference on Multimedia and Expo (ICME), Niagara Falls, ON, Canada, 15-19 July 2024. Washington, DC, United States: IEEE Computer Society. doi: 10.1109/ICME57554.2024.10688126 |
2024 Conference Publication Pupil-fMRI correlation-based Explainable AI to classify Alzheimer’s DiseaseLiu, Xiaochen, Xu, William, Hike, David, Xie, Zeping, Liu, Andy, Choi, Sangcheon, Zhu, Biyue, Ran, Chongzhao, Jiang, Yuanyuan and Yu, Xin (2024). Pupil-fMRI correlation-based Explainable AI to classify Alzheimer’s Disease. 2024 ISMRM & ISMRT Annual Meeting, Singapore, 4-9 May 2024. Concord, CA United States: ISMRM. doi: 10.58530/2024/1124 |
2023 Conference Publication Learning efficient unsupervised satellite image-based building damage detectionZhang, Yiyun, Wang, Zijian, Luo, Yadan, Yu, Xin and Huang, Zi (2023). Learning efficient unsupervised satellite image-based building damage detection. 2023 IEEE International Conference on Data Mining (ICDM), Shanghai, China, 1-4 December 2023. Piscataway, NJ, United States: IEEE. doi: 10.1109/icdm58522.2023.00206 |