|
2025 Conference Publication LatentHOI: on the generalizable hand object motion generation with latent hand diffusionLi, Muchen, Christen, Sammy, Wan, Chengde, Cai, Yujun, Liao, Renjie, Sigal, Leonid and Ma, Shugao (2025). LatentHOI: on the generalizable hand object motion generation with latent hand diffusion. 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN, United States, 10-17 June 2025. Washington, DC, United States: IEEE Computer Society. doi: 10.1109/CVPR52734.2025.01623 |
|
2025 Conference Publication Vulnerability of LLMs to vertically aligned text manipulationsLi, Zhecheng, Wang, Yiwei, Hooi, Bryan, Cai, Yujun, Xiong, Zhen, Peng, Nanyun and Chang, Kai-Wei (2025). Vulnerability of LLMs to vertically aligned text manipulations. 63rd Annual Meeting of the Association for Computational Linguistics, Vienna, Austria, 27 July-1 August 2025. Stroudsburg, PA USA: Association for Computational Linguistics. doi: 10.18653/v1/2025.acl-long.978 |
|
2025 Conference Publication Exploring visual vulnerabilities via multi-loss adversarial search for jailbreaking vision-language modelsHao, Shuyang, Hooi, Bryan, Liu, Jun, Chang, Kai-Wei, Huang, Zi and Cai, Yujun (2025). Exploring visual vulnerabilities via multi-loss adversarial search for jailbreaking vision-language models. 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TX, United States, 10 - 17 June 2025. Washington, DC, United States: I E E E Computer Society. doi: 10.1109/cvpr52734.2025.01852 |
|
2025 Conference Publication DiMo-GUI: Advancing Test-time Scaling in GUI Grounding via Modality-Aware Visual ReasoningWu, Hang, Chen, Hongkai, Cai, Yujun, Liu, Chang, Ye, Qingwen, Yang, Ming-Hsuan and Wang, Yiwei (2025). DiMo-GUI: Advancing Test-time Scaling in GUI Grounding via Modality-Aware Visual Reasoning. Association for Computational Linguistics (ACL). doi: 10.18653/v1/2025.emnlp-main.1334 |
|
2025 Conference Publication CON-RECALL: Detecting Pre-training Data in LLMs via Contrastive DecodingWang, Cheng, Wang, Yiwei, Hooi, Bryan, Cai, Yujun, Peng, Nanyun and Chang, Kai-Wei (2025). CON-RECALL: Detecting Pre-training Data in LLMs via Contrastive Decoding. 31st International Conference on Computational Linguistics, Abu Dhabi, United Arab Emirates, 19-24 January 2025. Stroudsburg, PA, United States: Association for Computational Linguistics (ACL). |
|
2025 Conference Publication Mapping the Minds of LLMs: A Graph-Based Analysis of Reasoning LLMsXiong, Zhen, Cai, Yujun, Li, Zhecheng and Wang, Yiwei (2025). Mapping the Minds of LLMs: A Graph-Based Analysis of Reasoning LLMs. Association for Computational Linguistics (ACL). doi: 10.18653/v1/2025.emnlp-main.896 |
|
2025 Conference Publication Tricking retrievers with influential tokens: an efficient black-box corpus poisoning attackWang, Cheng, Wang, Yiwei, Cai, Yujun and Hooi, Bryan (2025). Tricking retrievers with influential tokens: an efficient black-box corpus poisoning attack. 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, Albuquerque, New Mexico, 29 April-4 May 2025. Albuquerque, New Mexico: Association for Computational Linguistics (ACL). doi: 10.18653/v1/2025.naacl-long.210 |
|
2025 Conference Publication Energy-Calibrated VAE with Test Time Free LunchLuo, Yihong, Qiu, Siya, Tao, Xingjian, Cai, Yujun and Tang, Jing (2025). Energy-Calibrated VAE with Test Time Free Lunch. 18th European Conference on Computer Vision (ECCV), Milan Italy, Sep 29-Oct 04, 2024. Heidelberg, Germany: Springer. doi: 10.1007/978-3-031-73013-9_19 |
|
2025 Conference Publication VistaWise: Building Cost-Effective Agent with Cross-Modal Knowledge Graph for MinecraftFu, Honghao, Ren, Junlong, Chai, Qi, Ye, Deheng, Cai, Yujun and Wang, Hao (2025). VistaWise: Building Cost-Effective Agent with Cross-Modal Knowledge Graph for Minecraft. Association for Computational Linguistics (ACL). doi: 10.18653/v1/2025.emnlp-main.1111 |
|
2025 Conference Publication SemVink: Advancing VLMs' Semantic Understanding of Optical Illusions via Visual Global ThinkingLi, Sifan, Cai, Yujun and Wang, Yiwei (2025). SemVink: Advancing VLMs' Semantic Understanding of Optical Illusions via Visual Global Thinking. Association for Computational Linguistics (ACL). doi: 10.18653/v1/2025.emnlp-main.1381 |
|
2025 Conference Publication DRS: Deep question reformulation with structured outputLi, Zhecheng, Wang, Yiwei, Hooi, Bryan, Cai, Yujun, Peng, Nanyun and Chang, Kai-Wei (2025). DRS: Deep question reformulation with structured output. 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025), Vienna, Austria, 27 July-1 August 2025. Stroudsburg, PA, United States: Association for Computational Linguistics. doi: 10.18653/v1/2025.findings-acl.666 |
|
2024 Conference Publication STMG: a machine learning microgesture recognition system for supporting thumb-based VR/AR inputKin, Kenrick, Wan, Chengde, Koh, Ken, Marin, Andrei, Camgöz, Necati Cihan, Zhang, Yubo, Cai, Yujun, Kovalev, Fedor, Ben-Zacharia, Moshe, Hoople, Shannon, Nunes-Ueno, Marcos, Sanchez-Rodriguez, Mariel, Bhargava, Ayush, Wang, Robert, Sauser, Eric and Ma, Shugao (2024). STMG: a machine learning microgesture recognition system for supporting thumb-based VR/AR input. CHI '24: CHI Conference on Human Factors in Computing Systems, Honolulu, HI USA, 11-16 May 2024. New York, NY USA: Association for Computing Machinery. doi: 10.1145/3613904.3642702 |
|
2024 Conference Publication Social diffusion: long-term multiple human motion anticipationTanke, Julian, Zhang, Linguang, Zhao, Amy, Tang, Chengcheng, Cai, Yujun, Wang, Lezi, Wu, Po-Chen, Gall, Juergen and Keskin, Cem (2024). Social diffusion: long-term multiple human motion anticipation. 2023 IEEE/CVF International Conference on Computer Vision (ICCV), Paris, France, 1-6 October 2023. Piscataway, NJ USA: Institute of Electrical and Electronics Engineers. doi: 10.1109/ICCV51070.2023.00880 |
|
2024 Conference Publication DisC-GS: discontinuity-aware Gaussian splattingQu, Haoxuan, Li, Zhuoling, Rahmani, Hossein, Cai, Yujun and Liu, Jun (2024). DisC-GS: discontinuity-aware Gaussian splatting. 38th Annual Conference on Neural Information Processing Systems (NeurIPS 2024), Vancouver, BC, Canada, 10-15 December 2024. San Mateo, CA, United States: Morgan Kaufmann Publishers. doi: 10.52202/079017-3566 |
|
2024 Conference Publication LLMs are good action recognizersQu, Haoxuan, Cai, Yujun and Liu, Jun (2024). LLMs are good action recognizers. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, United States, 16-22 June 2024. Washington, DC, United States: IEEE Computer Society. doi: 10.1109/CVPR52733.2024.01741 |
|
2024 Conference Publication emg2pose: a large and diverse benchmark for surface electromyographic hand pose estimationSalter, Sasha, Warren, Richard, Schlager, Collin, Spurr, Adrian, Han, Shangchen, Bhasin, Rohin, Cai, Yujun, Walkington, Peter, Bolarinwa, Anuoluwapo, Wang, Robert, Danielson, Nathan, Merel, Josh, Pnevmatikakis, Eftychios and Marshall, Jesse (2024). emg2pose: a large and diverse benchmark for surface electromyographic hand pose estimation. 38th Annual Conference on Neural Information Processing Systems (NeurIPS 2024), Vancouver, BC, Canada, 10-15 December 2024. San Mateo, CA, United States: Morgan Kaufmann Publishers. doi: 10.52202/079017-1770 |
|
2024 Conference Publication 6D-Diff: a keypoint diffusion framework for 6D object pose estimationXu, Li, Qu, Haoxuan, Cai, Yujun and Liu, Jun (2024). 6D-Diff: a keypoint diffusion framework for 6D object pose estimation. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, United States, 16-22 June 2024. Piscataway, NJ, United States: IEEE Computer Society. doi: 10.1109/CVPR52733.2024.00924 |
|
2023 Conference Publication LMC: large model collaboration with cross-assessment for training-free open-set object recognitionQu, Haoxuan, Hui, Xiaofei, Cai, Yujun and Liu, Jun (2023). LMC: large model collaboration with cross-assessment for training-free open-set object recognition. NIPS'23: 37th International Conference on Neural Information Processing Systems, New Orleans, LA USA, 10-16 December 2023. Maryland Heights, MO USA: Morgan Kaufmann Publishers. doi: 10.5555/3666122.3668138 |
|
2023 Conference Publication Primacy effect of ChatGPTWang, Yiwei, Cai, Yujun, Chen, Muhao, Liang, Yuxuan and Hooi, Bryan (2023). Primacy effect of ChatGPT. 2023 Conference on Empirical Methods in Natural Language Processing, Singapore, Singapore, 6-10 December 2023. Kerrville, TX USA: Association for Computational Linguistics. doi: 10.18653/v1/2023.emnlp-main.8 |
|
2023 Conference Publication How fragile is relation extraction under entity replacements?Wang, Yiwei, Hooi, Bryan, Wang, Fei, Cai, Yujun, Liang, Yuxuan, Zhou, Wenxuan, Tang, Jing, Duan, Manjuan and Chen, Muhao (2023). How fragile is relation extraction under entity replacements?. 27th Conference on Computational Natural Language Learning (CoNLL), Singapore, Singapore, 6-7 December 2023. Kerrville, TX USA: Association for Computational Linguistics. doi: 10.18653/v1/2023.conll-1.27 |