2024

Conference Publication

Find n’ Propagate: Open-Vocabulary 3D Object Detection in Urban Environments

Etchegaray, Djamahl, Huang, Zi, Harada, Tatsuya and Luo, Yadan (2024). Find n’ Propagate: Open-Vocabulary 3D Object Detection in Urban Environments. 18th European Conference on Computer Vision, ECCV 2024, Milan, Italy, 29 September – 4 October 2024. Cham, Switzerland: Springer. doi: 10.1007/978-3-031-73661-2_8

2024

Conference Publication

Navigating Adversarial Robustness in Multimodal Systems

Huang, Zi Helen (2024). Navigating Adversarial Robustness in Multimodal Systems. 2nd International Workshop on Methodologies for Multimedia (Meet4MM), Melbourne, Australia, 28 October - 1 November 2024. New York, NY, USA: ACM. doi: 10.1145/3689089.3689708

2024

Conference Publication

Navigating Beyond Instructions: Vision-and-Language Navigation in Obstructed Environments

Hong, Haodong, Wang, Sen, Huang, Zi, Wu, Qi and Liu, Jiajun (2024). Navigating Beyond Instructions: Vision-and-Language Navigation in Obstructed Environments. New York, NY, USA: ACM. doi: 10.1145/3664647.3681640

2024

Conference Publication

DPO: Dual-Perturbation Optimization for Test-time Adaptation in 3D Object Detection

Chen, Zhuoxiao, Wang, Zixin, Luo, Yadan, Wang, Sen and Huang, Zi (2024). DPO: Dual-Perturbation Optimization for Test-time Adaptation in 3D Object Detection. New York, NY, USA: ACM. doi: 10.1145/3664647.3681040

2024

Conference Publication

Benchmarking In-the-Wild Multimodal Disease Recognition and A Versatile Baseline

Wei, Tianqi, Chen, Zhi, Huang, Zi and Yu, Xin (2024). Benchmarking In-the-Wild Multimodal Disease Recognition and A Versatile Baseline. New York, NY, USA: ACM. doi: 10.1145/3664647.3680599

2024

Conference Publication

Generative AI in Multimedia: Challenges and Opportunities for Academic and Industrial Impact

Huang, Zi Helen, Chen, Phoebe and Yan, Shuicheng (2024). Generative AI in Multimedia: Challenges and Opportunities for Academic and Industrial Impact. New York, NY, USA: ACM. doi: 10.1145/3664647.3687170

2024

Conference Publication

Physics-guided Active Sample Reweighting for Urban Flow Prediction

Jiang, Wei, Chen, Tong, Ye, Guanhua, Zhang, Wentao, Cui, Lizhen, Huang, Zi and Yin, Hongzhi (2024). Physics-guided Active Sample Reweighting for Urban Flow Prediction. 33rd ACM International Conference on Information and Knowledge Management (CIKM), Boise, ID, United States, 21-25 October 2024. New York, NY, USA: ACM. doi: 10.1145/3627673.3679738

2024

Conference Publication

Universal adversarial perturbations for vision-language pre-trained models

Zhang, Peng-Fei, Huang, Zi and Bai, Guangdong (2024). Universal adversarial perturbations for vision-language pre-trained models. 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, Washington, DC, United States, 14-18 July 2024. New York, NY, United States: ACM. doi: 10.1145/3626772.3657781

2024

Conference Publication

CaseLink: inductive graph learning for legal case retrieval

Tang, Yanran, Qiu, Ruihong, Yin, Hongzhi, Li, Xue and Huang, Zi (2024). CaseLink: inductive graph learning for legal case retrieval. 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, Washington, DC, United States, 14-18 July 2024. New York, NY, United States: ACM. doi: 10.1145/3626772.3657693

2024

Conference Publication

Edge deployable online domain adaptation for underwater object detection

Etchegaray, Djamahl, Luo, Yadan, Li, Yang, Do, Brendan, Liu, Jiajun, Huang, Zi and Kusy, Branislav (2024). Edge deployable online domain adaptation for underwater object detection. 2024 International Joint Conference on Neural Networks (IJCNN), Yokohama, Japan, 30 June - 5 July 2024. Piscataway, NJ, United States: IEEE. doi: 10.1109/ijcnn60899.2024.10650705

2024

Conference Publication

Abstract and explore: a novel behavioral metric with cyclic dynamics in reinforcement learning

Zhu, Anjie, Zhang, Peng-Fei, Qiu, Ruihong, Zheng, Zetao, Huang, Zi and Shao, Jie (2024). Abstract and explore: a novel behavioral metric with cyclic dynamics in reinforcement learning. 38th AAAI Conference on Artificial Intelligence (AAAI) / 36th Conference on Innovative Applications of Artificial Intelligence / 14th Symposium on Educational Advances in Artificial Intelligence, Vancouver, Canada, 20-27 February 2024. Palo Alto, CA, United States: Association for the Advancement of Artificial Intelligence. doi: 10.1609/aaai.v38i15.29660

2024

Conference Publication

Event-content-oriented dialogue generation in short video

Cheng, Fenghua, Li, Xue, Huang, Zi, Wang, Jinxiang and Wang, Sen (2024). Event-content-oriented dialogue generation in short video. 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Mexico City, Mexico, 16-21 June 2024. Kerrville, TX, United States: Association for Computational Linguistics (ACL). doi: 10.18653/v1/2024.naacl-long.229

2024

Conference Publication

CaseGNN: graph neural networks for legal case retrieval with text-attributed graphs

Tang, Yanran, Qiu, Ruihong, Liu, Yilun, Li, Xue and Huang, Zi (2024). CaseGNN: graph neural networks for legal case retrieval with text-attributed graphs. 46th European Conference on Information Retrieval, ECIR 2024, Glasgow, United Kingdom, 24 - 28 March 2024. Cham, Switzerland: Springer Nature Switzerland. doi: 10.1007/978-3-031-56060-6_6

2023

Conference Publication

Multi-head Siamese prototype learning against both data and label corruption

Zhang, Peng-Fei and Huang, Zi Helen (2023). Multi-head Siamese prototype learning against both data and label corruption. MMAsia '23: 5th ACM International Conference on Multimedia in Asia, Tainan, Taiwan, 6-8 December 2023. New York, NY, United States: ACM. doi: 10.1145/3595916.3626435

2023

Conference Publication

Learning efficient unsupervised satellite image-based building damage detection

Zhang, Yiyun, Wang, Zijian, Luo, Yadan, Yu, Xin and Huang, Zi (2023). Learning efficient unsupervised satellite image-based building damage detection. 2023 IEEE International Conference on Data Mining (ICDM), Shanghai, China, 1-4 December 2023. Piscataway, NJ, United States: IEEE. doi: 10.1109/icdm58522.2023.00206

2023

Conference Publication

CaT: balanced continual graph learning with graph condensation

Liu, Yilun, Qiu, Ruihong and Huang, Zi (2023). CaT: balanced continual graph learning with graph condensation. 23rd IEEE International Conference on Data Mining (IEEE ICDM), Shanghai, China, 1-4 December 2023. Piscataway, NJ, United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/icdm58522.2023.00141

2023

Conference Publication

Toward a unified framework for RGB and RGB-D visual navigation

Du, Heming, Huang, Zi, Chapman, Scott and Yu, Xin (2023). Toward a unified framework for RGB and RGB-D visual navigation. 36th Australasian Joint Conference on Artificial Intelligence, AJCAI 2023, Brisbane, QLD, Australia, 28 November – 1 December 2023. Singapore: Springer. doi: 10.1007/978-981-99-8391-9_29

2023

Conference Publication

Cal-SFDA: Source-free domain-adaptive semantic segmentation with differentiable expected calibration error

Wang, Zixin, Luo, Yadan, Chen, Zhi, Wang, Sen and Huang, Zi (2023). Cal-SFDA: Source-free domain-adaptive semantic segmentation with differentiable expected calibration error. MM '23: The 31st ACM International Conference on Multimedia, Ottawa, ON, Canada, 29 October - 3 November 2023. New York, NY, United States: Association for Computing Machinery. doi: 10.1145/3581783.3611808

2023

Conference Publication

Zero-shot learning by harnessing adversarial samples

Chen, Zhi, Zhang, Pengfei, Li, Jingjing, Wang, Sen and Huang, Zi (2023). Zero-shot learning by harnessing adversarial samples. MM '23: The 31st ACM International Conference on Multimedia, Ottawa, ON, Canada, 29 October - 3 November 2023. New York, NY, United States: Association for Computing Machinery. doi: 10.1145/3581783.3611823

2023

Conference Publication

How Far Pre-trained Models Are from Neural Collapse on the Target Dataset Informs their Transferability

Wang, Zijian, Luo, Yadan, Zheng, Liang, Huang, Zi and Baktashmotlagh, Mahsa (2023). How Far Pre-trained Models Are from Neural Collapse on the Target Dataset Informs their Transferability. IEEE/CVF International Conference on Computer Vision 2023 (ICCV), Paris, France, 2-6 October 2023. Paris, France: Computer Vision Foundation. doi: 10.1109/iccv51070.2023.00511
