Expert publications - About - The University of Queensland

All (121) Journal Article (42) Other Outputs (1) Edited Outputs (3) Conference Publication (74) Book Chapter (1)

2026

Journal Article

Understanding the Effects of Projectors in Knowledge Distillation

Chen, Yudong, Wang, Sen, Liu, Jiajun, Xu, Xuwei, Hoog, Frank de, Kusy, Brano and Huang, Zi (2026). Understanding the Effects of Projectors in Knowledge Distillation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1-16. doi: 10.1109/tpami.2026.3677028

Understanding the Effects of Projectors in Knowledge Distillation

2026

Journal Article

Distributed Zero-Shot Learning for Visual Recognition

Chen, Zhi, Luo, Yadan, Huang, Zi, Li, Jingjing, Wang, Sen and Yu, Xin (2026). Distributed Zero-Shot Learning for Visual Recognition. IEEE Transactions on Multimedia, PP (99), 1-12. doi: 10.1109/TMM.2026.3673561

Distributed Zero-Shot Learning for Visual Recognition

2025

Conference Publication

MARCO: a cooperative knowledge transfer framework for personalized cross-domain recommendations

Xie, Lili, Zhang, Yi, Qiu, Ruihong, Liu, Jiajun and Wang, Sen (2025). MARCO: a cooperative knowledge transfer framework for personalized cross-domain recommendations. 2025 Annual International ACM SIGIR Conference on Research and Development in Information Retrieval in the Asia Pacific Region, Xi'an, China, 7-10 December 2025. New York, NY United States: Association for Computing Machinery. doi: 10.1145/3767695.3769481

MARCO: a cooperative knowledge transfer framework for personalized cross-domain recommendations

2025

Conference Publication

Is Less More? Exploring Token Condensation as Training-Free Test-Time Adaptation

Wang, Zixin, Gong, Dong, Wang, Sen, Huang, Zi and Luo, Yadan (2025). Is Less More? Exploring Token Condensation as Training-Free Test-Time Adaptation. IEEE. doi: 10.1109/iccv51701.2025.00021

Is Less More? Exploring Token Condensation as Training-Free Test-Time Adaptation

2025

Conference Publication

Quantifying and narrowing the unknown: interactive text-to-video retrieval via uncertainty minimization

Zhang, Bingqing, Cao, Zhuo, Du, Heming, Li, Yang, Li, Xue, Liu, Jiajun and Wang, Sen (2025). Quantifying and narrowing the unknown: interactive text-to-video retrieval via uncertainty minimization. 2025 IEEE/CVF International Conference on Computer Vision (ICCV), Honolulu, HI, United States, 19-25 October 2025. Piscataway, NJ, United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/iccv51701.2025.02054

Quantifying and narrowing the unknown: interactive text-to-video retrieval via uncertainty minimization

2025

Conference Publication

MIPO: mutual integration of patient journey and medical ontology for healthcare representation learning

Peng, Xueping, Long, Guodong, Shen, Tao, Wang, Sen, Zhang, Chengqi, Clarke, Allison and Schlegel, Clement (2025). MIPO: mutual integration of patient journey and medical ontology for healthcare representation learning. 2025 International Joint Conference on Neural Networks (IJCNN), Rome, Italy, 30 June-5 July 2025. Piscataway, NJ, United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/ijcnn64981.2025.11228235

MIPO: mutual integration of patient journey and medical ontology for healthcare representation learning

2025

Conference Publication

Building efficient segmentation models from large open-vocabulary foundation models without any labels

Li, Yang, Chen, Diqi, Wang, Sen, Kusy, Brano and Liu, Jiajun (2025). Building efficient segmentation models from large open-vocabulary foundation models without any labels. 2025 International Joint Conference on Neural Networks (IJCNN), Rome, Italy, 30 June-5 July 2025. Piscataway, NJ, United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/ijcnn64981.2025.11228002

Building efficient segmentation models from large open-vocabulary foundation models without any labels

2025

Conference Publication

FlashVTG: feature layering and adaptive score handling network for video temporal grounding

Cao, Zhuo, Zhang, Bingqing, Du, Heming, Yu, Xin, Li, Xue and Wang, Sen (2025). FlashVTG: feature layering and adaptive score handling network for video temporal grounding. 2025 Winter Conference on Applications of Computer Vision-WACV, Tucson, AZ, United States, 28 February-4 March 2025. Piscataway, NJ, United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/wacv61041.2025.00894

FlashVTG: feature layering and adaptive score handling network for video temporal grounding

2025

Conference Publication

TokenBinder: Text-Video Retrieval with One-to-Many Alignment Paradigm

Zhang, Bingqing, Cao, Zhuo, Du, Heming, Yu, Xin, Li, Xue, Liu, Jiajun and Wang, Sen (2025). TokenBinder: Text-Video Retrieval with One-to-Many Alignment Paradigm. 2025 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Tucson, AZ United States, 26 February - 6 March 2025. Piscataway, NJ United States: IEEE. doi: 10.1109/wacv61041.2025.00485

TokenBinder: Text-Video Retrieval with One-to-Many Alignment Paradigm

2024

Conference Publication

EMIT - event-based masked auto encoding for irregular time series

Patel, Hrishikesh, Qiu, Ruihong, Irwin, Adam, Sadiq, Shazia and Wang, Sen (2024). EMIT - event-based masked auto encoding for irregular time series. 2024 IEEE International Conference on Data Mining (ICDM), Abu Dhabi, United Arab Emirates, 9-12 December 2024. Piscataway, NJ, United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/icdm59182.2024.00044

EMIT - event-based masked auto encoding for irregular time series

2024

Conference Publication

Navigating beyond instructions: vision-and-language navigation in obstructed environments

Hong, Haodong, Wang, Sen, Huang, Zi, Wu, Qi and Liu, Jiajun (2024). Navigating beyond instructions: vision-and-language navigation in obstructed environments. MM '24: The 32nd ACM International Conference on Multimedia, Melbourne, VIC, Australia, 28 October-1 November 2024. New York, United States: Association for Computing Machinery. doi: 10.1145/3664647.3681640

Navigating beyond instructions: vision-and-language navigation in obstructed environments

2024

Conference Publication

DPO: dual-perturbation optimization for test-time adaptation in 3D object detection

Chen, Zhuoxiao, Wang, Zixin, Luo, Yadan, Wang, Sen and Huang, Zi (2024). DPO: dual-perturbation optimization for test-time adaptation in 3D object detection. MM '24: The 32nd ACM International Conference on Multimedia, Melbourne, VIC, Australia, 28 October-1 November 2024. New York, United States: Association for Computing Machinery. doi: 10.1145/3664647.3681040

DPO: dual-perturbation optimization for test-time adaptation in 3D object detection

2024

Conference Publication

ROLeR: effective Reward Shaping in Offline Reinforcement Learning for Recommender Systems

Zhang, Yi, Qiu, Ruihong, Liu, Jiajun and Wang, Sen (2024). ROLeR: effective Reward Shaping in Offline Reinforcement Learning for Recommender Systems. 33rd ACM International Conference on Information and Knowledge Management (CIKM), Boise, ID USA, 21-25 October 2024. New York, NY USA: Association for Computing Machinery. doi: 10.1145/3627673.3679633

ROLeR: effective Reward Shaping in Offline Reinforcement Learning for Recommender Systems

2024

Journal Article

In search of lost online test-time adaptation: a survey

Wang, Zixin, Luo, Yadan, Zheng, Liang, Chen, Zhuoxiao, Wang, Sen and Huang, Zi (2024). In search of lost online test-time adaptation: a survey. International Journal of Computer Vision, 133 (3), 1106-1139. doi: 10.1007/s11263-024-02213-5

In search of lost online test-time adaptation: a survey

2024

Conference Publication

GTP-ViT: efficient vision transformers via graph-based token propagation

Xu, Xuwei, Wang, Sen, Chen, Yudong, Zheng, Yanping, Wei, Zhewei and Liu, Jiajun (2024). GTP-ViT: efficient vision transformers via graph-based token propagation. 2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI, United States, 3-8 January 2024. Piscataway, NJ, United States: IEEE. doi: 10.1109/wacv57701.2024.00016

GTP-ViT: efficient vision transformers via graph-based token propagation

2024

Book Chapter

Towards cost-efficient federated multi-agent RL with learnable aggregation

Zhang, Yi, Wang, Sen, Chen, Zhi, Xu, Xuwei, Funiak, Stano and Liu, Jiajun (2024). Towards cost-efficient federated multi-agent RL with learnable aggregation. Advances in knowledge discovery and data mining. (pp. 171-183) Heidelberg, Germany: Springer. doi: 10.1007/978-981-97-2253-2_14

Towards cost-efficient federated multi-agent RL with learnable aggregation

2024

Conference Publication

Event-content-oriented dialogue generation in short video

Cheng, Fenghua, Li, Xue, Huang, Zi, Wang, Jinxiang and Wang, Sen (2024). Event-content-oriented dialogue generation in short video. 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Mexico City, Mexico, 16-21 June 2024. Kerrville, TX, United States: Association for Computational Linguistics (ACL). doi: 10.18653/v1/2024.naacl-long.229

Event-content-oriented dialogue generation in short video

2024

Conference Publication

Why only text: empowering vision-and-language navigation with multi-modal prompts

Hong, Haodong, Wang, Sen, Huang, Zi, Wu, Qi and Liu, Jiajun (2024). Why only text: empowering vision-and-language navigation with multi-modal prompts. 33rd International Joint Conference on Artificial Intelligence (IJCAI), Jeju, South Korea, 3-9 August 2024. Palo Alto, CA, United States: AAAI Press. doi: 10.24963/ijcai.2024/93

Why only text: empowering vision-and-language navigation with multi-modal prompts

2023

Conference Publication

No token left behind: efficient vision transformer via dynamic token idling

Xu, Xuwei, Li, Changlin, Chen, Yudong, Chang, Xiaojun, Liu, Jiajun and Wang, Sen (2023). No token left behind: efficient vision transformer via dynamic token idling. 36th Australasian Joint Conference on Artificial Intelligence, Brisbane, QLD Australia, 28 November-1 December 2023. Singapore: Springer. doi: 10.1007/978-981-99-8388-9_3

No token left behind: efficient vision transformer via dynamic token idling

2023

Conference Publication

Zero-shot learning by harnessing adversarial samples

Chen, Zhi, Zhang, Pengfei, Li, Jingjing, Wang, Sen and Huang, Zi (2023). Zero-shot learning by harnessing adversarial samples. MM '23: The 31st ACM International Conference on Multimedia, Ottawa, ON Canada, 29 October - 3 November 2023. New York, NY United States: Association for Computing Machinery. doi: 10.1145/3581783.3611823

Zero-shot learning by harnessing adversarial samples

Understanding the Effects of Projectors in Knowledge Distillation

Distributed Zero-Shot Learning for Visual Recognition

MARCO: a cooperative knowledge transfer framework for personalized cross-domain recommendations

Is Less More? Exploring Token Condensation as Training-Free Test-Time Adaptation

Quantifying and narrowing the unknown: interactive text-to-video retrieval via uncertainty minimization

MIPO: mutual integration of patient journey and medical ontology for healthcare representation learning

Building efficient segmentation models from large open-vocabulary foundation models without any labels

FlashVTG: feature layering and adaptive score handling network for video temporal grounding

TokenBinder: Text-Video Retrieval with One-to-Many Alignment Paradigm

EMIT - event-based masked auto encoding for irregular time series

Navigating beyond instructions: vision-and-language navigation in obstructed environments

DPO: dual-perturbation optimization for test-time adaptation in 3D object detection

ROLeR: effective Reward Shaping in Offline Reinforcement Learning for Recommender Systems

In search of lost online test-time adaptation: a survey

GTP-ViT: efficient vision transformers via graph-based token propagation

Towards cost-efficient federated multi-agent RL with&nbsp;learnable aggregation

Event-content-oriented dialogue generation in short video

Why only text: empowering vision-and-language navigation with multi-modal prompts

No token left behind: efficient vision transformer via&nbsp;dynamic token idling

Zero-shot learning by harnessing adversarial samples

Towards cost-efficient federated multi-agent RL with learnable aggregation

No token left behind: efficient vision transformer via dynamic token idling