Skip to menu Skip to content Skip to footer

2026

Journal Article

Distributed Zero-Shot Learning for Visual Recognition

Chen, Zhi, Luo, Yadan, Huang, Zi, Li, Jingjing, Wang, Sen and Yu, Xin (2026). Distributed Zero-Shot Learning for Visual Recognition. IEEE Transactions on Multimedia, 1-12. doi: 10.1109/TMM.2026.3673561

Distributed Zero-Shot Learning for Visual Recognition

2025

Conference Publication

MARCO: a cooperative knowledge transfer framework for personalized cross-domain recommendations

Xie, Lili, Zhang, Yi, Qiu, Ruihong, Liu, Jiajun and Wang, Sen (2025). MARCO: a cooperative knowledge transfer framework for personalized cross-domain recommendations. 2025 Annual International ACM SIGIR Conference on Research and Development in Information Retrieval in the Asia Pacific Region, Xi'an, China, 7-10 December 2025. New York, NY United States: Association for Computing Machinery. doi: 10.1145/3767695.3769481

MARCO: a cooperative knowledge transfer framework for personalized cross-domain recommendations

2025

Conference Publication

MIPO: mutual integration of patient journey and medical ontology for healthcare representation learning

Peng, Xueping, Long, Guodong, Shen, Tao, Wang, Sen, Zhang, Chengqi, Clarke, Allison and Schlegel, Clement (2025). MIPO: mutual integration of patient journey and medical ontology for healthcare representation learning. 2025 International Joint Conference on Neural Networks (IJCNN), Rome, Italy, 30 June-5 July 2025. Piscataway, NJ, United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/ijcnn64981.2025.11228235

MIPO: mutual integration of patient journey and medical ontology for healthcare representation learning

2025

Conference Publication

Building efficient segmentation models from large open-vocabulary foundation models without any labels

Li, Yang, Chen, Diqi, Wang, Sen, Kusy, Brano and Liu, Jiajun (2025). Building efficient segmentation models from large open-vocabulary foundation models without any labels. 2025 International Joint Conference on Neural Networks (IJCNN), Rome, Italy, 30 June-5 July 2025. Piscataway, NJ, United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/ijcnn64981.2025.11228002

Building efficient segmentation models from large open-vocabulary foundation models without any labels

2025

Conference Publication

TokenBinder: Text-Video Retrieval with One-to-Many Alignment Paradigm

Zhang, Bingqing, Cao, Zhuo, Du, Heming, Yu, Xin, Li, Xue, Liu, Jiajun and Wang, Sen (2025). TokenBinder: Text-Video Retrieval with One-to-Many Alignment Paradigm. 2025 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Tucson, AZ United States, 26 February - 6 March 2025. Piscataway, NJ United States: IEEE. doi: 10.1109/wacv61041.2025.00485

TokenBinder: Text-Video Retrieval with One-to-Many Alignment Paradigm

2025

Conference Publication

FlashVTG: feature layering and adaptive score handling network for video temporal grounding

Cao, Zhuo, Zhang, Bingqing, Du, Heming, Yu, Xin, Li, Xue and Wang, Sen (2025). FlashVTG: feature layering and adaptive score handling network for video temporal grounding. 2025 Winter Conference on Applications of Computer Vision-WACV, Tucson, AZ, United States, 28 February-4 March 2025. Piscataway, NJ, United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/wacv61041.2025.00894

FlashVTG: feature layering and adaptive score handling network for video temporal grounding

2024

Conference Publication

EMIT - event-based masked auto encoding for irregular time series

Patel, Hrishikesh, Qiu, Ruihong, Irwin, Adam, Sadiq, Shazia and Wang, Sen (2024). EMIT - event-based masked auto encoding for irregular time series. 2024 IEEE International Conference on Data Mining (ICDM), Abu Dhabi, United Arab Emirates, 9-12 December 2024. Piscataway, NJ, United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/icdm59182.2024.00044

EMIT - event-based masked auto encoding for irregular time series

2024

Conference Publication

Navigating beyond instructions: vision-and-language navigation in obstructed environments

Hong, Haodong, Wang, Sen, Huang, Zi, Wu, Qi and Liu, Jiajun (2024). Navigating beyond instructions: vision-and-language navigation in obstructed environments. MM '24: The 32nd ACM International Conference on Multimedia, Melbourne, VIC, Australia, 28 October-1 November 2024. New York, United States: Association for Computing Machinery. doi: 10.1145/3664647.3681640

Navigating beyond instructions: vision-and-language navigation in obstructed environments

2024

Conference Publication

DPO: dual-perturbation optimization for test-time adaptation in 3D object detection

Chen, Zhuoxiao, Wang, Zixin, Luo, Yadan, Wang, Sen and Huang, Zi (2024). DPO: dual-perturbation optimization for test-time adaptation in 3D object detection. MM '24: The 32nd ACM International Conference on Multimedia, Melbourne, VIC, Australia, 28 October-1 November 2024. New York, United States: Association for Computing Machinery. doi: 10.1145/3664647.3681040

DPO: dual-perturbation optimization for test-time adaptation in 3D object detection

2024

Conference Publication

ROLeR: effective Reward Shaping in Offline Reinforcement Learning for Recommender Systems

Zhang, Yi, Qiu, Ruihong, Liu, Jiajun and Wang, Sen (2024). ROLeR: effective Reward Shaping in Offline Reinforcement Learning for Recommender Systems. 33rd ACM International Conference on Information and Knowledge Management (CIKM), Boise, ID USA, 21-25 October 2024. New York, NY USA: Association for Computing Machinery. doi: 10.1145/3627673.3679633

ROLeR: effective Reward Shaping in Offline Reinforcement Learning for Recommender Systems

2024

Journal Article

In search of lost online test-time adaptation: a survey

Wang, Zixin, Luo, Yadan, Zheng, Liang, Chen, Zhuoxiao, Wang, Sen and Huang, Zi (2024). In search of lost online test-time adaptation: a survey. International Journal of Computer Vision, 133 (3), 1106-1139. doi: 10.1007/s11263-024-02213-5

In search of lost online test-time adaptation: a survey

2024

Conference Publication

GTP-ViT: efficient vision transformers via graph-based token propagation

Xu, Xuwei, Wang, Sen, Chen, Yudong, Zheng, Yanping, Wei, Zhewei and Liu, Jiajun (2024). GTP-ViT: efficient vision transformers via graph-based token propagation. 2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI, United States, 3-8 January 2024. Piscataway, NJ, United States: IEEE. doi: 10.1109/wacv57701.2024.00016

GTP-ViT: efficient vision transformers via graph-based token propagation

2024

Book Chapter

Towards cost-efficient federated multi-agent RL with learnable aggregation

Zhang, Yi, Wang, Sen, Chen, Zhi, Xu, Xuwei, Funiak, Stano and Liu, Jiajun (2024). Towards cost-efficient federated multi-agent RL with learnable aggregation. Advances in knowledge discovery and data mining. (pp. 171-183) Heidelberg, Germany: Springer. doi: 10.1007/978-981-97-2253-2_14

Towards cost-efficient federated multi-agent RL with learnable aggregation

2024

Conference Publication

Event-content-oriented dialogue generation in short video

Cheng, Fenghua, Li, Xue, Huang, Zi, Wang, Jinxiang and Wang, Sen (2024). Event-content-oriented dialogue generation in short video. 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Mexico City, Mexico, 16-21 June 2024. Kerrville, TX, United States: Association for Computational Linguistics (ACL). doi: 10.18653/v1/2024.naacl-long.229

Event-content-oriented dialogue generation in short video

2024

Conference Publication

Why only text: empowering vision-and-language navigation with multi-modal prompts

Hong, Haodong, Wang, Sen, Huang, Zi, Wu, Qi and Liu, Jiajun (2024). Why only text: empowering vision-and-language navigation with multi-modal prompts. 33rd International Joint Conference on Artificial Intelligence (IJCAI), Jeju, South Korea, 3-9 August 2024. Palo Alto, CA, United States: AAAI Press. doi: 10.24963/ijcai.2024/93

Why only text: empowering vision-and-language navigation with multi-modal prompts

2023

Conference Publication

No token left behind: efficient vision transformer via dynamic token idling

Xu, Xuwei, Li, Changlin, Chen, Yudong, Chang, Xiaojun, Liu, Jiajun and Wang, Sen (2023). No token left behind: efficient vision transformer via dynamic token idling. 36th Australasian Joint Conference on Artificial Intelligence, Brisbane, QLD Australia, 28 November-1 December 2023. Singapore: Springer. doi: 10.1007/978-981-99-8388-9_3

No token left behind: efficient vision transformer via dynamic token idling

2023

Conference Publication

Cal-SFDA: Source-free domain-adaptive semantic segmentation with differentiable expected calibration error

Wang, Zixin, Luo, Yadan, Chen, Zhi, Wang, Sen and Huang, Zi (2023). Cal-SFDA: Source-free domain-adaptive semantic segmentation with differentiable expected calibration error. MM '23: The 31st ACM International Conference on Multimedia, Ottawa, ON Canada, 29 October - 3 November 2023. New York, NY United States: Association for Computing Machinery. doi: 10.1145/3581783.3611808

Cal-SFDA: Source-free domain-adaptive semantic segmentation with differentiable expected calibration error

2023

Conference Publication

Zero-shot learning by harnessing adversarial samples

Chen, Zhi, Zhang, Pengfei, Li, Jingjing, Wang, Sen and Huang, Zi (2023). Zero-shot learning by harnessing adversarial samples. MM '23: The 31st ACM International Conference on Multimedia, Ottawa, ON Canada, 29 October - 3 November 2023. New York, NY United States: Association for Computing Machinery. doi: 10.1145/3581783.3611823

Zero-shot learning by harnessing adversarial samples

2023

Conference Publication

Object detection difficulty: suppressing over-aggregation for faster and better video object detection

Zhang, Bingqing, Wang, Sen, Liu, Yifan, Kusy, Brano, Li, Xue and Liu, Jiajun (2023). Object detection difficulty: suppressing over-aggregation for faster and better video object detection. 31st ACM International Conference on Multimedia, MM 2023, Ottawa, ON Canada, 29 October - 3 November 2023. New York, NY United States: Association for Computing Machinery. doi: 10.1145/3581783.3612090

Object detection difficulty: suppressing over-aggregation for faster and better video object detection

2023

Journal Article

Recent applications of machine learning in alloy design: a review

Hu, Mingwei, Tan, Qiyang, Knibbe, Ruth, Xu, Miao, Jiang, Bin, Wang, Sen, Li, Xue and Zhang, Ming-Xing (2023). Recent applications of machine learning in alloy design: a review. Materials Science and Engineering: R: Reports, 155 100746, 100746. doi: 10.1016/j.mser.2023.100746

Recent applications of machine learning in alloy design: a review