2025

Conference Publication

MARCO: a cooperative knowledge transfer framework for personalized cross-domain recommendations

Xie, Lili, Zhang, Yi, Qiu, Ruihong, Liu, Jiajun and Wang, Sen (2025). MARCO: a cooperative knowledge transfer framework for personalized cross-domain recommendations. 2025 Annual International ACM SIGIR Conference on Research and Development in Information Retrieval in the Asia Pacific Region, Xi'an, China, 7-10 December 2025. New York, NY, United States: Association for Computing Machinery. doi: 10.1145/3767695.3769481

2025

Conference Publication

OmniRestore: robust universal image restoration from combined and unspecified degradations

Karnavar, Anjusree, Li, Yang, Liu, Jiajun, Zhou, Jun and Wang, Junhu (2025). OmniRestore: robust universal image restoration from combined and unspecified degradations. 2025 IEEE International Conference on Multimedia and Expo (ICME), Nantes, France, 30 October 2025. New York, NY, United States: IEEE Computer Society. doi: 10.1109/ICME59968.2025.11209899

2025

Conference Publication

Queryable 3D Scene Representation: A Multi-Modal Framework for Semantic Reasoning and Robotic Task Planning

Li, Xun, Santa Cruz, Rodrigo, Xi, Mingze, Zhang, Hu, Perera, Madhawa, Wang, Ziwei, Ravendran, Ahalya, Matthews, Brandon J., Xu, Feng, Adcock, Matt, Wang, Dadong and Liu, Jiajun (2025). Queryable 3D Scene Representation: A Multi-Modal Framework for Semantic Reasoning and Robotic Task Planning. New York, NY, USA: ACM. doi: 10.1145/3746027.3758177

2025

Conference Publication

RePaViT: scalable vision transformer acceleration via structural reparameterization on feedforward network layers

Xu, Xuwei, Li, Yang, Chen, Yudong, Liu, Jiajun and Wang, Sen (2025). RePaViT: scalable vision transformer acceleration via structural reparameterization on feedforward network layers. International Conference on Machine Learning, Vancouver, Canada, 13-19 July 2025. San Diego, CA, United States: ICML.

2025

Conference Publication

DARLR: Dual-agent offline reinforcement learning for recommender systems with dynamic reward

Zhang, Yi, Qiu, Ruihong, Xu, Xuwei, Liu, Jiajun and Wang, Sen (2025). DARLR: Dual-agent offline reinforcement learning for recommender systems with dynamic reward. 48th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (ACM SIGIR 2025), Padua, Italy, 13-18 July 2025. New York, NY, United States: Association for Computing Machinery. doi: 10.1145/3726302.3729942

2025

Conference Publication

Building efficient segmentation models from large open-vocabulary foundation models without any labels

Li, Yang, Chen, Diqi, Wang, Sen, Kusy, Brano and Liu, Jiajun (2025). Building efficient segmentation models from large open-vocabulary foundation models without any labels. 2025 International Joint Conference on Neural Networks (IJCNN), Rome, Italy, 30 June-5 July 2025. Piscataway, NJ, United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/ijcnn64981.2025.11228002

2025

Conference Publication

TokenBinder: Text-Video Retrieval with One-to-Many Alignment Paradigm

Zhang, Bingqing, Cao, Zhuo, Du, Heming, Yu, Xin, Li, Xue, Liu, Jiajun and Wang, Sen (2025). TokenBinder: Text-Video Retrieval with One-to-Many Alignment Paradigm. 2025 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Tucson, AZ, United States, 26 February-6 March 2025. Piscataway, NJ, United States: IEEE. doi: 10.1109/wacv61041.2025.00485

2025

Conference Publication

On-the-fly object-aware representative point selection in point cloud

Zhang, Xiaoyu, Wang, Ziwei, Dong, Hai, Bao, Zhifeng and Liu, Jiajun (2025). On-the-fly object-aware representative point selection in point cloud. 2025 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Tucson, AZ, United States, 26 February-6 March 2025. Piscataway, NJ, United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/wacv61041.2025.00174

2025

Conference Publication

Effective tuning strategies for generalist robot manipulation policies

Zhang, Wenbo, Li, Yang, Qiao, Yanyuan, Huang, Siyuan, Liu, Jiajun, Dayoub, Feras, Ma, Xiao and Liu, Lingqiao (2025). Effective tuning strategies for generalist robot manipulation policies. 2025 International Conference on Robotics and Automation (ICRA), Atlanta, GA, United States, 19-23 May 2025. Washington, DC, United States: IEEE Computer Society. doi: 10.1109/ICRA55743.2025.11127492

2025

Conference Publication

LLM as copilot for coarse-grained vision-and-language navigation

Qiao, Yanyuan, Liu, Qianyi, Liu, Jiajun, Liu, Jing and Wu, Qi (2025). LLM as copilot for coarse-grained vision-and-language navigation. Computer Vision – ECCV 2024: 18th European Conference, Proceedings, Part V, Milan, Italy, 29 September-4 October 2024. Heidelberg, Germany: Springer. doi: 10.1007/978-3-031-72652-1_27

2025

Conference Publication

Beyond static LLM policies: imitation-enhanced reinforcement learning for recommendation

Zhang, Yi, Xie, Lili, Qiu, Ruihong, Liu, Jiajun and Wang, Sen (2025). Beyond static LLM policies: imitation-enhanced reinforcement learning for recommendation. 2025 IEEE International Conference on Data Mining (ICDM), Washington, DC, United States, 12-15 November 2025. Piscataway, NJ, United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/ICDM65498.2025.00098

2024

Conference Publication

BuildingSage: a safe and secure AI copilot for smart buildings

Dedeoglu, Volkan, Zhang, Qianggong, Li, Yang, Liu, Jiajun and Sethuvenkatraman, Subbu (2024). BuildingSage: a safe and secure AI copilot for smart buildings. BuildSys '24: The 11th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation, Hangzhou, China, 7-8 November 2024. New York, NY, United States: Association for Computing Machinery. doi: 10.1145/3671127.3699677

2024

Conference Publication

Navigating beyond instructions: vision-and-language navigation in obstructed environments

Hong, Haodong, Wang, Sen, Huang, Zi, Wu, Qi and Liu, Jiajun (2024). Navigating beyond instructions: vision-and-language navigation in obstructed environments. MM '24: The 32nd ACM International Conference on Multimedia, Melbourne, VIC, Australia, 28 October-1 November 2024. New York, United States: Association for Computing Machinery. doi: 10.1145/3664647.3681640

2024

Conference Publication

ROLeR: effective Reward Shaping in Offline Reinforcement Learning for Recommender Systems

Zhang, Yi, Qiu, Ruihong, Liu, Jiajun and Wang, Sen (2024). ROLeR: effective Reward Shaping in Offline Reinforcement Learning for Recommender Systems. 33rd ACM International Conference on Information and Knowledge Management (CIKM), Boise, ID, United States, 21-25 October 2024. New York, NY, United States: Association for Computing Machinery. doi: 10.1145/3627673.3679633

2024

Conference Publication

Real-time Multi-modal Object Detection and Tracking on Edge for Regulatory Compliance Monitoring

Lim, Jia Syuen, Wang, Ziwei, Liu, Jiajun, Khamis, Abdelwahed, Arablouei, Reza, Barlow, Robert and McAllister, Ryan (2024). Real-time Multi-modal Object Detection and Tracking on Edge for Regulatory Compliance Monitoring. 33rd International Joint Conference on Artificial Intelligence (IJCAI), Jeju, South Korea, 3-9 August 2024. Freiburg, Germany: IJCAI. doi: 10.24963/ijcai.2024/1018

2024

Conference Publication

Edge deployable online domain adaptation for underwater object detection

Etchegaray, Djamahl, Luo, Yadan, Li, Yang, Do, Brendan, Liu, Jiajun, Huang, Zi and Kusy, Branislav (2024). Edge deployable online domain adaptation for underwater object detection. 2024 International Joint Conference on Neural Networks (IJCNN), Yokohama, Japan, 30 June - 5 July 2024. Piscataway, NJ, United States: IEEE. doi: 10.1109/ijcnn60899.2024.10650705

2024

Conference Publication

DynAmic Token Pruning in plain vision transformers for semantic segmentation

Tang, Quan, Zhang, Bowen, Liu, Jiajun, Liu, Fagui and Liu, Yifan (2024). DynAmic Token Pruning in plain vision transformers for semantic segmentation. 2023 IEEE/CVF International Conference on Computer Vision (ICCV), Paris, France, 1-6 October 2023. Piscataway, NJ, United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/ICCV51070.2023.00078

2024

Conference Publication

GTP-ViT: efficient vision transformers via graph-based token propagation

Xu, Xuwei, Wang, Sen, Chen, Yudong, Zheng, Yanping, Wei, Zhewei and Liu, Jiajun (2024). GTP-ViT: efficient vision transformers via graph-based token propagation. 2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI, United States, 3-8 January 2024. Piscataway, NJ, United States: IEEE. doi: 10.1109/wacv57701.2024.00016

2024

Conference Publication

Why only text: empowering vision-and-language navigation with multi-modal prompts

Hong, Haodong, Wang, Sen, Huang, Zi, Wu, Qi and Liu, Jiajun (2024). Why only text: empowering vision-and-language navigation with multi-modal prompts. 33rd International Joint Conference on Artificial Intelligence (IJCAI), Jeju, South Korea, 3-9 August 2024. Palo Alto, CA, United States: AAAI Press. doi: 10.24963/ijcai.2024/93

2023

Conference Publication

No token left behind: efficient vision transformer via dynamic token idling

Xu, Xuwei, Li, Changlin, Chen, Yudong, Chang, Xiaojun, Liu, Jiajun and Wang, Sen (2023). No token left behind: efficient vision transformer via dynamic token idling. 36th Australasian Joint Conference on Artificial Intelligence, Brisbane, QLD, Australia, 28 November-1 December 2023. Singapore: Springer. doi: 10.1007/978-981-99-8388-9_3
