2024 Book Chapter LLM as Copilot for Coarse-Grained Vision-and-Language NavigationQiao, Yanyuan, Liu, Qianyi, Liu, Jiajun, Liu, Jing and Wu, Qi (2024). LLM as Copilot for Coarse-Grained Vision-and-Language Navigation. Lecture Notes in Computer Science. (pp. 459-476) Cham: Springer Nature Switzerland. doi: 10.1007/978-3-031-72652-1_27 |
2024 Conference Publication BuildingSage: A safe and secure AI copilot for smart buildingsDedeoglu, Volkan, Zhang, Qianggong, Li, Yang, Liu, Jiajun and Sethuvenkatraman, Subbu (2024). BuildingSage: A safe and secure AI copilot for smart buildings. New York, NY, USA: ACM. doi: 10.1145/3671127.3699677 |
2024 Conference Publication Navigating Beyond Instructions: Vision-and-Language Navigation in Obstructed EnvironmentsHong, Haodong, Wang, Sen, Huang, Zi, Wu, Qi and Liu, Jiajun (2024). Navigating Beyond Instructions: Vision-and-Language Navigation in Obstructed Environments. New York, NY, USA: ACM. doi: 10.1145/3664647.3681640 |
2024 Conference Publication ROLeR: effective Reward Shaping in Offline Reinforcement Learning for Recommender SystemsZhang, Yi, Qiu, Ruihong, Liu, Jiajun and Wang, Sen (2024). ROLeR: effective Reward Shaping in Offline Reinforcement Learning for Recommender Systems. 33rd ACM International Conference on Information and Knowledge Management (CIKM), Boise, ID USA, 21-25 October 2024. New York, NY USA: Association for Computing Machinery. doi: 10.1145/3627673.3679633 |
2024 Conference Publication GTP-ViT: efficient vision transformers via graph-based token propagationXu, Xuwei, Wang, Sen, Chen, Yudong, Zheng, Yanping, Wei, Zhewei and Liu, Jiajun (2024). GTP-ViT: efficient vision transformers via graph-based token propagation. 2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI, United States, 3-8 January 2024. Piscataway, NJ, United States: IEEE. doi: 10.1109/wacv57701.2024.00016 |
2024 Book Chapter Towards cost-efficient federated multi-agent RL with learnable aggregationZhang, Yi, Wang, Sen, Chen, Zhi, Xu, Xuwei, Funiak, Stano and Liu, Jiajun (2024). Towards cost-efficient federated multi-agent RL with learnable aggregation. Advances in knowledge discovery and data mining. (pp. 171-183) Heidelberg, Germany: Springer. doi: 10.1007/978-981-97-2253-2_14 |
2023 Conference Publication No token left behind: efficient vision transformer via dynamic token idlingXu, Xuwei, Li, Changlin, Chen, Yudong, Chang, Xiaojun, Liu, Jiajun and Wang, Sen (2023). No token left behind: efficient vision transformer via dynamic token idling. 36th Australasian Joint Conference on Artificial Intelligence, Brisbane, QLD Australia, 28 November-1 December 2023. Singapore, Singapore: Springer Nature Singapore. doi: 10.1007/978-981-99-8388-9_3 |
2023 Conference Publication SkySea: connecting satellite, UAV and underwater imagery for benthic habitat mappingDo, Brendan, Liu, Jiajun, Wang, Ziwei, Kusy, Brano, Merz, Torsten, Steven, Andy, Carlin, Geoffrey, Crosswell, Joseph, Li, Yang, Mortimer, Nicholas, Nayyeri, Fereshteh, Vanderklift, Mat and Wilson, Mark (2023). SkySea: connecting satellite, UAV and underwater imagery for benthic habitat mapping. 31st ACM International Conference on Multimedia, Ottawa, ON Canada, 2 November 2023. New York, NY USA: Association for Computing Machinery. doi: 10.1145/3607834.3616570 |
2023 Conference Publication Object detection difficulty: suppressing over-aggregation for faster and better video object detectionZhang, Bingqing, Wang, Sen, Liu, Yifan, Kusy, Brano, Li, Xue and Liu, Jiajun (2023). Object detection difficulty: suppressing over-aggregation for faster and better video object detection. 31st ACM International Conference on Multimedia, MM 2023, Ottawa, ON Canada, 29 October - 3 November 2023. New York, NY United States: Association for Computing Machinery. doi: 10.1145/3581783.3612090 |
2023 Conference Publication OCHID-Fi: Occlusion-Robust Hand Pose Estimation in 3D via RF-VisionZhang, Shujie, Zheng, Tianyue, Chen, Zhe, Hu, Jingzhi, Khamis, Abdelwahed, Liu, Jiajun and Luo, Jun (2023). OCHID-Fi: Occlusion-Robust Hand Pose Estimation in 3D via RF-Vision. 2023 IEEE/CVF International Conference on Computer Vision (ICCV), Paris, France, 1-6 October 2023. Piscataway, NJ United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/iccv51070.2023.01387 |
2022 Conference Publication InvisibiliTee: Angle-Agnostic Cloaking from Person-Tracking Systems with a TeeLi, Yaxian, Zhang, Bingqing, Zhao, Guoping, Zhang, Mingyu, Liu, Jiajun, Wang, Ziwei and Wen, Jirong (2022). InvisibiliTee: Angle-Agnostic Cloaking from Person-Tracking Systems with a Tee. 31st International Conference on Artificial Neural Networks, ICANN 2022, Bristol, United Kingdom, 6–9 September 2022. Cham, Switzerland: Springer. doi: 10.1007/978-3-031-15934-3_14 |
2022 Conference Publication Integrating Dependency Tree into Self-Attention for Sentence RepresentationMa, Junhua, Li, Jiajun, Liu, Yuxuan, Zhou, Shangbo and Li, Xue (2022). Integrating Dependency Tree into Self-Attention for Sentence Representation. ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Singapore, 23-27 May 2022. Piscataway, NJ United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/ICASSP43922.2022.9747221 |
2021 Conference Publication PathSAGE: Spatial Graph Attention Neural Networks with Random Path SamplingMa, Junhua, Li, Jiajun, Li, Xueming and Li, Xu (2021). PathSAGE: Spatial Graph Attention Neural Networks with Random Path Sampling. 28th International Conference on Neural Information Processing ICONIP 2021, Bali, Indonesia, 8–12 December 2021. Heidelberg, Germany: Springer. doi: 10.1007/978-3-030-92270-2_10 |
2013 Journal Article On the influence propagation of web videosLiu, Jiajun, Yang, Yi, Huang, Zi, Yang, Yang and Shen, Heng Tao (2013). On the influence propagation of web videos. IEEE Transactions on Knowledge and Data Engineering, 26 (99) 6583164, 1961-1973. doi: 10.1109/TKDE.2013.142 |
2013 Journal Article Near-duplicate video retrieval: current research and future trendsLiu, Jiajun, Huang, Zi, Cai, Hongyun, Shen, Heng Tao, Ngo, Chong-wah and Wang, Wei (2013). Near-duplicate video retrieval: current research and future trends. ACM Computing Surveys, 45 (4) 2501658, 44.1-44.23. doi: 10.1145/2501654.2501658 |
2013 Journal Article Local image tagging via graph regularized joint group sparsityYang, Yang, Huang, Zi, Yang, Yi, Liu, Jiajun, Shen, Heng Tao and Luo, Jiebo (2013). Local image tagging via graph regularized joint group sparsity. Pattern Recognition, 46 (5), 1358-1368. doi: 10.1016/j.patcog.2012.10.026 |
2013 Journal Article A gram-based string paradigm for efficient video subsequence searchHuang, Zi, Liu, Jiajun, Cui, Bin and Du, Xiaoyong (2013). A gram-based string paradigm for efficient video subsequence search. IEEE Transactions On Multimedia, 15 (3) 6392966, 608-620. doi: 10.1109/TMM.2012.2236307 |
2013 Conference Publication Presenting diverse location views with real-time near-duplicate photo eliminationLiu, Jiajun, Huang, Zi, Cheng, Hong, Chen, Yueguo, Shen, Heng Tao and Zhang, Yanchun (2013). Presenting diverse location views with real-time near-duplicate photo elimination. 29th IEEE International Conference on Data Engineering (ICDE), Brisbane, Australia, 8-12 April 2013. Washington, United States: IEEE. doi: 10.1109/ICDE.2013.6544851 |
2012 Conference Publication Robust cross-media transfer for visual event detectionYang, Yang, Yang, Yi, Huang, Zi, Liu, Jianjun and Ma, Zhigang (2012). Robust cross-media transfer for visual event detection. 20th ACM International Conference on Multimedia (MM'12), Nara, Japan, 29 October - 2 November 2012. New York, United States: ACM. doi: 10.1145/2393347.2396379 |
2012 Conference Publication Discovering areas of interest with geo-tagged images and check-insLiu, Jiajun, Huang, Zi, Chen, Lei, Shen, Heng Tao and Yan, Zhixian (2012). Discovering areas of interest with geo-tagged images and check-ins. 20th ACM International Conference on Multimedia (MM'12), Nara, Japan, 29 October - 2 November 2012. New York, United States: ACM. doi: 10.1145/2393347.2393429 |