Skip to menu Skip to content Skip to footer

2024

Conference Publication

Language-guided multi-modal emotional mimicry intensity estimation

Qiu, Feng, Zhang, Wei, Liu, Chen, Li, Lincheng, Du, Heming, Guo, Tianchen and Yu, Xin (2024). Language-guided multi-modal emotional mimicry intensity estimation. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Seattle, WA, United States, 17-18 June 2024. Piscataway, NJ, United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/cvprw63382.2024.00477

Language-guided multi-modal emotional mimicry intensity estimation

2024

Conference Publication

Learning transferable compound expressions from Masked AutoEncoder pretraining

Qiu, Feng, Du, Heming, Zhang, Wei, Liu, Chen, Li, Lincheng, Guo, Tianchen and Yu, Xin (2024). Learning transferable compound expressions from Masked AutoEncoder pretraining. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Seattle, WA, United States, 17-18 June 2024. Piscataway, NJ, United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/cvprw63382.2024.00476

Learning transferable compound expressions from Masked AutoEncoder pretraining

2024

Conference Publication

DiPEx: Dispersing Prompt Expansion for class-agnostic object detection

Lim, Jia Syuen, Chen, Zhuoxiao, Baktashmotlagh, Mahsa, Chen, Zhi, Yu, Xin, Huang, Zi and Luo, Yadan (2024). DiPEx: Dispersing Prompt Expansion for class-agnostic object detection. 38th International Conference on Neural Information Processing Systems, Vancouver, BC Canada, 10-15 December 2024. New York, NY USA: Association for Computing Machinery.

DiPEx: Dispersing Prompt Expansion for class-agnostic object detection

2024

Conference Publication

When 3D bounding-box meets SAM: point cloud instance segmentation with weak-and-noisy supervision

Yu, Qingtao, Du, Heming, Liu, Chen and Yu, Xin (2024). When 3D bounding-box meets SAM: point cloud instance segmentation with weak-and-noisy supervision. 2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI, United States, 3-8 January 2024. Piscataway, NJ, United States: IEEE. doi: 10.1109/wacv57701.2024.00368

When 3D bounding-box meets SAM: point cloud instance segmentation with weak-and-noisy supervision

2024

Conference Publication

Text-guided 3D face synthesis - from generation to editing

Wu, Yunjie, Meng, Yapeng, Hu, Zhipeng, Li, Lincheng, Wu, Haoqian, Zhou, Kun, Xu, Weiwei and Yu, Xin (2024). Text-guided 3D face synthesis - from generation to editing. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, United States, 16-22 June 2024. Washington, DC, United States: IEEE Computer Society. doi: 10.1109/CVPR52733.2024.00126

Text-guided 3D face synthesis - from generation to editing

2024

Conference Publication

MM-WLAuslan: multi-view multi-modal word-level Australian Sign Language recognition dataset

Shen, Xin, Du, Heming, Sheng, Hongwei, Wang, Shuyun, Chen, Hui, Chen, Huiqiang, Wu, Zhuojie, Du, Xiaobiao, Ying, Jiaying, Lu, Ruihan, Xu, Qingzheng and Yu, Xin (2024). MM-WLAuslan: multi-view multi-modal word-level Australian Sign Language recognition dataset. NeurIPS 2024, Vancouver, BC, Canada, 10 - 15 December 2024. Maryland Heights, MO, United States: Morgan Kaufmann Publishers.

MM-WLAuslan: multi-view multi-modal word-level Australian Sign Language recognition dataset

2024

Conference Publication

MMOOC: a multimodal misinformation dataset for out-of-context news analysis

Xu, Qingzheng, Du, Heming, Chen, Huiqiang, Liu, Bo and Yu, Xin (2024). MMOOC: a multimodal misinformation dataset for out-of-context news analysis. 29th Australasian Conference, ACISP 2024, Sydney, NSW, Australia, 15–17 July 2024. Heidelberg, Germany: Springer. doi: 10.1007/978-981-97-5101-3_24

MMOOC: a multimodal misinformation dataset for out-of-context news analysis

2024

Conference Publication

TPR: Topology-preserving reservoirs for generalized zero-shot learning

Chen, Hui, Liu, Yanbin, Ma, Yongqiang, Zheng, Nanning and Yu, Xin (2024). TPR: Topology-preserving reservoirs for generalized zero-shot learning. NIPS '24: 38th International Conference on Neural Information Processing Systems, Vancouver, BC, Canada, 10-15 December 2024. Maryland Heights, MO, United States: Morgan Kaufmann Publishers.

TPR: Topology-preserving reservoirs for generalized zero-shot learning

2024

Conference Publication

An empirical analysis on spatial reasoning capabilities of large multimodal models

Shiri, Fatemeh, Guo, Xiao-Yu, Far, Mona Golestan, Yu, Xin, Haffari, Gholamreza and Li, Yuan-Fang (2024). An empirical analysis on spatial reasoning capabilities of large multimodal models. 2024 Conference on Empirical Methods in Natural Language Processing, Miami, FL, United States, 12-16 November 2024. Kerrville, TX, United States: Association for Computational Linguistics (ACL). doi: 10.18653/v1/2024.emnlp-main.1195

An empirical analysis on spatial reasoning capabilities of large multimodal models

2024

Conference Publication

EfficientDreamer: high-fidelity and stable 3D creation via orthogonal-view diffusion priors

Hu, Zhipeng, Zhao, Minda, Zhao, Chaoyi, Liang, Xinyue, Li, Lincheng, Zhao, Zeng, Fan, Changjie, Zhou, Xiaowei and Yu, Xin (2024). EfficientDreamer: high-fidelity and stable 3D creation via orthogonal-view diffusion priors. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, United States, 16-22 June 2024. Washington, DC, United States: IEEE Computer Society. doi: 10.1109/CVPR52733.2024.00473

EfficientDreamer: high-fidelity and stable 3D creation via orthogonal-view diffusion priors

2024

Conference Publication

AS-NeRF: learning auxiliary sampling for generalizable novel view synthesis from sparse views

Tang, Jilin, Li, Lincheng, Qi, Xingqun, Chen, Yingfeng, Fan, Changjie and Yu, Xin (2024). AS-NeRF: learning auxiliary sampling for generalizable novel view synthesis from sparse views. 2024 IEEE International Conference on Multimedia and Expo (ICME), Niagara Falls, ON, Canada, 15-19 July 2024. Washington, DC, United States: IEEE Computer Society. doi: 10.1109/ICME57554.2024.10688126

AS-NeRF: learning auxiliary sampling for generalizable novel view synthesis from sparse views

2024

Conference Publication

Benchmarking audio visual segmentation for long-untrimmed videos

Liu, Chen, Li, Peike Patrick, Yu, Qingtao, Sheng, Hongwei, Wang, Dadong, Li, Lincheng and Yu, Xin (2024). Benchmarking audio visual segmentation for long-untrimmed videos. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, United States, 16-22 June 2024. Washington, DC, United States: IEEE Computer Society. doi: 10.1109/CVPR52733.2024.02143

Benchmarking audio visual segmentation for long-untrimmed videos

2023

Conference Publication

Learning efficient unsupervised satellite image-based building damage detection

Zhang, Yiyun, Wang, Zijian, Luo, Yadan, Yu, Xin and Huang, Zi (2023). Learning efficient unsupervised satellite image-based building damage detection. 2023 IEEE International Conference on Data Mining (ICDM), Shanghai, China, 1-4 December 2023. Piscataway, NJ, United States: IEEE. doi: 10.1109/icdm58522.2023.00206

Learning efficient unsupervised satellite image-based building damage detection

2023

Conference Publication

A new perspective of weakly supervised 3D instance segmentation via bounding boxes

Yu, Qingtao, Du, Heming and Yu, Xin (2023). A new perspective of weakly supervised 3D instance segmentation via bounding boxes. 36th Australasian Joint Conference on Artificial Intelligence, AJCAI 2023, Brisbane, QLD Australia, 28 November –1 December 2023. Singapore: Springer. doi: 10.1007/978-981-99-8388-9_9

A new perspective of weakly supervised 3D instance segmentation via bounding boxes

2023

Conference Publication

Context-based masking for spontaneous venous pulsations detection

Sheng, Hongwei, Yu, Xin, Li, Xue and Golzan, Mojtaba (2023). Context-based masking for spontaneous venous pulsations detection. 36th Australasian Joint Conference on Artificial Intelligence, AJCAI 2023, Brisbane, QLD Australia, 28 November –1 December 2023. Singapore: Springer. doi: 10.1007/978-981-99-8388-9_42

Context-based masking for spontaneous venous pulsations detection

2023

Conference Publication

Toward a unified framework for RGB and RGB-D visual navigation

Du, Heming, Huang, Zi, Chapman, Scott and Yu, Xin (2023). Toward a unified framework for RGB and RGB-D visual navigation. 36th Australasian Joint Conference on Artificial Intelligence, AJCAI 2023, Brisbane, QLD Australia, 28 November –1 December 2023. Singapore: Springer. doi: 10.1007/978-981-99-8391-9_29

Toward a unified framework for RGB and RGB-D visual navigation

2023

Conference Publication

Towards reliable and efficient vegetation segmentation for Australian wheat data analysis

Yuan, Bowen, Wang, Zijian and Yu, Xin (2023). Towards reliable and efficient vegetation segmentation for Australian wheat data analysis. 34th Australasian Database Conference (ADC), Melbourne, NSW Australia, 1-3 November 2023. Cham, Switzerland: Springer Cham. doi: 10.1007/978-3-031-47843-7_9

Towards reliable and efficient vegetation segmentation for Australian wheat data analysis

2023

Conference Publication

Audio-visual segmentation by exploring cross-modal mutual semantics

Liu, Chen, Li, Peike Patrick, Qi, Xingqun, Zhang, Hu, Li, Lincheng, Wang, Dadong and Yu, Xin (2023). Audio-visual segmentation by exploring cross-modal mutual semantics. MM '23: The 31st ACM International Conference on Multimedia, Ottawa, ON Canada, 29 October - 3 November 2023. New York, NY United States: Association for Computing Machinery. doi: 10.1145/3581783.3612373

Audio-visual segmentation by exploring cross-modal mutual semantics

2023

Conference Publication

DyGait: exploiting dynamic representations for high-performance gait recognition

Wang, Ming, Guo, Xianda, Lin, Beibei, Yang, Tian, Zhu, Zheng, Li, Lincheng, Zhang, Shunli and Yu, Xin (2023). DyGait: exploiting dynamic representations for high-performance gait recognition. IEEE/CVF International Conference on Computer Vision (ICCV), Paris, France, 2-6 October 2023. Piscataway, NJ, United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/iccv51070.2023.01235

DyGait: exploiting dynamic representations for high-performance gait recognition

2023

Conference Publication

Gait recognition with mask-based regularization

Shen, Chuanfu, Lin, Beibei, Zhang, Shunli, Yu, Xin, Huang, George Q. and Yu, Shiqi (2023). Gait recognition with mask-based regularization. IEEE International Joint Conference on Biometrics (IJCB), Ljubljana, Slovenia, 25-28 September 2023. New York, NY, United States: IEEE. doi: 10.1109/ijcb57857.2023.10449112

Gait recognition with mask-based regularization