|
2026 Conference Publication Mebm: Exploring the Synergy of Mixture of Experts in Background MattingWang, Yiru, Lu, Ming, Tian, Senmao, Yu, Xin and Zhang, Shunli (2026). Mebm: Exploring the Synergy of Mixture of Experts in Background Matting. IEEE. doi: 10.1109/icassp55912.2026.11464270 |
|
2026 Conference Publication Content-Aware Model Slimming for Image Super-Resolution with Large InputTian, Senmao, Hong, Gangyi, Wang, Shuyun, Yu, Xin and Zhang, Shunli (2026). Content-Aware Model Slimming for Image Super-Resolution with Large Input. IEEE. doi: 10.1109/icassp55912.2026.11461944 |
|
2026 Journal Article DFBSNet: Dual frequency-domain branch fusion and selection network for hyperspectral anomaly detectionYao, Yiming, Wang, Qing, Zhao, Dong, You, Mingtao, Xiang, Pei, Asano, Yuta, Yu, Xin, Wang, Chao, Zhou, Huixin and Ren, Jinchang (2026). DFBSNet: Dual frequency-domain branch fusion and selection network for hyperspectral anomaly detection. Pattern Recognition, 180 113967, 113967. doi: 10.1016/j.patcog.2026.113967 |
|
2026 Journal Article Cluster-aware prompt ensemble learning for few-shot vision-language model adaptationChen, Zhi, Yu, Xin, Tao, Xiaohui, Li, Yan and Huang, Zi (2026). Cluster-aware prompt ensemble learning for few-shot vision-language model adaptation. Pattern Recognition, 172 (C) 112596. doi: 10.1016/j.patcog.2025.112596 |
|
2026 Journal Article Mobile Auslan: A multimodal dialogue-centered sign language learning systemSheng, Hongwei, Shen, Xin, Du, Heming and Yu, Xin (2026). Mobile Auslan: A multimodal dialogue-centered sign language learning system. Computer Vision and Image Understanding, 265 104646, 104646-265. doi: 10.1016/j.cviu.2026.104646 |
|
2026 Book Chapter High-Resolution and Multimodal Optogenetic fMRI of Brain DynamicsHe, Yi, Yuan, Jianyu, Liang, Mingyao, Xie, Zeping and Yu, Xin (2026). High-Resolution and Multimodal Optogenetic fMRI of Brain Dynamics. Neuromethods. (pp. 3-18) New York, NY: Springer US. doi: 10.1007/978-1-0716-5178-0_1 |
|
2026 Journal Article PrefaceLiu, Miaomiao, Yu, Xin, Xu, Chang and Song, Yiliao (2026). Preface. Lecture Notes in Computer Science, 16370 LNAI, v-vi. |
|
2026 Journal Article Compression-Oriented Video Super-ResolutionWang, Shuyun, Liu, Yanbin, Lu, Ming, Wu, Zhuojie, Tian, Senmao, Guo, Yandong and Yu, Xin (2026). Compression-Oriented Video Super-Resolution. IEEE Transactions on Image Processing, PP (99), 1-1. doi: 10.1109/tip.2026.3682128 |
|
2026 Conference Publication Augment to Segment: Tackling Pixel-Level Imbalance in Wheat Disease and Pest SegmentationWei, Tianqi, Yu, Xin, Chen, Zhi, Chapman, Scott and Huang, Zi (2026). Augment to Segment: Tackling Pixel-Level Imbalance in Wheat Disease and Pest Segmentation. Springer Science and Business Media Deutschland GmbH. doi: 10.1007/978-981-95-6196-4_3 |
|
2026 Journal Article Safe and Reliable Diffusion Models via Subspace ProjectionChen, Huiqiang, Zhu, Tianqing, Wang, Linlin, Yu, Xin, Gao, Longxiang and Zhou, Wanlei (2026). Safe and Reliable Diffusion Models via Subspace Projection. IEEE Transactions on Dependable and Secure Computing, PP (99), 1-14. doi: 10.1109/TDSC.2026.3692493 |
|
2026 Journal Article Distributed Zero-Shot Learning for Visual RecognitionChen, Zhi, Luo, Yadan, Huang, Zi, Li, Jingjing, Wang, Sen and Yu, Xin (2026). Distributed Zero-Shot Learning for Visual Recognition. IEEE Transactions on Multimedia, PP (99), 1-12. doi: 10.1109/TMM.2026.3673561 |
|
2026 Conference Publication Dynamic Orchestration of Multi-agent System for Real-World Multi-image Agricultural VQAKe, Yan, Yu, Xin, Du, Heming, Chapman, Scott and Huang, Helen (2026). Dynamic Orchestration of Multi-agent System for Real-World Multi-image Agricultural VQA. Springer Science and Business Media Deutschland GmbH. doi: 10.1007/978-981-95-6196-4_11 |
|
2025 Journal Article Hyperspectral video object tracking with cross-modal spectral complementary and memory prompt networkJiang, Wenhao, Zhao, Dong, Wang, Chen, Yu, Xin, Arun, Pattathal V., Asano, Yuta, Xiang, Pei and Zhou, Huixin (2025). Hyperspectral video object tracking with cross-modal spectral complementary and memory prompt network. Knowledge-Based Systems, 330 (Part B) 114595, 1-16. doi: 10.1016/j.knosys.2025.114595 |
|
2025 Journal Article Analytical Survey of Learning with Low-Resource Data: From Analysis to InvestigationCao, Xiaofeng, Xu, Mingwei, Yu, Xin, Yao, Jiangchao, Ye, Wei, Huang, Shengjun, Zhang, Minling, Tsang, Ivor, Ong, Yew-Soon, Kwok, James T. and Shen, Heng Tao (2025). Analytical Survey of Learning with Low-Resource Data: From Analysis to Investigation. ACM Computing Surveys, 58 (6) 3773075, 1-47. doi: 10.1145/3773075 |
|
2025 Conference Publication Cross-View Isolated Sign Language Recognition via View Synthesis and Feature DisentanglementShen, Xin, Wang, Xinyu, Shen, Lei, Zhang, Kaihao and Yu, Xin (2025). Cross-View Isolated Sign Language Recognition via View Synthesis and Feature Disentanglement. IEEE. doi: 10.1109/iccv51701.2025.01920 |
|
2025 Conference Publication 3DRealCar: An In-the-Wild RGB-D Car Dataset with 360-Degree ViewsDu, Xiaobiao, Wang, Yida, Sun, Haiyang, Wu, Zhuojie, Sheng, Hongwei, Wang, Shuyun, Ying, Jiaying, Lu, Ming, Zhu, Tianqing, Zhan, Kun and Yu, Xin (2025). 3DRealCar: An In-the-Wild RGB-D Car Dataset with 360-Degree Views. IEEE. doi: 10.1109/iccv51701.2025.02458 |
|
2025 Conference Publication LDPose: Towards Inclusive Human Pose Estimation for Limb-Deficient Individuals in the WildYing, Jiaying, Du, Heming, Zhang, Kaihao, Li, Lincheng and Yu, Xin (2025). LDPose: Towards Inclusive Human Pose Estimation for Limb-Deficient Individuals in the Wild. IEEE. doi: 10.1109/iccv51701.2025.00920 |
|
2025 Conference Publication Robust audio-visual segmentation via audio-guided visual convergent alignmentLiu, Chen, Li, Peike, Yang, Liying, Wang, Dadong, Li, Lincheng and Yu, Xin (2025). Robust audio-visual segmentation via audio-guided visual convergent alignment. 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN USA, 10-17 June 2025. Piscataway, NJ USA: Institute of Electrical and Electronics Engineers. doi: 10.1109/cvpr52734.2025.02693 |
|
2025 Conference Publication EasyCraft: a robust and efficient framework for automatic avatar craftingWang, Suzhen, Chen, Weijie, Zhang, Wei, Zhao, Minda, Li, Lincheng, Zhang, Rongsheng, Hu, Zhipeng and Yu, Xin (2025). EasyCraft: a robust and efficient framework for automatic avatar crafting. 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN USA, 10-17 June 2025. New York, NY USA: IEEE Computer Society. doi: 10.1109/CVPR52734.2025.00524 |
|
2025 Conference Publication Dynamic derivation and elimination: audio visual segmentation with enhanced audio semanticsLiu, Chen, Yang, Liying, Li, Peike, Wang, Dadong, Li, Lincheng and Yu, Xin (2025). Dynamic derivation and elimination: audio visual segmentation with enhanced audio semantics. 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN USA, 10-17 June 2025. Piscataway, NJ USA: Institute of Electrical and Electronics Engineers. doi: 10.1109/cvpr52734.2025.00298 |