|
2025 Conference Publication Multimodal Deepfake Generation and Detection: Challenges, Methods, and Future DirectionsDhall, Abhinav, Cai, Zhixi and Ghosh, Shreya (2025). Multimodal Deepfake Generation and Detection: Challenges, Methods, and Future Directions. New York, NY, USA: ACM. doi: 10.1145/3747327.3762826 |
|
2025 Conference Publication GEMS: Group Emotion Profiling Through Multimodal Situational UnderstandingKataria, Anubhav, Madan, Surbhi, Ghosh, Shreya, Gedeon, Tom and Dhall, Abhinav (2025). GEMS: Group Emotion Profiling Through Multimodal Situational Understanding. IEEE. doi: 10.1109/mlsp62443.2025.11204342 |
|
2025 Journal Article Empathy detection from text, audiovisual, audio or physiological signals: a systematic review of task formulations and machine learning methodsHasan, Md Rakibul, Hossain, Md Zakir, Ghosh, Shreya, Krishna, Aneesh and Gedeon, Tom (2025). Empathy detection from text, audiovisual, audio or physiological signals: a systematic review of task formulations and machine learning methods. IEEE Transactions on Affective Computing, 16 (4), 1-20. doi: 10.1109/taffc.2025.3590107 |
|
2025 Conference Publication 7th ABAW competition: multi-task learning and compound expression recognitionKollias, Dimitrios, Zafeiriou, Stefanos, Kotsia, Irene, Dhall, Abhinav, Ghosh, Shreya, Shao, Chunchang and Hu, Guanyu (2025). 7th ABAW competition: multi-task learning and compound expression recognition. Computer Vision – ECCV 2024 Workshops, Milan, Italy, 29 September-4 October 2024. Cham, Switzerland: Springer Cham. doi: 10.1007/978-3-031-91581-9_3 |
|
2025 Conference Publication MIP-GAF: a MLLM-annotated benchmark for Most Important Person localization and group context understandingMadan, S., Ghosh, S., Sookha, L. R., Ganaie, M. A., Subramanian, R., Dhall, A. and Gedeon, T. (2025). MIP-GAF: a MLLM-annotated benchmark for Most Important Person localization and group context understanding. 2025 Winter Conference on Applications of Computer Vision-WACV, Tucson, AZ USA, 28 February-4 March 2025. Los Alamitos, CA USA: IEEE Computer Society. doi: 10.1109/wacv61041.2025.00150 |
|
2024 Conference Publication AV-Deepfake1M: a large-scale LLM-driven audio-visual deepfake datasetCai, Zhixi, Ghosh, Shreya, Adatia, Aman Pankaj, Hayat, Munawar, Dhall, Abhinav, Gedeon, Tom and Stefanov, Kalin (2024). AV-Deepfake1M: a large-scale LLM-driven audio-visual deepfake dataset. MM '24: The 32nd ACM International Conference on Multimedia, Melbourne, VIC Australia, 28 October-1 November 2024. New York, NY USA: Association for Computing Machinery. doi: 10.1145/3664647.3680795 |
|
2024 Conference Publication 1M-Deepfakes Detection ChallengeCai, Zhixi, Dhall, Abhinav, Ghosh, Shreya, Hayat, Munawar, Kollias, Dimitrios, Stefanov, Kalin and Tariq, Usman (2024). 1M-Deepfakes Detection Challenge. The 32nd ACM International Conference on Multimedia, Melbourne, VIC Australia, 28 October-1 November 2024. New York, NY USA: Association for Computing Machinery, Inc. doi: 10.1145/3664647.3689145 |
|
2024 Conference Publication MRAC '24 Chairs' WelcomeTao, Jianhua, Ghosh, Shreya, Lian, Zheng, Cai, Zhixi, Schuller, Björn W., Dhall, Abhinav, Zhao, Guoying, Kollias, Dimitrios, Cambria, Erik, Goecke, Roland and Gedeon, Tom (2024). MRAC '24 Chairs' Welcome. MM '24: The 32nd ACM International Conference on Multimedia, Melbourne, VIC Australia, 28 October - 1 November 2024. New York, NY United States: Association for Computing Machinery. |
|
2024 Conference Publication MRAC Track 1: 2nd Workshop on Multimodal, Generative and Responsible Affective ComputingGhosh, Shreya, Cai, Zhixi, Dhall, Abhinav, Kollias, Dimitrios, Goecke, Roland and Gedeon, Tom (2024). MRAC Track 1: 2nd Workshop on Multimodal, Generative and Responsible Affective Computing. The 32nd ACM International Conference on Multimedia, Melbourne, VIC Australia, 28 October-1 November 2024. New York, NY USA: Association for Computing Machinery. doi: 10.1145/3689092.3690042 |
|
2024 Conference Publication Emolysis: a multimodal open-source group emotion analysis and visualization toolkitGhosh, Shreya, Cai, Zhixi, Gupta, Parul, Sharma, Garima, Dhall, Abhinav, Hayat, Munawar and Gedeon, Tom (2024). Emolysis: a multimodal open-source group emotion analysis and visualization toolkit. 12th International Conference on Affective Computing and Intelligent Interaction Workshops and Demos-ACIIW, Glasgow, Scotland, United Kingdom, 15 September 2024. Los Alamitos, CA USA: IEEE Computer Society. doi: 10.1109/aciiw63320.2024.00023 |
|
2024 Conference Publication Attention-Based Multi-layer Perceptron to Categorize Affective Videos from Viewer's Physiological SignalsShaiok, Lazib Sharar, Hoque, Ishtiaqul, Hasan, Md Rakibul, Ghosh, Shreya, Gedeon, Tom and Hossain, Md Zakir (2024). Attention-Based Multi-layer Perceptron to Categorize Affective Videos from Viewer's Physiological Signals. 16th Asian Conference on Intelligent Information and Database Systems (ACIIDS), Ras Al Khaimah, United Arab Emirates, 15-18 April 2024. Heidelberg, Germany: Springer. doi: 10.1007/978-981-97-5934-7_3 |
|
2024 Journal Article Automatic gaze analysis: a survey of deep learning based approachesGhosh, Shreya, Dhall, Abhinav, Hayat, Munawar, Knibbe, Jarrod and Ji, Qiang (2024). Automatic gaze analysis: a survey of deep learning based approaches. IEEE Transactions on Pattern Analysis and Machine Intelligence, 46 (1), 61-84. doi: 10.1109/tpami.2023.3321337 |
|
2023 Conference Publication EfficienTransNet: an automated chest X-ray report generation paradigmMondal, Chayan, Pham, Duc-Son, Gupta, Ashu, Ghosh, Shreya, Tan, Tele and Gedeon, Tom (2023). EfficienTransNet: an automated chest X-ray report generation paradigm. The 31st ACM International Conference on Multimedia, Ottawa, Canada, 29 October 2023. New York, NY USA: Association for Computing Machinery. doi: 10.1145/3607865.3616174 |
|
2023 Conference Publication GraphITTI: Attributed Graph-based Dominance Ranking in Social Interaction VideosSharma, Garima, Ghosh, Shreya, Dhall, Abhinav, Hayat, Munawar, Cai, Jianfei and Gedeon, Tom (2023). GraphITTI: Attributed Graph-based Dominance Ranking in Social Interaction Videos. ICMI '23 Companion: Companion Publication of the 25th International Conference on Multimodal Interaction, Paris, France, 9 - 13 October 2023. New York, NY United States: Association for Computing Machinery. doi: 10.1145/3610661.3616184 |
|
2023 Journal Article Glitch in the matrix: A large scale benchmark for content driven audio-visual forgery detection and localizationCai, Zhixi, Ghosh, Shreya, Dhall, Abhinav, Gedeon, Tom, Stefanov, Kalin and Hayat, Munawar (2023). Glitch in the matrix: A large scale benchmark for content driven audio-visual forgery detection and localization. Computer Vision and Image Understanding, 236 103818, 1-12. doi: 10.1016/j.cviu.2023.103818 |
|
2023 Conference Publication MARLIN: Masked Autoencoder for facial video Representation LearnINgCai, Zhixi, Ghosh, Shreya, Stefanov, Kalin, Dhall, Abhinav, Cai, Jianfei, Rezatofighi, Hamid, Haffari, Reza and Hayat, Munawar (2023). MARLIN: Masked Autoencoder for facial video Representation LearnINg. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, BC Canada, 17-24 June 2023. Piscataway, NJ United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/cvpr52729.2023.00150 |
|
2023 Conference Publication 'Labelling the gaps': a weakly supervised automatic eye gaze estimationGhosh, Shreya, Dhall, Abhinav, Hayat, Munawar and Knibbe, Jarrod (2023). 'Labelling the gaps': a weakly supervised automatic eye gaze estimation. 16th Asian Conference on Computer Vision (ACCV), Macao, Peoples Republic of China, 4-8 December 2022. Cham, Switzerland: Springer Cham. doi: 10.1007/978-3-031-26316-3_44 |
|
2022 Conference Publication AV-GAZE: a study on the effectiveness of audio guided visual attention estimation for non-profilic facesGhosh, Shreya, Dhall, Abhinav, Hayat, Munawar and Knibbe, Jarrod (2022). AV-GAZE: a study on the effectiveness of audio guided visual attention estimation for non-profilic faces. 2022 IEEE International Conference on Image Processing (ICIP), Bordeaux, France, 16-19 October 2022. Piscataway, NJ, United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/icip46576.2022.9897360 |
|
2022 Journal Article Automatic prediction of group cohesiveness in imagesGhosh, Shreya, Dhall, Abhinav, Sebe, Nicu and Gedeon, Tom (2022). Automatic prediction of group cohesiveness in images. IEEE Transactions on Affective Computing, 13 (3), 1677-1690. doi: 10.1109/taffc.2020.3026095 |
|
2022 Conference Publication MTGLS: Multi-Task Gaze Estimation with Limited SupervisionGhosh, Shreya, Hayat, Munawar, Dhall, Abhinav and Knibbe, Jarrod (2022). MTGLS: Multi-Task Gaze Estimation with Limited Supervision. 22nd IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI United States, 4-8 January 2022. Piscataway, NJ, United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/WACV51458.2022.00123 |