2024

Conference Publication

Which legal requirements are relevant to a business process? Comparing AI-driven methods as expert aid

Sai, Catherine, Sadiq, Shazia, Han, Lei, Demartini, Gianluca and Rinderle-Ma, Stefanie (2024). Which legal requirements are relevant to a business process? Comparing AI-driven methods as expert aid. 18th International Conference, RCIS 2024, Guimarães, Portugal, 14-17 May 2024. Cham, Switzerland: Springer Nature Switzerland. doi: 10.1007/978-3-031-59465-6_11

2023

Conference Publication

Exploring data workers’ behaviours in data quality discovery

Chen, Tianwa, Demartini, Gianluca, Indulska, Marta and Sadiq, Shazia (2023). Exploring data workers’ behaviours in data quality discovery. Australasian Conference on Information Systems 2023, Wellington, New Zealand, 5-8 December 2023. Atlanta, GA USA: Association for Information Systems.

2023

Journal Article

Active learning with feature matching for clinical named entity recognition

Le, Linh, Demartini, Gianluca, Zuccon, Guido, Zhao, Genghong and Zhang, Xia (2023). Active learning with feature matching for clinical named entity recognition. Natural Language Processing Journal, 4 100015, 1-11. doi: 10.1016/j.nlp.2023.100015

2023

Conference Publication

Perspectives on large language models for relevance judgment

Faggioli, Guglielmo, Dietz, Laura, Clarke, Charles L. A., Demartini, Gianluca, Hagen, Matthias, Hauff, Claudia, Kando, Noriko, Kanoulas, Evangelos, Potthast, Martin, Stein, Benno and Wachsmuth, Henning (2023). Perspectives on large language models for relevance judgment. 13th ACM SIGIR International Conference on the Theory of Information Retrieval (ICTIR), Taipei, Taiwan, 23 July 2023. New York, United States: Association for Computing Machinery. doi: 10.1145/3578337.3605136

2023

Conference Publication

On the impact of data quality on image classification fairness

Barry, Aki, Han, Lei and Demartini, Gianluca (2023). On the impact of data quality on image classification fairness. SIGIR '23: The 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, Taipei, Taiwan, 23-27 July 2023. New York, NY United States: Association for Computing Machinery. doi: 10.1145/3539618.3592031

2023

Journal Article

A data-driven analysis of behaviors in data curation processes

Han, Lei, Chen, Tianwa, Demartini, Gianluca, Indulska, Marta and Sadiq, Shazia (2023). A data-driven analysis of behaviors in data curation processes. ACM Transactions on Information Systems, 41 (3) 72, 1-35. doi: 10.1145/3567419

2023

Conference Publication

Firearms on Twitter: A novel object detection pipeline

Harvey, Ryan, Lebret, Rémi, Massonnet, Stéphane, Aberer, Karl and Demartini, Gianluca (2023). Firearms on Twitter: A novel object detection pipeline. Seventeenth International AAAI Conference on Web and Social Media, Limassol, Cyprus, 5–8 June 2023. Palo Alto, CA United States: Association for the Advancement of Artificial Intelligence. doi: 10.1609/icwsm.v17i1.22221

2023

Journal Article

DataOps-4G: on supporting generalists in data quality discovery

Yu, Shaochen, Chen, Tianwa, Han, Lei, Demartini, Gianluca and Sadiq, Shazia (2023). DataOps-4G: on supporting generalists in data quality discovery. IEEE Transactions on Knowledge and Data Engineering, 35 (5), 4668-4681. doi: 10.1109/tkde.2022.3151605

2023

Conference Publication

The community notes observatory: can crowdsourced fact-checking be trusted in practice?

Righes, Luca, Saeed, Mohammed, Demartini, Gianluca and Papotti, Paolo (2023). The community notes observatory: can crowdsourced fact-checking be trusted in practice? WWW '23 Companion: ACM Web Conference 2023, Austin, TX, United States, 30 April - 4 May 2023. New York, NY, United States: ACM. doi: 10.1145/3543873.3587340

2023

Conference Publication

Human-in-the-loop regular expression extraction for single column format inconsistency

Yu, Shaochen, Han, Lei, Indulska, Marta, Sadiq, Shazia and Demartini, Gianluca (2023). Human-in-the-loop regular expression extraction for single column format inconsistency. WWW '23: ACM Web Conference 2023, Austin, TX, United States, 30 April - 4 May 2023. New York, United States: Association for Computing Machinery. doi: 10.1145/3543507.3583515

2023

Conference Publication

Leveraging Semantic Type Dependencies for Clinical Named Entity Recognition

Le, Linh, Zuccon, Guido, Demartini, Gianluca, Zhao, Genghong and Zhang, Xia (2023). Leveraging Semantic Type Dependencies for Clinical Named Entity Recognition. AMIA 2022 Annual Symposium, Washington, DC United States, 5-9 November 2022. Bethesda, MD United States: American Medical Informatics Association.

2023

Journal Article

On the role of human and machine metadata in relevance judgment tasks

Xu, Jiechen, Han, Lei, Sadiq, Shazia and Demartini, Gianluca (2023). On the role of human and machine metadata in relevance judgment tasks. Information Processing and Management, 60 (2) 103177, 1-14. doi: 10.1016/j.ipm.2022.103177

2023

Other Outputs

MELArt: Multimodal Entity Linking Evaluation Dataset for Art (Version 1.0)

Demartini, Gianluca, Le, Linh, Krestel, Ralf and Sierra, Alejandro (2023). MELArt: Multimodal Entity Linking Evaluation Dataset for Art (Version 1.0). The University of Queensland. (Dataset) doi: 10.48610/2a8ef30

2023

Other Outputs

UQ Single Column Format Inconsistency Datasets

Demartini, Gianluca, Chen, Tianwa, Sadiq, Shazia, Fan, Shaoyang, Xu, Jiechen, Han, Lei and Yu, Shaochen (2023). UQ Single Column Format Inconsistency Datasets. The University of Queensland. (Dataset) doi: 10.48610/0ab54e7

2022

Journal Article

Combining human and machine confidence in truthfulness assessment

Qu, Yunke, La Barbera, David, Roitero, Kevin, Mizzaro, Stefano, Spina, Damiano and Demartini, Gianluca (2022). Combining human and machine confidence in truthfulness assessment. Journal of Data and Information Quality, 15 (1) 5, 1-17. doi: 10.1145/3546916

2022

Journal Article

Task design in complex crowdsourcing experiments: Item assignment optimization

Ceschia, Sara, Roitero, Kevin, Demartini, Gianluca, Mizzaro, Stefano, Di Gaspero, Luca and Schaerf, Andrea (2022). Task design in complex crowdsourcing experiments: Item assignment optimization. Computers and Operations Research, 148 105995, 1-13. doi: 10.1016/j.cor.2022.105995

2022

Journal Article

Report on the 1st Workshop on Human-in-the-Loop Data Curation (HIL-DC 2022) at CIKM 2022

Demartini, Gianluca, Yang, Jie and Sadiq, Shazia (2022). Report on the 1st Workshop on Human-in-the-Loop Data Curation (HIL-DC 2022) at CIKM 2022. ACM SIGIR Forum, 56 (2), 1-8. doi: 10.1145/3582900.3582921

2022

Conference Publication

Automatic identification of 5C vaccine behaviour on social media

Sampath Kumar, Ajay Hemanth, Shausan, Aminath, Demartini, Gianluca and Rahimi, Afshin (2022). Automatic identification of 5C vaccine behaviour on social media. Eighth Workshop on Noisy User-generated Text (W-NUT 2022), Gyeongju, Republic of Korea, 12-17 October 2022. Gyeongju, Republic of Korea: International Conference on Computational Linguistics.

2022

Conference Publication

Crowdsourced fact-checking at Twitter: how does the crowd compare with experts?

Saeed, Mohammed, Traub, Nicolas, Nicolas, Maelle, Demartini, Gianluca and Papotti, Paolo (2022). Crowdsourced fact-checking at Twitter: how does the crowd compare with experts? 31st ACM International Conference on Information & Knowledge Management, Atlanta, GA USA, 17-21 October 2022. New York, NY, USA: ACM. doi: 10.1145/3511808.3557279

2022

Conference Publication

Workshop on human-in-the-loop data curation

Demartini, Gianluca, Yang, Jie and Sadiq, Shazia (2022). Workshop on human-in-the-loop data curation. 31st ACM International Conference on Information and Knowledge Management, Atlanta, GA, United States, 17-21 October 2022. New York, NY, United States: ACM. doi: 10.1145/3511808.3557498
