Skip to menu Skip to content Skip to footer

2026

Conference Publication

On-device large language models for sequential recommendation

Xia, Xin, Yin, Hongzhi and Culpepper, Shane (2026). On-device large language models for sequential recommendation. WSDM '26: Proceedings of the Nineteenth ACM International Conference on Web Search and Data Mining, Boise, ID, United States, 22-26 February 2026. New York, NY, United States: ACM. doi: 10.1145/3773966.3777961

On-device large language models for sequential recommendation

2026

Journal Article

Missing Value Imputation in Tabular Data Lakes Unleashed: A Hybrid Approach

Luo, Feng, Lan, Hai, Luo, Hui, Bao, Zhifeng, Culpepper, J. Shane, Sadiq, Shazia and Wang, Xiaoli (2026). Missing Value Imputation in Tabular Data Lakes Unleashed: A Hybrid Approach. The VLDB Journal, 35 (2) 11. doi: 10.1007/s00778-025-00957-1

Missing Value Imputation in Tabular Data Lakes Unleashed: A Hybrid Approach

2026

Book Chapter

Revisiting Human-vs-LLM Judgments Using the TREC Podcast Track

Mansour, Watheq, Shane Culpepper, J., Mackenzie, Joel and Yates, Andrew (2026). Revisiting Human-vs-LLM Judgments Using the TREC Podcast Track. Lecture Notes in Computer Science. (pp. 436-443) Cham: Springer Nature Switzerland. doi: 10.1007/978-3-032-21300-6_34

Revisiting Human-vs-LLM Judgments Using the TREC Podcast Track

2025

Conference Publication

IR for AAC Users: A Hyperdimensional Computing (Vector Symbolic Architectures) approach

Briegel, Hunter, Pagal, Maya and Culpepper, J. Shane (2025). IR for AAC Users: A Hyperdimensional Computing (Vector Symbolic Architectures) approach. 48th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (ACM SIGIR 2025), Padua, Italy, 13-18 July 2025. New York, NY, United States: Association for Computing Machinery. doi: 10.1145/3726302.3730273

IR for AAC Users: A Hyperdimensional Computing (Vector Symbolic Architectures) approach

2025

Conference Publication

The effects of demographic instructions on LLM personas

Magnossão de Paula, Angel Felipe, Culpepper, J. Shane, Moffat, Alistair, Pathiyan Cherumanal, Sachin, Scholer, Falk and Trippas, Johanne (2025). The effects of demographic instructions on LLM personas. 48th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (ACM SIGIR 2025), Padua, Italy, 13-18 July 2025. New York, NY, United States: Association for Computing Machinery. doi: 10.1145/3726302.3730255

The effects of demographic instructions on LLM personas

2025

Journal Article

Report from the 4th Strategic Workshop on Information Retrieval in Lorne (SWIRL 2025)

Trippas, Johanne R., Culpepper, J. Shane, Aliannejadi, Mohammad, Allan, James, Amigó, Enrique, Arguello, Jaime, Azzopardi, Leif, Bailey, Peter, Callan, Jamie, Capra, Rob, Craswell, Nick, Croft, Bruce, Dalton, Jeff, Demartini, Gianluca, Dietz, Laura, Dou, Zhicheng, Eickhoff, Carsten, Ekstrand, Michael, Ferro, Nicola, Fuhr, Norbert, Glowacka, Dorota, Hasibi, Faegheh, Hettiachchi, Danula, Jones, Rosie, Kamps, Jaap, Kando, Noriko, Karimi, Sarvnaz, Kato, Makoto P., Koopman, Bevan ... Zuccon, Guido (2025). Report from the 4th Strategic Workshop on Information Retrieval in Lorne (SWIRL 2025). ACM SIGIR Forum, 59 (1), 1-68. doi: 10.1145/3769733.3769739

Report from the 4th Strategic Workshop on Information Retrieval in Lorne (SWIRL 2025)

2025

Conference Publication

Dataset Discovery via Line Charts

Ji, Daomin, Luo, Hui, Bao, Zhifeng and Culpepper, J. Shane (2025). Dataset Discovery via Line Charts. 2025 IEEE 41st International Conference on Data Engineering (ICDE), Hong Kong, Hong Kong, 19-23 May 2025. Piscataway, NJ, United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/icde65448.2025.00046

Dataset Discovery via Line Charts

2025

Conference Publication

Let the Augmentation Path Speak

Ji, Daomin, Luo, Hui, Bao, Zhifeng, Sadiq, Shazia and Culpepper, J. Shane (2025). Let the Augmentation Path Speak. WWW '25: The ACM Web Conference 2025, Sydney, NSW, Australia, 28 April - 2 May 2025. New York, NY, United States: ACM. doi: 10.1145/3701716.3715183

Let the Augmentation Path Speak

2025

Journal Article

Table integration in data lakes unleashed: pairwise integrability judgment, integrable set discovery, and multi-tuple conflict resolution

Ji, Daomin, Luo, Hui, Bao, Zhifeng and Culpepper, J. Shane (2025). Table integration in data lakes unleashed: pairwise integrability judgment, integrable set discovery, and multi-tuple conflict resolution. The VLDB Journal, 34 (3) 36, 1-24. doi: 10.1007/s00778-025-00917-9

Table integration in data lakes unleashed: pairwise integrability judgment, integrable set discovery, and multi-tuple conflict resolution

2025

Conference Publication

Distinctiveness Maximization in Datasets Assemblage

Wang, Tingting, Huang, Shixun, Bao, Zhifeng, Culpepper, J. Shane, Dedeoglu, Volkan and Arablouei, Reza (2025). Distinctiveness Maximization in Datasets Assemblage. WWW '25: The ACM Web Conference 2025, Sydney, NSW, Australia, 28 April-2 May 2025. New York, NY United States: Association for Computing Machinery. doi: 10.1145/3696410.3714830

Distinctiveness Maximization in Datasets Assemblage

2025

Conference Publication

Examining the Impact of Transcript Variation on Podcast Search and Re-ranking

Mansour, Watheq, Culpepper, J. Shane and Mackenzie, Joel (2025). Examining the Impact of Transcript Variation on Podcast Search and Re-ranking. 47th European Conference on Information Retrieval, ECIR 2025, Lucca, Italy, 6–10 April 2025. Cham, Switzerland: Springer. doi: 10.1007/978-3-031-88714-7_9

Examining the Impact of Transcript Variation on Podcast Search and Re-ranking

2025

Conference Publication

Multimodal feature extraction for assistive technology: evaluation and dataset

Briegel, Hunter, Pagal, Maya, Liddle, Jacki and Culpepper, Shane (2025). Multimodal feature extraction for assistive technology: evaluation and dataset. 47th European Conference on Information Retrieval, ECIR 2025, Lucca, Italy, 6 -10 April 2025. Cham, Switzerland: Springer Nature Switzerland. doi: 10.1007/978-3-031-88717-8_13

Multimodal feature extraction for assistive technology: evaluation and dataset

2024

Conference Publication

A fully on-disk updatable learned index

Lan, Hai, Bao, Zhifeng, Culpepper, J. Shane, Borovica-Gajic, Renata and Dong, Yu (2024). A fully on-disk updatable learned index. 2024 IEEE 40th International Conference on Data Engineering (ICDE), Utrecht, Netherlands, 13-16 May 2024. Washington, DC, United States: IEEE Computer Society. doi: 10.1109/icde60146.2024.00369

A fully on-disk updatable learned index

2024

Conference Publication

Enhancing human annotation: leveraging large language models and efficient batch processing

Zendel, Oleg, Culpepper, J. Shane, Scholer, Falk and Thomas, Paul (2024). Enhancing human annotation: leveraging large language models and efficient batch processing. CHIIR ’24: 2024 Conference on Human Information Interaction and Retrieval, Sheffield, United Kingdom, 10-14 March 2024. New York, NY, United States: ACM. doi: 10.1145/3627508.3638322

Enhancing human annotation: leveraging large language models and efficient batch processing

2024

Conference Publication

Optimizing data acquisition to enhance machine learning performance

Wang, Tingting, Huang, Shixun, Bao, Zhifeng, Culpepper, J. Shane, Dedeoglu, Volkan and Arablouei, Reza (2024). Optimizing data acquisition to enhance machine learning performance. 50th International Conference on Very Large Data Bases, Guangzhou, China, 26-30 August 2024. New York, NY, United States: Association for Computing Machinery. doi: 10.14778/3648160.3648172

Optimizing data acquisition to enhance machine learning performance

2024

Conference Publication

Navigating data repositories: utilizing line charts to discover relevant datasets

Ji, Daomin, Luo, Hui, Bao, Zhifeng and Culpepper, Shane (2024). Navigating data repositories: utilizing line charts to discover relevant datasets. 50th International Conference on Very Large Databases, Guangzhou, China, 26-30 August 2024. New York, NY, United States: Association for Computing Machinery. doi: 10.14778/3685800.3685857

Navigating data repositories: utilizing line charts to discover relevant datasets

2023

Journal Article

Updatable Learned Indexes Meet Disk-Resident DBMS - From Evaluations to Design Choices

Lan, Hai, Bao, Zhifeng, Culpepper, J. Shane and Borovica-Gajic, Renata (2023). Updatable Learned Indexes Meet Disk-Resident DBMS - From Evaluations to Design Choices. Proceedings of the ACM on Management of Data, 1 (2), 1-22. doi: 10.1145/3589284

Updatable Learned Indexes Meet Disk-Resident DBMS - From Evaluations to Design Choices

2023

Conference Publication

Facility relocation search for good: when facility exposure meets user convenience

Luo, Hui, Bao, Zhifeng, Culpepper, J. Shane, Li, Mingzhao and Zhao, Yanchang (2023). Facility relocation search for good: when facility exposure meets user convenience. ACM Web Conference 2023, Austin, TX, United States, 30 April - 4 May 2023. New York, NY, United States: ACM. doi: 10.1145/3543507.3583859

Facility relocation search for good: when facility exposure meets user convenience

2023

Conference Publication

Entropy-based query performance prediction for neural information retrieval systems

Zendel, Oleg, Liu, Binsheng, Culpepper, J. Shane and Scholer, Falk (2023). Entropy-based query performance prediction for neural information retrieval systems. QPP++ 2023: Query Performance Prediction and Its Evaluation in New Tasks, co-located with The 45th European Conference on Information Retrieval (ECIR), Dublin, Ireland, 2 - 6 April 2023. CEUR-WS.

Entropy-based query performance prediction for neural information retrieval systems

2022

Conference Publication

Representative routes discovery from massive trajectories

Wang, Tingting, Huang, Shixun, Bao, Zhifeng, Culpepper, J. Shane and Arablouei, Reza (2022). Representative routes discovery from massive trajectories. KDD '22: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, United States, 14-18 August 2022. New York, NY, United States: Association for Computing Machinery. doi: 10.1145/3534678.3539079

Representative routes discovery from massive trajectories