Skip to menu Skip to content Skip to footer
Dr Martin Schweinberger
Dr

Martin Schweinberger

Email: 
Phone: 
+61 7 336 56892

Overview

Background

Martin Schweinberger uses big data and computational methods to explore the messy, fascinating reality of how people actually talk—including all the swear words, filler words, and informal expressions that traditional language education overlooks. As a Lecturer in Applied Linguistics at the University of Queensland, he bridges the gap between computer science and linguistics to understand how language evolves in our digital age.

Uncovering Hidden Language Patterns

Much of Martin's research focuses on the language phenomena that schools don't teach but that permeate everyday conversation. He analyzes massive datasets to study vulgarity and swearing patterns, as well as discourse markers—those ubiquitous filler words like "like," "you know," "well," and "I mean" that pepper our speech. By applying statistical methods to real-world language use, he reveals how these supposedly "incorrect" forms of expression actually follow sophisticated social and linguistic rules.

His work also tracks how language changes over time and varies between different social settings, using computational tools to identify patterns that would be impossible to detect through traditional research methods alone.

Building Australia's Language Data Future

As Director of the Language Technology and Data Analysis Laboratory (LADAL)—a free upskilling platform for language data science with hundreds of thousands of users worldwide—and a key figure in one of Australia's major research infrastructure projects, the Language Data Commons of Australia (LDaCA), Martin is helping build the digital infrastructure that will support language research across the country. LDaCA has received substantial funding to create accessible tools and resources that allow researchers to analyze text and speech data more effectively.

Championing Research Transparency

Beyond his linguistic research, Martin advocates for reproducibility and transparency in humanities and social science research. He provides guidance on how language researchers can adopt more rigorous, open research practices—addressing a growing concern about the reliability of academic findings across disciplines.

Martin's international visibility is reflected in his leadership roles: he serves as Vice-President Professional of the International Society for the Linguistics of English (ISLE) and sits on the board of The International Computer Archive of Modern and Medieval English (ICAME), one of the oldest and most reputable societies for corpus linguistics. These positions demonstrate his commitment to advancing computational language research on a global scale.

Potential topics for supervision

I would be particularly interested in supervising theses on the following topics:

Sociolinguistics / Language Variation and Change / World Englishes

  • General extenders
  • Terms-of-address and salutations
  • Discourse particles and markers
  • Vulgarity
  • Adjective amplification

Learner Language / Applied Linguistics / Corpus Phonetics / Learner Corpus Research

  • Vowel production among L1 speakers and learners of English
  • Voice-onset-times among L1 speakers and learners of English
  • Fluency and pauses in learner and L1 speech.
  • Accent and intelligibility / comprehension.

Text Analytics / Digital Humanities / Corpus Linguistics

  • Applied word embedding applications in the language sciences.
  • Comparison of different association / keyness measures

Availability

Dr Martin Schweinberger is:
Available for supervision
Media expert

Qualifications

  • Doctor of Philosophy, Universität Hamburg

Research interests

  • Vulgarity and Swearing

    I investigate how swear words and taboo language are used in everyday speech and online discourse. Contrary to popular belief, vulgar language follows systematic social and linguistic rules. My research uncovers how such expressions function in communication and what they reveal about speakers’ identities, emotions, and group memberships.

  • Discourse Markers and Filler Words

    I study words like like, you know, and well—terms often dismissed as meaningless. Using computational analysis, I show how these elements structure conversations and convey nuanced meanings. My work demonstrates that such "filler" words play important roles in signaling attitudes, managing interactions, and guiding listener expectations.

  • Open Science and Research Transparency

    I actively promote reproducible, open research practices in the humanities and social sciences. I provide practical training and resources to help language researchers adopt transparent workflows. My advocacy supports greater academic rigor and long-term trust in empirical research.

  • Text Analytics and Computational Linguistics

    I apply computational methods—like machine learning and statistical modelling—to large corpora to uncover hidden linguistic patterns. These tools help quantify language use in a way that supports replicable, empirical research. My work is at the intersection of computer science and linguistics, making it especially relevant in the digital age.

  • Digital Infrastructure and Research Tools

    As Director of LADAL and a lead in LDaCA, I am building accessible digital platforms that support large-scale language analysis. These initiatives democratize access to language data and computational tools for researchers, students, and educators alike. My infrastructure work enhances the capacity for advanced language research in Australia and beyond.

  • Language Variation and Change

    I explore how language evolves over time and across different social settings. By analyzing large-scale linguistic datasets, I identify subtle patterns of variation in how people speak, particularly in informal and digital contexts. This research helps reveal how social norms and technology influence the way we communicate.

  • Learner Language and Second Language Acquisition

    I analyze how learners of English produce sounds, manage fluency, and develop pronunciation over time. This includes examining features like vowel quality, voice-onset time, pauses, and accent intelligibility. By comparing learner and native speaker data, my research informs language teaching and helps improve learner outcomes.

  • Corpus Phonetics

    I use corpus-based methods to investigate the phonetic characteristics of spoken language, including pronunciation patterns among both native and non-native speakers. I focus on measurable acoustic features such as vowel production and timing cues. This approach allows for the large-scale, data-driven analysis of speech in real-life settings.

Research impacts

As director and initiator of the Language Technology and Data Analysis Laboratory (LADAL) I am very proud that LADAL has emerged as one of Australia’s most prominent web-based collaborative support infrastructures for digital and computational humanities with more than 1.1 million page views of more than 500,000 active users in nearly 750,000 engaged sessions since 2021.

Works

Search Professor Martin Schweinberger’s works on UQ eSpace

94 works between 2008 and 2025

61 - 80 of 94 works

2018

Conference Publication

Methoden linguistischer Datenanalyse: quantitativ-statistische Modellierung sprachlicher Variation

Schweinberger, Martin (2018). Methoden linguistischer Datenanalyse: quantitativ-statistische Modellierung sprachlicher Variation. Ringvorlesung Empirieformate in der linguistischen Forschung, Hamburg, 16 January 2018.

Methoden linguistischer Datenanalyse: quantitativ-statistische Modellierung sprachlicher Variation

2018

Conference Publication

The sociolinguistics of emotional language - emotive use in Irish English

Schweinberger, Martin (2018). The sociolinguistics of emotional language - emotive use in Irish English. NPIE 5 (5th New Perspectives on Irish English meeting), Potsdam, Germany, 25-27 April 2018.

The sociolinguistics of emotional language - emotive use in Irish English

2017

Conference Publication

Using intensifier-adjective collocations to determine mechanisms of change

Schweinberger, Martin (2017). Using intensifier-adjective collocations to determine mechanisms of change. ICAME 38 (38th meeting of the International Computer Archive of Modern and Medieval English), Prague, Czech Republic, 24-28 May 2017.

Using intensifier-adjective collocations to determine mechanisms of change

2017

Conference Publication

Using semantic vector space models to investigate lexical replacement - a corpus based study of ongoing changes in intensifier systems

Schweinberger, Martin (2017). Using semantic vector space models to investigate lexical replacement - a corpus based study of ongoing changes in intensifier systems. 16th International Conference on Methods in Dialectology, Tokyo, Japan, 7-11 August 2017.

Using semantic vector space models to investigate lexical replacement - a corpus based study of ongoing changes in intensifier systems

2017

Conference Publication

Assessing differences in the English vocalic systems of L1-German learners and native speakers of English

Schweinberger, Martin, Stedman, Nina, Buzuk, Jelena, Sarkodie-Gyan, Sabrina, Boye, Naomi and Gerspacher, Sandra (2017). Assessing differences in the English vocalic systems of L1-German learners and native speakers of English. CuTLi 2017 (Current Trends in Linguistics 2017), Hamburg, Germany, 21-22 January 2017.

Assessing differences in the English vocalic systems of L1-German learners and native speakers of English

2017

Conference Publication

VowelChartProject. Erstellung personalisierter Vokaltrapeze zur Verbesserung der Zielsprachennähe im Zweitspracherwerb bei Lehramtsstudierenden

Schweinberger, Martin, Stedman, Nina, Buzuk, Jelena, Sarkodie-Gyan, Sabrina, Boye, Naomi and Gerspacher, Sandra (2017). VowelChartProject. Erstellung personalisierter Vokaltrapeze zur Verbesserung der Zielsprachennähe im Zweitspracherwerb bei Lehramtsstudierenden. CuTLi 2017 (Current Trends in Linguistics 2017), Hamburg, Germany, 21-22 January 2017.

VowelChartProject. Erstellung personalisierter Vokaltrapeze zur Verbesserung der Zielsprachennähe im Zweitspracherwerb bei Lehramtsstudierenden

2017

Conference Publication

VowelChartProject: Erstellung personalisierter Vokaltrapeze zur Verbesserung der Zielsprachennähe im Zweitspracherwerb bei Lehramtsstudierenden

Schweinberger, Martin and Gerspacher, Sandra (2017). VowelChartProject: Erstellung personalisierter Vokaltrapeze zur Verbesserung der Zielsprachennähe im Zweitspracherwerb bei Lehramtsstudierenden. New International Perspectives on Future Teachers' Professional Competencies, Profale Congress 2017, Hamburg, Germany, 21-23 September 2017.

VowelChartProject: Erstellung personalisierter Vokaltrapeze zur Verbesserung der Zielsprachennähe im Zweitspracherwerb bei Lehramtsstudierenden

2016

Conference Publication

Ongoing changes in the Irish intensifier system

Schweinberger, Martin (2016). Ongoing changes in the Irish intensifier system. NPIE4 (4th meeting on New Perspectives on Irish English), Bergen, Norway, 5-7 July 2016.

Ongoing changes in the Irish intensifier system

2016

Conference Publication

Ongoing change in the New Zealand English Intensifier System

Schweinberger, Martin (2016). Ongoing change in the New Zealand English Intensifier System. ISLE 4 (4th meeting of the International Society for the Linguistics of English), Poznan, Poland, 18-21 September 2016.

Ongoing change in the New Zealand English Intensifier System

2016

Conference Publication

Using sentiment analysis to investigate the social stratification of emotives in Irish English

Schweinberger, Martin (2016). Using sentiment analysis to investigate the social stratification of emotives in Irish English. ST & D 2016 (2016 meeting of the Society for Text & Discourse), Kassel, Germany, 18-20 July 2016.

Using sentiment analysis to investigate the social stratification of emotives in Irish English

2016

Conference Publication

On the acquisition of intensifier constructions

Schweinberger, Martin (2016). On the acquisition of intensifier constructions. 17. Norddeutsches Linguistisches Kolloquium (17. NLK), Hamburg, Germany, 31 March - 2 April 2016.

On the acquisition of intensifier constructions

2015

Conference Publication

Why we need better statistics - A case study on eh in New Zealand English

Schweinberger, Martin (2015). Why we need better statistics - A case study on eh in New Zealand English. 16. Norddeutsches Linguistisches Kolloquium (16. NLK), Hannover, Germany, 26-28 March 2015.

Why we need better statistics - A case study on eh in New Zealand English

2015

Book Chapter

A comparative study of the pragmatic marker like in Irish English and in south-eastern varieties of British English

Schweinberger, Martin (2015). A comparative study of the pragmatic marker like in Irish English and in south-eastern varieties of British English. Pragmatic markers in Irish English. (pp. 114-134) edited by Carolina P. Amador-Moreno, Kevin McCafferty and Elaine Vaughan. Amsterdam: John Benjamins Publishing Company. doi: 10.1075/pbns.258.05sch

A comparative study of the pragmatic marker like in Irish English and in south-eastern varieties of British English

2014

Journal Article

Studying the linguistic ecology of Singapore: a comparison of college and university students

Siemund, Peter, Schulz, Monika Edith and Schweinberger, Martin (2014). Studying the linguistic ecology of Singapore: a comparison of college and university students. World Englishes, 33 (3), 340-362. doi: 10.1111/weng.12094

Studying the linguistic ecology of Singapore: a comparison of college and university students

2014

Other Outputs

The discourse marker LIKE: a corpus-based analysis of selected varieties of English

Schweinberger, Martin (2014). The discourse marker LIKE: a corpus-based analysis of selected varieties of English. PhD Thesis, Institut für Anglistik und Amerikanistik, Universität Hamburg.

The discourse marker LIKE: a corpus-based analysis of selected varieties of English

2014

Book Chapter

Frequency, dispersion and register variation of selected discoursepragmatic particles in Singapore English

Schweinberger, Martin (2014). Frequency, dispersion and register variation of selected discoursepragmatic particles in Singapore English. Language contact around the globe. (pp. 259-273) edited by Amei Koll-Stobbe and Sebastian Knospe. Frankfurt, Germany: Peter Lang.

Frequency, dispersion and register variation of selected discoursepragmatic particles in Singapore English

2013

Book Chapter

A corpus-based study of language internal and language external determinants of non-standard uses of noun phrase structures - a primer

Schweinberger, Martin and Burmeister, Melanie (2013). A corpus-based study of language internal and language external determinants of non-standard uses of noun phrase structures - a primer. Korpuslinguistische Untersuchungen: Analysen einzelsprachlicher Phänomene. (pp. 131-149) edited by Anja Hennemann and Claudia Schlaak. Berlin, Germany: Frank & Timme.

A corpus-based study of language internal and language external determinants of non-standard uses of noun phrase structures - a primer

2013

Book Chapter

A sociolinguistic analysis of discourse marker LIKE in Northern Ireland. A look behind the scenes of quantitative reasoning

Schweinberger, Martin (2013). A sociolinguistic analysis of discourse marker LIKE in Northern Ireland. A look behind the scenes of quantitative reasoning. New approaches to the analysis of linguistic variability. (pp. 13-39) edited by Markus Bieswanger and Amei Koll-Stobbe. Frankfurt, Germany: Peter Lang.

A sociolinguistic analysis of discourse marker LIKE in Northern Ireland. A look behind the scenes of quantitative reasoning

2012

Book Chapter

The discourse marker LIKE in Irish English

Schweinberger, Martin (2012). The discourse marker LIKE in Irish English. New perspectives on Irish English. (pp. 179-202) Amsterdam, Netherlands: John Benjamins Publishing Company. doi: 10.1075/veaw.g44.09sch

The discourse marker LIKE in Irish English

2012

Conference Publication

How multilingual are Singaporeans really? A sociological and sociolinguistic analysis of the linguistic ecology of Singapore

Siemund, Peter, Schulz, Monika Edith and Schweinberger, Martin (2012). How multilingual are Singaporeans really? A sociological and sociolinguistic analysis of the linguistic ecology of Singapore. English in Asia's Languages Habitats and Europe's Asia Competence, Berlin, Germany, 9-12 May 2012.

How multilingual are Singaporeans really? A sociological and sociolinguistic analysis of the linguistic ecology of Singapore

Funding

Current funding

  • 2024 - 2028
    Language Data Commons of Australia (LDaCA-RDC)
    Australian Research Data Commons Limited
    Open grant

Past funding

  • 2021 - 2024
    Language Data Commons of Australia HASS RDC (LDaCA-RDC)
    ARDC - Australian Data Partnerships
    Open grant

Supervision

Availability

Dr Martin Schweinberger is:
Available for supervision

Looking for a supervisor? Read our advice on how to choose a supervisor.

Supervision history

Current supervision

  • Doctor Philosophy

    A multifactorial study of morpho-syntactic errors across different L1 backgrounds and language proficiency levels

    Principal Advisor

    Other advisors: Associate Professor Peter Crosthwaite

  • Doctor Philosophy

    Corpus-based investigation of three-minute thesis presentations: Register perspective

    Associate Advisor

    Other advisors: Associate Professor Peter Crosthwaite

  • Doctor Philosophy

    The Relationship Between Writing Tasks and Second Language Writers¿ Use of Metadiscourse

    Associate Advisor

    Other advisors: Associate Professor Peter Crosthwaite

  • Doctor Philosophy

    Integrating Artificial Intelligence and Machine Learning in TESOL: A Study on Personalised Learning and Impact on Student Engagement and Motivation in A Rural Indonesian University

    Associate Advisor

    Other advisors: Associate Professor Peter Crosthwaite

  • Doctor Philosophy

    Enhancing Lexical Resources for Argumentative Essay Writing through Corpus Integration

    Associate Advisor

    Other advisors: Associate Professor Peter Crosthwaite

Completed supervision

Media

Enquiries

Contact Dr Martin Schweinberger directly for media enquiries about their areas of expertise.

Need help?

For help with finding experts, story ideas and media enquiries, contact our Media team:

communications@uq.edu.au