Dr Ben Foley

Language Data Scientist

School of Languages and Cultures
Faculty of Humanities, Arts and Social Sciences

Language Data Scientist

School of Languages and Cultures
Faculty of Humanities, Arts and Social Sciences

Overview

Qualifications

  • Bachelor of Visual Communication Design, Queensland University of Technology

Publications

  • Foley, Benedict (2024). Developing useful and usable language technologies. PhD Thesis, School of Electrical Engineering and Computer Science, The University of Queensland. doi: 10.14264/a44d89b

  • San, Nay, Bartelds, Martijn, Billings, Blaine, de Falco, Ella, Feriza, Hendi, Safri, Johan, Sahrozi, Wawan, Foley, Ben, McDonnell, Bradley and Jurafsky, Dan (2023). Leveraging supplementary text data to kick-start automatic speech recognition system development with limited transcriptions. COMPUTEL 2023 - 6th Workshop on the Use of Computational Methods in the Study of Endangered Languages, Online, 5 - 6 March 2023. Stroudsburg, PA United States: Association for Computational Linguistics.

  • Maxwell-Smith, Zara and Foley, Ben (2023). Automated speech recognition of Indonesian-English language lessons on YouTube using transfer learning. Second Workshop on NLP Applications to Field Linguistics, Dubrovnik, Croatia, 2 - 6 May 2023. Stroudsburg, PA, United States: Association for Computational Linguistics. doi: 10.18653/v1/2023.fieldmatters-1.1

View all Publications

Publications

Book Chapter

  • Foley, Ben, van Esch, Daan and San, Nay (2022). Managing transcription data for automatic speech recognition with Elpis. The open handbook of linguistic data management. (pp. 437-446) edited by Andrea L. Berez-Kroeker, Bradley McDonnell, Eve Koller and Lauren B. Collister. Cambridge, MA, United States: The MIT Press. doi: 10.7551/mitpress/12200.003.0041

Conference Publication

  • San, Nay, Bartelds, Martijn, Billings, Blaine, de Falco, Ella, Feriza, Hendi, Safri, Johan, Sahrozi, Wawan, Foley, Ben, McDonnell, Bradley and Jurafsky, Dan (2023). Leveraging supplementary text data to kick-start automatic speech recognition system development with limited transcriptions. COMPUTEL 2023 - 6th Workshop on the Use of Computational Methods in the Study of Endangered Languages, Online, 5 - 6 March 2023. Stroudsburg, PA United States: Association for Computational Linguistics.

  • Maxwell-Smith, Zara and Foley, Ben (2023). Automated speech recognition of Indonesian-English language lessons on YouTube using transfer learning. Second Workshop on NLP Applications to Field Linguistics, Dubrovnik, Croatia, 2 - 6 May 2023. Stroudsburg, PA, United States: Association for Computational Linguistics. doi: 10.18653/v1/2023.fieldmatters-1.1

  • Wisniewski, Guillaume, Macaire, Cécile, Galliot, Benjamin, Adams, Oliver, Lambourne, Nicholas, Foley, Ben, Wiles, Janet, Michaud, Alexis, Guillaume, Séverine and Jacques, Guillaume (2023). Natural language processing for language documentation: a progress report for Japhug and Na. Sixth Workshop on Sino-Tibetan Languages of Southwest China 2021, Kobe, Japan, 7 - 11 September 2021.

  • Adams, Oliver, Galliot, Benjamin, Wisniewski, Guillaume, Lambourne, Nicholas, Foley, Ben, Sanders-Dwyer, Rahasya, Wiles, Janet, Michaud, Alexis, Guillaume, Séverine, Besacier, Laurent, Cox, Christopher, Aplonova, Katya, Jacques, Guillaume and Hill, Nathan (2021). User-friendly automatic transcription of low-resource languages: plugging ESPnet into Elpis. 4th Workshop on the Use of Computational Methods in the Study of Endangered Languages, Online, 2-3 March 2021. Stroudsburg, PA USA: Association for Computational Linguistics.

  • Maxwelll-Smith, Zara and Foley, Ben (2021). Developing ASR for Indonesian-English Bilingual Language Teaching. Fifth Workshop on Computational Approaches to Linguistic Code-Switching, Virtual, 11 June 2021. Stroudsburg, PA USA: Association for Computational Linguistics. doi: 10.18653/v1/2021.calcs-1.17

  • Maxwell-Smith, Zara, Foley, Ben, Ochoa, Simon Gonzalez and Suominen, Hanna (2020). Applications of natural language processing in bilingual language teaching: an Indonesian-English case study. 15th Workshop on Innovative Use of NLP for Building Educational Applications, Online, 10 July 2020. Stroudsburg, PA, United States: Association for Computational Linguistics . doi: 10.18653/v1/2020.bea-1.12

  • Buckeridge, Nicholas and Foley, Ben (2020). Scaling language data import/export with a data transformer interface. 1st Joint Workshop on Spoken Language Technologies for Under-resourced languages (SLTU) and Collaboration and Computing for Under-Resourced Languages (CCURL), Marseille, France, 11-16 May 2020. Paris, France: European Language Resources association.

  • Foley, Ben, van Esch, Daan and San, Nay (2019). Transcription acceleration for language documentation with ELPIS. 6th International Conference on Language Documentation and Conservation (ICLDC), Honolulu, HA USA, 28February-3 March 2019.

  • Esch, Daan van, Foley, Ben and San, Nay (2019). Future directions in technological support for language documentation. 3rd Workshop on Computational Methods for Endangered Languages, Honolulu, HA USA, 26-27 February 2019.

  • Foley, Ben, Rakhi, Alina, Lambourne, Nicholas, Buckeridge, Nicholas and Wiles, Janet (2019). Elpis, an accessible speech-to-text tool. INTERSPEECH 2019: Show & Tell, Graz, Austria, 15-19 September 2019. Baxias, France: International Speech Communication Association. doi: 10.21437/Interspeech.2019-8006

  • Foley, Ben, Arnold, Josh, Coto-Solano, Rolando, Durantin, Gautier, Ellison, T. Mark, van Esch, Daan, Heath, Scott, Kratochvíl, František, Maxwell-Smith, Zara, Nash, David, Olsson, Ola, Richards, Mark, San, Nay, Stoakes, Hywel, Thieberger, Nick and Wiles, Janet (2018). Building speech recognition systems for language documentation: the CoEDL Endangered Language Pipeline and Inference System (ELPIS). SLTU 2018: 6th Workshop on Spoken Language Technologies for Under-resourced Languages, Gurugram, India, 29-31 August 2018. Baxias, France: International Speech Communication Association. doi: 10.21437/sltu.2018-43

  • Green, Jennifer, Woods, Gail and Foley, Ben (2011). Looking at language: appropriate design for sign language resources in remote Australian Indigenous communities. Sustainable data from digital research: Humanities perspectives on digital scholarship, The University of Melbourne, VIC Australia, 12-14th December 2011.

Other Outputs