Mr Ben Foley

Language Data Scientist

School of Languages and Cultures
Faculty of Humanities, Arts and Social Sciences

Overview

Qualifications

  • Bachelor of Visual Communication Design, Queensland University of Technology

Publications

  • Foley, Benedict (2024). Developing useful and usable language technologies. PhD Thesis, School of Electrical Engineering and Computer Science, The University of Queensland. doi: 10.14264/a44d89b

  • San, Nay, Bartelds, Martijn, Billings, Blaine, de Falco, Ella, Feriza, Hendi, Safri, Johan, Sahrozi, Wawan, Foley, Ben, McDonnell, Bradley and Jurafsky, Dan (2023). Leveraging supplementary text data to kick-start automatic speech recognition system development with limited transcriptions. COMPUTEL 2023 - 6th Workshop on the Use of Computational Methods in the Study of Endangered Languages, Online, 5 - 6 March 2023. Stroudsburg, PA United States: Association for Computational Linguistics.

  • Maxwell-Smith, Zara and Foley, Ben (2023). Automated speech recognition of Indonesian-English language lessons on YouTube using transfer learning. Second Workshop on NLP Applications to Field Linguistics, Dubrovnik, Croatia, 2 - 6 May 2023. Stroudsburg, PA, United States: Association for Computational Linguistics. doi: 10.18653/v1/2023.fieldmatters-1.1

View all Publications

Publications

Book Chapter

  • Foley, Ben, van Esch, Daan and San, Nay (2022). Managing transcription data for automatic speech recognition with Elpis. The open handbook of linguistic data management. (pp. 437-446) edited by Andrea L. Berez-Kroeker, Bradley McDonnell, Eve Koller and Lauren B. Collister. Cambridge, MA, United States: The MIT Press. doi: 10.7551/mitpress/12200.003.0041

Conference Publication

  • San, Nay, Bartelds, Martijn, Billings, Blaine, de Falco, Ella, Feriza, Hendi, Safri, Johan, Sahrozi, Wawan, Foley, Ben, McDonnell, Bradley and Jurafsky, Dan (2023). Leveraging supplementary text data to kick-start automatic speech recognition system development with limited transcriptions. COMPUTEL 2023 - 6th Workshop on the Use of Computational Methods in the Study of Endangered Languages, Online, 5 - 6 March 2023. Stroudsburg, PA United States: Association for Computational Linguistics.

  • Maxwell-Smith, Zara and Foley, Ben (2023). Automated speech recognition of Indonesian-English language lessons on YouTube using transfer learning. Second Workshop on NLP Applications to Field Linguistics, Dubrovnik, Croatia, 2 - 6 May 2023. Stroudsburg, PA, United States: Association for Computational Linguistics. doi: 10.18653/v1/2023.fieldmatters-1.1

  • Wisniewski, Guillaume, Macaire, Cécile, Galliot, Benjamin, Adams, Oliver, Lambourne, Nicholas, Foley, Ben, Wiles, Janet, Michaud, Alexis, Guillaume, Séverine and Jacques, Guillaume (2023). Natural language processing for language documentation: a progress report for Japhug and Na. Sixth Workshop on Sino-Tibetan Languages of Southwest China 2021, Kobe, Japan, 7 - 11 September 2021.

  • Adams, Oliver, Galliot, Benjamin, Wisniewski, Guillaume, Lambourne, Nicholas, Foley, Ben, Sanders-Dwyer, Rahasya, Wiles, Janet, Michaud, Alexis, Guillaume, Séverine, Besacier, Laurent, Cox, Christopher, Aplonova, Katya, Jacques, Guillaume and Hill, Nathan (2021). User-friendly automatic transcription of low-resource languages: plugging ESPnet into Elpis. 4th Workshop on the Use of Computational Methods in the Study of Endangered Languages, Online, 2-3 March 2021. Stroudsburg, PA USA: Association for Computational Linguistics.

  • Maxwelll-Smith, Zara and Foley, Ben (2021). Developing ASR for Indonesian-English Bilingual Language Teaching. Fifth Workshop on Computational Approaches to Linguistic Code-Switching, Virtual, 11 June 2021. Stroudsburg, PA USA: Association for Computational Linguistics. doi: 10.18653/v1/2021.calcs-1.17

  • Maxwell-Smith, Zara, Foley, Ben, Ochoa, Simon Gonzalez and Suominen, Hanna (2020). Applications of natural language processing in bilingual language teaching: an Indonesian-English case study. 15th Workshop on Innovative Use of NLP for Building Educational Applications, Online, 10 July 2020. Stroudsburg, PA, United States: Association for Computational Linguistics . doi: 10.18653/v1/2020.bea-1.12

  • Buckeridge, Nicholas and Foley, Ben (2020). Scaling language data import/export with a data transformer interface. 1st Joint Workshop on Spoken Language Technologies for Under-resourced languages (SLTU) and Collaboration and Computing for Under-Resourced Languages (CCURL), Marseille, France, 11-16 May 2020. Paris, France: European Language Resources association.

  • Foley, Ben, van Esch, Daan and San, Nay (2019). Transcription acceleration for language documentation with ELPIS. 6th International Conference on Language Documentation and Conservation (ICLDC), Honolulu, HA USA, 28February-3 March 2019.

  • Esch, Daan van, Foley, Ben and San, Nay (2019). Future directions in technological support for language documentation. 3rd Workshop on Computational Methods for Endangered Languages, Honolulu, HA USA, 26-27 February 2019.

  • Foley, Ben, Rakhi, Alina, Lambourne, Nicholas, Buckeridge, Nicholas and Wiles, Janet (2019). Elpis, an accessible speech-to-text tool. INTERSPEECH 2019: Show & Tell, Graz, Austria, 15-19 September 2019. Baxias, France: International Speech Communication Association. doi: 10.21437/Interspeech.2019-8006

  • Foley, Ben, Arnold, Josh, Coto-Solano, Rolando, Durantin, Gautier, Ellison, T. Mark, van Esch, Daan, Heath, Scott, Kratochvíl, František, Maxwell-Smith, Zara, Nash, David, Olsson, Ola, Richards, Mark, San, Nay, Stoakes, Hywel, Thieberger, Nick and Wiles, Janet (2018). Building speech recognition systems for language documentation: the CoEDL Endangered Language Pipeline and Inference System (ELPIS). SLTU 2018: 6th Workshop on Spoken Language Technologies for Under-resourced Languages, Gurugram, India, 29-31 August 2018. Baxias, France: International Speech Communication Association. doi: 10.21437/sltu.2018-43

  • Green, Jennifer, Woods, Gail and Foley, Ben (2011). Looking at language: appropriate design for sign language resources in remote Australian Indigenous communities. Sustainable data from digital research: Humanities perspectives on digital scholarship, The University of Melbourne, VIC Australia, 12-14th December 2011.

Other Outputs