Àrẹ̀mú Anuoluwapọ
Home
Experience
Skills & Interests
Gallery
RESOURCES
Publications
Medium
Contact
Exploring language, AI, and multilingual NLP through research
Which Nigerian-Pidgin does Generative AI Speak?: Issues about Representativeness and Bias for Multilingual and Low Resource Languages. North American Chapter of the Association for Computational Linguistics
2025
InkubaLM: A small Language Model for Low-Resource African Languages
2024
Voices Unheard: NLP Resources and Models for Yoruba Regional Dialects. Empirical Methods of Natural Language Processing
AfriMTE and AfriCOMET: Enhance COMET to Embrace Under-Represented African Languages. Association of Computational Linguistics
NaijaRC: A Multi-Choice Reading Comprehension Dataset for Nigerian Languages. AfricaNLP Workshop at International Conference on Learning Representations
Ìròyìnspeech: A Multi-Purpose Yorùbá Speech Corpus. Joint International Conference on Computational Linguistics, Language Resources and Evaluation
(LREC-COLING 2024)
YORC: Yoruba Reading Comprehension Dataset. Widening NLP Workshop at Empirical Methods of Natural Language Processing
2023
Multi-Lingual and Multi-Cultural Figurative Language Understanding. Association of Computational Linguistics
MasakhaPOS: Part-of-Speech Tagging for Typologically Diverse African Languages. Association of Computational Linguistics
AfriWOZ: Corpus for Exploiting Cross-Lingual Transferability for Generation of Dialogue in Low-Resource, African Languages. International Joint Conference on Neural Networks
MasakhaNEWS: News Topic Classification for African Languages. AfricaNLP Workshop at International Conference on Learning Representations
AfriQA: Cross-Lingual Open-Retrieval Question Answering for African Languages
MasakhaNER 2.0: Africa-centric Transfer Learning for Named Entity Recognition. Empirical Methods in Natural Language Processing
2022
NaijaSenti: A Nigerian Twitter Sentiment Corpus for Multilingual Sentiment Analysis. The International Conference of Language Resources and Evaluation
A Few Thousand Translations Go a Long Way! Leveraging Pre-trained Models for African News Translation. North American Association of Computational Linguistics
Itakuroso: Exploiting Cross-Lingual Transferability for Natural Language Generation of Dialogues in Low-Resource, African Languages. International Conference of Learning Representation
MasakhaNER: Named Entity Recognition for African Languages. Transactions of the Association for Computational Linguistics 9: 1116–1131
2021
The GEM Benchmark: Natural Language Generation, its Evaluation and Metrics. Association for Computational Linguistics
The Morphology Analysis of Yoruba Personal and Yoruba Praise Names - B.A. Project
Lost in translation: Why Google Translate often gets Yoruba — and other languages — wrong Global Voices
2020
Yoruba Loan Words: How Language Evolves Global Voices