AI for underrepresented settings.
Applied AI for underrepresented settings. We research and build AI systems for communities and languages underserved by today's state-of-the-art models.
Foundational research that defines the principles and methods behind our systems.
Language models and other machine learning models for underrepresented languages and settings.
Data and interactive tools built on top of our models and datasets, for learning, exploration, and practical use.
End-user systems that apply our research to real-world problems.
Research
Foundational research on multilingual and low-resource NLP. Selected publications:
Grammar as Control: Modular Language Generation for the Long Tail
@inproceedings{nakashole2026grammar,
title={Grammar as Control: Modular Language Generation for the Long Tail},
author={Nakashole, Ndapa},
booktitle={Proceedings of ACL},
year={2026}
}
Sentiment Analysis and Language Models for Oshikwanyama
1B, 3B, and 8B parameter LLMs for Oshikwanyama
@inproceedings{nakashole2026oshikwanyama,
title={Sentiment Analysis and Language Models for Oshikwanyama},
author={Nakashole, Ndapa},
booktitle={Proceedings of LREC},
year={2026}
}
Models
Language models and other machine learning models for underrepresented languages and settings. One example from our current work:
OkaLM
Kwanyama Language Models
OkaLM is the first family of publicly available large language models for Kwanyama. Available in three sizes (1B, 3B, 8B parameters) to suit different use cases, from lightweight applications to more capable generation.
๐ค View on Hugging Face →Tools and Data
Data and interactive tools built on top of our models and datasets. One example from our current work:
OkaLex
Kwanyama Language Reference and Learning Platform
OkaLex is a Kwanyama language reference and interactive learning platform. It features a bilingual dictionary with translations, definitions, parts of speech, and example sentences.
The platform includes interactive quizzes, flashcards, and a word-matching game for vocabulary practice, plus nearly 50 grammar modules for Kwanyama learners.
For schools, linguists, and anyone exploring Kwanyama.
Visit OkaLex →Try it โ search a word
Applications
End-user systems that apply our research to real-world problems. Coming soon.
Who we are
Okalai AI was founded by Ndapa Nakashole, who serves as Chief Scientist. Ndapa is an Associate Professor of Computer Science at the University of California, San Diego (UCSD). Her research focuses on Natural Language Processing (NLP), and Artificial Intelligence (AI) more broadly.