Researching the next generation of multilingual AI

Okalai is expanding the map, one language at a time. Originally launched in 2021 through a small series of AI events, Okalai AI became an independent research institute in 2025, focused on advancing foundational methods in multilingual NLP—where even the most underrepresented languages are treated as first-class citizens.
Our work spans over 30 global languages, with a focus on language tech that goes beyond high-resource settings.

Research

We bring underserved languages into multilingual NLP. Through original datasets, tools, and models, we expand the reach of language technology while contributing to the broader development of multilingual systems.

Education

We create educational materials for both people and machines. This includes grammars, dictionaries, and annotated corpora: resources that support language learning, data creation, and the training of language models.

Impact

We build tools that are useful across languages and regions. By working with real-world data, our projects support practical applications, from communication to job matching, in places currently not covered by mainstream language tech.