Speakers

Meet the global voices that are shaping SINFO's excellence.

Diana Abagyan

Member of Technical Staff, Cohere

Diana Abagyan is a Member of Technical Staff at Cohere, specializing in pretraining data for large language models. Her expertise lies in machine learning and natural language processing, with a focus on efficient, multilingual systems and low-resource languages. She led the pretraining and tokenization efforts for the Tiny Aya project and has released work on efficient multilingual tokenization techniques. Diana strives to improve the accessibility and performance of language technologies across diverse linguistic settings.

Sessions

23 Apr 2026
16:30 - 17:20
Main Stage

Tiny Aya: Massively multilingual and globally accessible language models

Tiny Aya redefines what a small multilingual language model can achieve. Trained on 70 languages and refined through region-aware posttraining, it delivers state-of-the-art in translation quality, strong multilingual understanding, and high-quality target-language generation, all with just 3.35B parameters. The release includes a pretrained foundation model, a globally balanced instruction-tuned variant, and three region-specialized models targeting languages from Africa, South Asia, Europe, Asia-Pacific, and West Asia. This talk will cover the training strategy, data composition, and comprehensive evaluation framework behind Tiny Aya, and presents an alternative scaling path for multilingual AI: one centered on efficiency, balanced performance across languages, and practical deployment.

23 Apr 2026
17:20 - 17:35
Connect Stage

Q&A

Join Diana at the Connect Stage for an informal session to network, change ideas, and discuss the themes presented at her keynote.

Diana Abagyan
Cohere logo