Spivavtor: An Instruction Tuned Ukrainian Text Editing Model

Abstract

We introduce Spivavtor, a dataset, and instruction-tuned models for textediting focused on the Ukrainian language. Spivavtor is the Ukrainian-focusedadaptation of the English-only CoEdIT model. Similar to CoEdIT, Spivavtorperforms text editing tasks by following instructions in Ukrainian. This paperdescribes the details of the Spivavtor-Instruct dataset and Spivavtor models.We evaluate Spivavtor on a variety of text editing tasks in Ukrainian, such asGrammatical Error Correction (GEC), Text Simplification, Coherence, andParaphrasing, and demonstrate its superior performance on all of them. Wepublicly release our best-performing models and data as resources to thecommunity to advance further research in this space.

Quick Read (beta)

loading the full paper ...