On-the-Fly Fusion of Large Language Models and Machine Translation

  • 2024-05-06 18:13:27
  • Hieu Hoang, Huda Khayrallah, Marcin Junczys-Dowmunt
  • 0

Abstract

We propose the on-the-fly ensembling of a machine translation model with anLLM, prompted on the same task and input. We perform experiments on 4 languagepairs (both directions) with varying data amounts. We find that a slightlyweaker-at-translation LLM can improve translations of a NMT model, andensembling with an LLM can produce better translations than ensembling twostronger MT models. We combine our method with various techniques from LLMprompting, such as in context learning and translation context.

 

Quick Read (beta)

loading the full paper ...