Featured image

Announcement, notebook and glossary. All resources can also be found in my archive.


Note: this was a tutorial session co-presented with Christian Lang and Bruno Bitter. My section starts at 01:08.


At the AMTA 2024 Virtual Tutorial Day I talked about controlling invariant translation elements with RAG. During the talk several speakers intervened on the topic, each bringing a different perspective of it.

Georg Kirchner introduced the concept of invariant translation elements, such as brand names, UI elements, and corporate slogans. Christian Lang gave a comprehensive overview of the challenges of handling invariant translation elements with existing tools and how LLMs can help at various stages of the translation, covering several approaches, including RAG. Building on his overview, I showed how to implement a simple RAG system to handle these invariants properly using Haystack: we run a Colab notebook live and checked how the translation changes by introducing context about the invariants to the LLM making the translation. Last, Bruno Bitter gave an overview of how you can use Blackbird to integrate a system like this with existing CAT tools and manage the whole lifecycle of content translation.