paulamanzur

PREDICT

PREDICT Methodology

A strategic MT risk methodology for effective localization solutions.

Table of Contents
  1. About The Methodology
  2. Getting Started
  3. Contents
  4. Dynamic Usage
  5. Contributing
  6. License
  7. Contact
  8. FAQ

About The Methodology

Enterprises in the localization sector handle diverse content types, requiring precise localization solutions. Options range from raw machine translation to transcreation. But how can they ensure the best match between content and localization method? Traditionally, the decision relied mostly on human judgment.

The PREDICT Methodology offers a systematic approach for assessing MT suitability, aligning content type with the optimal localization solution. By integrating risk tolerance weights into binary queries about a source content and use case, PREDICT provides a score and recommended solution, from raw MT to human-only translation.

This approach enables our business to provide the right quality for that specific content type, boost translation efficiency and reduce costs. Looking ahead, the methodology envisions integrating LLMs for automation and guidance, utilizing prompts to identify risk-mitigating strategies.

This case study, a contribution from Booking.com’s localization team, has been adapted and shared as open-source material in the AMTA 2024 proceedings. With a slightly modified version available, it aims to provide value for both suppliers and buyers within the localization industry.

(Back to top)

Getting Started

Prerequisites

Any Spreadsheet application capable of load a .xlsx file (MS Excel, LibreOffice, Google Sheets, etc)

Installation

To get started you can either:

or

(Back to top)

Contents

The Sheet includes the following tabs:

General examples have been provided to guide evaluators to the most objective answer and specific client examples can be added.

Field of Expertise refers to any Domains and Subdomains used by customers (not to Content Types), since different content types can belong to the same domain. For example, Marketing is the Domain and Marketing Editorial is the Content type).

It also contains several use cases:

(Back to top)

Dynamic Usage

Parameters, such as the questions, examples, domains and associated risk weights can be adjusted dynamically according to different Customer needs. Evaluators need to be familiarized with the source content text (examples) and the Criteria Definitions before starting answering the questions.

To use the Excel file:

(Back to top)

Contributing

For major changes, please open an issue first to discuss what you would like to change or contact me via the Contact information below.

(Back to top)

License

This Excel file is licensed under the Creative Commons Attribution-NonCommercial 4.0 International License.

License: CC BY-NC 4.0

(Back to top)

Contact

Paula Manzur: LinkedIn

Project Link: https://github.com/paulamanzur/PREDICT-Methodology

(Back to top)

FAQ

Why was this methodology developed?

The creation of this methodology was part of a strategic localization initiative at Booking.com. It introduced a more robust, detailed, and systematic approach for the objective evaluation of diverse content types using multiple evaluators.

Can I adopt it as part of my Localization Program?

Yes, with non-commercial purposes, the Methodology aims to facilitate informed decision-making, aiding stakeholders in the localization industry in determining the optimal localization solution based on MT risk assessment.

For more context, the localization & MT industry are assessing MT suitability in different ways, with no unified system in place. With the PREDICT Methodology, we’ve made a simple yet important attempt at a more robust, detailed, and systematic approach to MT eligibility criteria. This is not the final answer but a starting point to reopen the conversation within the MT community about the need for common best practices that empower people to make informed, data-driven decisions.

Evaluators actively provide their insight by answering the questionnaire. The advised number is 3 or 5 (preferably an odd number). The more evaluators are involved in the methodology, the more objective the evaluation process is. Stakeholders and language professionals who are familiar with the content can participate in the questionnaire. Exploratory evaluations for new content types and Periodic ones for validation (recommendation is once a year) depending on the business context.

Can I use LLMs as Evaluators?

The Methodology was created with a human-centered approach. However, some experiments have been done with ChatGPT as an evaluator. Preliminary conclusions show that it requires several fine-tuning iterations for the prompts. For now, human evaluators are more reliable. If LLMs are used as evaluators, prompts need to be created from scratch and tested. There are no suggested prompts in the Methodology yet.

How can I validate PREDICT’s recommendations?

If the methodology recommends Full MTPE for one content type and A/B testing is used to compare this solution versus RAW MT, for example, this can help validate or reject the prediction from a customer/business perspective.

(Back to top)