AI Translation & Localization

AI Translation Testing Methodology

How we evaluate machine translation, localization, and multilingual content tools.

← Back to Methodology Hub

The 100-Point Scoring Framework

We test translation tools with standardized texts in 20 language pairs, measuring accuracy with professional translator reviews and BLEU scores.

Translation Quality
35 pts
Pricing
25 pts
Features
20 pts
Platform & UX
20 pts

Our Testing Process

01

Translation Tests

Standardized texts in 20 language pairs.

02

Expert Review

Professional translators rate accuracy and fluency.

03

Feature Audit

Test glossaries, TM, and localization workflows.

04

Scoring

BLEU scores and expert ratings published.

1. Translation Quality

35 points max

Accuracy, fluency, and language coverage.

8
Accuracy (BLEU Score)
Machine translation quality benchmarked with BLEU.
7
Fluency
Natural-sounding output rated by native speakers.
6
Language Pairs
Number of supported languages (100+ scores highest).
5
Context Awareness
Handling of context, idioms, and domain terminology.
5
Document Translation
PDF, DOCX, and formatted document translation quality.
4
Specialized Domains
Legal, medical, technical translation accuracy.

2. Pricing

25 points max

Cost per word and volume pricing.

7
Free Tier
Free characters/words per month.
6
Cost per Word
Price per 1,000 words on paid plans.
5
API Pricing
Developer API pricing per million characters.
4
Volume Discounts
Enterprise and high-volume pricing.
3
Team Plans
Multi-user access with glossary sharing.

3. Features

20 points max

Advanced translation and localization features.

5
Glossary / TM
Custom glossaries and translation memory.
4
Website Translation
Full website translation and localization.
4
Tone / Formality
Formal/informal toggle and tone control.
4
File Formats
Support for XLIFF, JSON, PO, and CMS formats.
3
Real-Time Translation
Live translation for chat and communication.

4. Platform & Integration

20 points max

API, integrations, and collaboration.

5
API Quality
REST API documentation and SDK support.
4
CMS Integration
WordPress, Shopify, and headless CMS plugins.
4
CAT Tool Integration
memoQ, Trados, and CAT tool compatibility.
4
Collaboration
Team workflows, review, and approval processes.
3
Web & Mobile
Browser extension and mobile app quality.

Score Grading Scale

Score RangeGradeInterpretation
85 – 100ExcellentBest-in-class. Industry leader in this category.
70 – 84GoodStrong performer for most use cases, minor gaps.
55 – 69SatisfactoryAcceptable but falls behind leaders. Consider alternatives.
0 – 54Needs ImprovementSignificant limitations. Compare alternatives carefully.

Independence & Transparency

Expert-reviewed: Professional translators evaluate all outputs.

No sponsored rankings: Scores are independent.

Bi-annual updates: Re-tested when major model updates ship.

Last methodology update: March 2026