SlideShare a Scribd company logo
1 of 31
Download to read offline
State of the
Machine Translation
by Intento
November 2017
About
• At Intento, we want to make Machine Intelligence
services easy to discover, choose and use.
• So far, evaluation is the most problematic part: to
compare different services, one need to sign a lot of
contracts and integrate a lot of APIs.
• We deliver this overview report for FREE. To get the
full report or evaluate on you own dataset, contact us.
• Also, check out our Natural Language Understanding
Benchmark. NLU may help you to automate workflows
beyond the automated translation.
November 2017© Intento, Inc.
Overview
© Intento, Inc. November 2017
11
Machine Translation Services
35
Language Pairs
TRANSLATION QUALITY LANGUAGE COVERAGE
DEVELOPER EXPERIENCE MISCELLANEOUS
Get the full version of this report
November 2017© Intento, Inc.
Changes since July 2017
• +2 vendors: DeepL (beta), SAP (beta)
• +21 language pair
• Detailed performance analysis
• Developer experience comparison
Machine Translation
Services* Compared
* We have evaluated general purpose Cloud Machine Translation services with prebuilt translation models, provided via API. Some vendors also provide
web-based, on-premise or custom MT engines, which may differ on all aspects from what we’ve evaluated.
Baidu
Translate API
DeepL
API (beta)
Google Cloud
Translation API
GTCom
YeeCloud MT
IBM Watson
Language
Translator
Microsoft
Translator Text
API
PROMT
Cloud API
SAP Translation
Hub (beta)
SDL Cloud
Machine
Translation
Systran REST
Translation API
Yandex
Translate API
Translation Quality
November 2017© Intento, Inc.
Evaluation Methodology
Overall Performance
Available MT Quality
Price vs. Performance
Evaluation methodology (I)
• Translation quality is evaluated by computing LEPOR
score between reference translations and the MT output
(Slide 9).
• Currently, our goal is to evaluate performance of
translation between the most popular languages (Slide
10).
• We use public datasets from StatMT/WMT and
CASMACAT News Commentary (Slide 11).
• We have performed LEPOR metric convergence analysis
to identify minimal viable number of segments in the
dataset. See Slide 12 for some details.
Evaluation methodology (II)
• We consider MT service A more performant than B for the
language pair C if:
- mean LEPOR score of A is greater than LEPOR of B for
the pair C, and
- lower bound of the LEPOR 95% confidence interval of A
is greater than the upper bound of the LEPOR confidence
interval of B for the pair C. See Slide 12 for an example.
• Different language pairs (and different datasets) impose
different translation complexity. In order to compare
overall MT performance of different services we regularise
LEPOR scores across all language pairs (See Appendix A
for more details).
LEPOR score
• LEPOR: automatic machine translation evaluation metric
considering the enhanced Length Penalty, n-gram
Position difference Penalty and Recall
• In our evaluation, we used hLEPORA v.3.1:
• best metric from ACL-WMT 2013 contest
https://www.slideshare.net/AaronHanLiFeng/lepor-an-augmented-machine-translation-evaluation-metric-thesis-ppt
https://github.com/aaronlifenghan/aaron-project-lepor
LIKE BLEU,
BUT BETTER
Language Pairs
We focus on the
en-P1, P1-en and
P1-P1 (partially)
* https://w3techs.com/technologies/overview/content_language/all
Language groups by
web popularity*:
P1 - ≥ 2.0% websites
P2 - 0.5%-2% websites
P3 - 0.1-0.3% websites
P4 - <0.1% websites
en ru ja de es fr pt it zh cs tr fi ro
en ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓
ru ✓ ✓ ✓ ✓ ✓
ja ✓ ✓
de ✓ ✓
es ✓ ✓
fr ✓ ✓ ✓
pt ✓
it ✓ ✓
zh ✓ ✓
cs ✓
tr ✓
fi ✓
ro ✓
Datasets
• WMT-2013 (translation task, news domain)
- en-es, es-en
• WMT-2015 (translation task, news domain)
- fr-en, en-fr
• WMT-2016 (translation task, news domain)
- cs-en, en-cs, de-en, en-de, ro-en, en-ro, fi-en, en-fi, ru-en, en-ru, tr-en, en-tr
• WMT-2017 (translation task, news domain)
- zh-en, en-zh
• NewsCommentary-2011
- en-ja, ja-en, en-pt, pt-en, en-it, it-en, ru-de, ru-es, ru-fr, ru-pt, ja-fr, de-ja, es-zh, fr-ru,
fr-es, it-pt, zh-it
LEPOR Convergence
We used 1440 - 3000 sentences per language pair. In all cases it’s clear that
the metric stabilises and adding more from the same domain won’t change the
outcome.
number of sentences
regularisedhLEPORscores
Aggregated across all language pairs Examples for individual language pairs:
Aggre-
gated
mean
Confi-
dense
interval
Detailed data on each language pair provided in the full report
Overall Performance
35 language pairs, 1440-3000 sentences per pair
>70%
<40%
Variance
among
language
pairs
Detailed data on each language pair provided in the full report
Available MT Quality
en ru ja de es fr pt it zh cs tr fi ro
en 4 7 3 2 7 1 2 6 2 4 5 2
ru 2 3 3 4 4
ja 1 4
de 8 2
es 7 4
fr 6 1 8
pt 4
it 7 2
zh 6 4
cs 2
tr 2
fi 1
ro 4
70 %
60 %
50 %
40 %
30 %
Maximal Achieved
hLEPOR score:
No. of
top-performing
MT Providers
Minimal price
for this quality,
per 1M char:
$$$ ≥$20
$$ $10-15
$ <$10
$$
$
$
$$
$$
$
$
$$$
$$$
$ $$$
$
$
$$$
$$ $$ $
$$$
$
$
$$
$
$
$$
$
$$
$$ $
$$
$
$$$
$
$$
$
Detailed data on each language pair provided in the full report
Sample pair analysis: en-pt
LEPOR
score
Providers
Price range
(per 1M characters)
77 % Google $20
72 % Yandex, Microsoft $4.5-15
70 % Baidu, SDL, IBM $8-$21
62 % Systran, PROMT $3-$8
BEST QUALITY:
BEST PRICE:
PRICE&QUALITY:
Google
PROMT
Microsoft
ALL 35 PAIRS
AVAILABLE
IN THE FULL
REPORT
Price vs. Performance
AFFORDABILITY
PERFORMANCE
As of November 2017
COST-EFFECTIVE
ACCURATE
FREE
(BETA)
NOT
SET
YET
COST-EFFECTIVE
Performance
Regularized hLEPOR
score aggregated across
all language pairs in the
dataset
Affordability = 1/price
Using public volume-
based pricing tiers
Legend
• performance range:
- regularised average
- max across all pairs
- min across all pairs
• price range
Detailed data on each language pair provided in the full report
Language Coverage
November 2017© Intento, Inc.
Supported and Unique per Provider
Coverage by Language Popularity
Language coverage
Unique
language
pairs -
supported
exclusively by
one provider
1
100
10000
Google Yandex Microsoft Baidu Systran SDL MT PROMT SAP DeepLIBM WatsonGTCom
2
1
2
1
54
920
1 246
3 202
6
20
4247
88
104111
756
3 422
8 556
10 712
Total
Unique
Supported language pairs
© Intento, Inc. July 2017Detailed data on the supported languages is provided in the full report
Language popularity
Language groups by
web popularity*:
P1 - ≥ 2.0% websites
P2 - 0.5%-2% websites
P3 - 0.1-0.3% websites
P4 - <0.1% websites
* https://w3techs.com/technologies/overview/content_language/all
A total of
29070
pairs possible,
12989
are supported
across all providers
P1
en, ru, ja, de, es,
fr, pt, it, zh
P2
pl, fa, tr, nl, ko, cs, ar,
vi, el, sv in, ro, hu
P3
da, sk, fi, th, bg, he, lt, uk,
hr, no, nb, sr, ca, sl, lv, et
P4
hi, az, bs, ms, is, mk, bn, eu, ka, sq,
gl, mn, kk, hy, se, uz, kr, ur, ta, nn, af,
be, si, my, br, ne, sw, km, fil, ml, pa,
…
Language groups by
web popularity*:
P1 - ≥ 2.0% websites
P2 - 0.5%-2% websites
P3 - 0.1-0.3% websites
P4 - <0.1% websites
100% 94% 63%
88%
31%
P1 P2 P3 P4
P1
P2
P3
P4
59%
100%
100%
94%
63%
100% 94%
94%
63%
63% 59%
Supported language pairs by popularity
* up from 44% in July 2017 as we better distinguish variations of the Chinese language© Intento, Inc.
Language coverage
45%*
of possible
language pairs
July 2017
by service provider
Google Cloud
Translation API
Microsoft
Translator Text API
Yandex
Translate API
Systran
REST Translation
API
SDL Cloud
Machine Translation
PROMT
Cloud API
IBM Watson
Language
Translation
July 2017© Intento, Inc.
Language coverage
Baidu
Translate API
GTCom
YeeCloud API
DeepL
API
SAP
Translation Hub
Detailed data on the supported languages is provided in the full report
Developer Experience (DX)
November 2017© Intento, Inc.
Evaluation Methodology
DX Charts
Evaluation methodology (I)
Here we evaluate overall service organisation from the following
angles:
• Product - Support of Machine Translation features desired for using the API in various MT scenarios
• Design - Overall API design and technical convenience
• Documentation - How well the API is documented
• Onboarding - How easy is to integrate and start using the API
• Commercial - Flexibility of the commercial terms
• Implementation - Important low-level features of the API
• Maintenance - Convenience of getting information about the API changes for ongoing support
• Reliability - Various technical issues we’ve encountered
Some references:
• http://talks.kinlane.com/apistrat/api101/index.html#/14
• https://mathieu.fenniak.net/the-api-checklist/
• https://www.slideshare.net/jmusser/ten-reasons-developershateyourapi
• https://restfulapi.net/richardson-maturity-model/
• https://github.com/shieldfy/API-Security-Checklist
• https://nordicapis.com/why-api-developer-experience-matters-more-than-ever/
• http://www.drdobbs.com/windows/measuring-api-usability/184405654?pgno=1
Evaluation methodology (II)
• Translation domains
• Translation engines
• Language autodetect
• Glossaries
• TM Support
• Custom engines
• Bulk mode
• Formatted text
• XLIFF support
Product
Design
Documentation
Onboarding
Commercial
Implementation
Maintenance
Reliability
• Authentication
• Use of SSL
• Quota info
• Domain info
• Balance info
• Self-sufficient
• Intuitive
• Versioning
• Bulk mode
• Task-invocation
ratio
• I/O Structure
• List of endpoints
• User documentation
• Supported languages
• Quotas
• Response codes
• Error codes
• Error messages
• API explorer
• API console
• Number of docs
• HTML doc
• Self-registration
• Self-issued keys
• Self-payment
• Free / Trial plan
• Sandbox
• Test console
• Github repo
• Code libraries
• SDK / PDK
• Sample code
• Direct support
• Ticket system
• Self-support
• Tutorial
• FAQ / KB
• Starter package
• Public pricing
• Pay as you go
• Post-paid
• Volume discounts
• Payment systems
• Billing history
• API spec
• Data compression
• Supports JSON
• Negotiable content
• Unicode support
• Error codes
• Error messages
• News source
• Subscription news
• Versioning
• Changelog
• Release notes
• Roadmap
• Status dashboard
• Developer dashboard
• Exportable logs
• Uptime
• Sporadic errors
• Bugs
• Performance issues
• Status dashboard
• Outage alerts
by service provider
July 2017© Intento, Inc.
Developer experience
Available in the full report
Miscellaneous
November 2017© Intento, Inc.
API change frequency
API & Documentation changes since July 2017:
Change frequency
SDL Language Cloud 31
Google Translate API 14
IBM Watson Translator 10
Microsoft Translator API 7
SAP Translation Hub 6
July 2017© Intento, Inc.
Detailed version of this report
• We give this over view version for free.
• The full evaluation report contains:
- Detailed best-deal analysis for each of the 35 language
pairs
- Developer experience analysis for each of the 11 MT
providers
• Also, by ordering the full report you support our
ongoing evaluation of the Cloud MT
• To get the full report, reach us at hello@inten.to
Discover the best service providers for
your AI task
Evaluate performance on your own data
at a fraction of the potential cost
saving
Access any provider with no effort
using to our Single API
Intento Service Platform
July 2017© Intento, Inc.
Intento
https://inten.to
Konstantin Savenkov
CEO Intento, Inc.

<ks@inten.to>
Appendix A
July 2017© Intento, Inc.
Overall performance of the MT services across many language
pairs is computed in the following way:
1. [Standardisation] We compute mean language-standardised
LEPOR score (or z-score) for each provider.
2. [Scale adjustment] We restore the original scale by
multiplying z-score for each MT provider by the global LEPOR
standard deviation and adding the global mean LEPOR score.

More Related Content

What's hot

ChatGPT ppt.pptx
ChatGPT  ppt.pptxChatGPT  ppt.pptx
ChatGPT ppt.pptxYuvrajS9
 
Implications of GPT-3
Implications of GPT-3Implications of GPT-3
Implications of GPT-3Raven Jiang
 
Every Gartner hype cycle of emerging technologies since 2000
Every Gartner hype cycle of emerging technologies since 2000Every Gartner hype cycle of emerging technologies since 2000
Every Gartner hype cycle of emerging technologies since 2000Raymond Meester
 
OpenAI’s GPT 3 Language Model - guest Steve Omohundro
OpenAI’s GPT 3 Language Model - guest Steve OmohundroOpenAI’s GPT 3 Language Model - guest Steve Omohundro
OpenAI’s GPT 3 Language Model - guest Steve OmohundroNumenta
 
2012 Velocity London: DevOps Patterns Distilled
2012 Velocity London: DevOps Patterns Distilled2012 Velocity London: DevOps Patterns Distilled
2012 Velocity London: DevOps Patterns DistilledGene Kim
 
Natural language processing and transformer models
Natural language processing and transformer modelsNatural language processing and transformer models
Natural language processing and transformer modelsDing Li
 
Cloud native-apps-architectures
Cloud native-apps-architecturesCloud native-apps-architectures
Cloud native-apps-architecturesCapgemini
 
Smart Cities and IoT whitepaper
Smart Cities and IoT whitepaperSmart Cities and IoT whitepaper
Smart Cities and IoT whitepaperRichard Hatheway
 
Convolutional Neural Networks and Natural Language Processing
Convolutional Neural Networks and Natural Language ProcessingConvolutional Neural Networks and Natural Language Processing
Convolutional Neural Networks and Natural Language ProcessingThomas Delteil
 
Revolutionary-ChatGPT
Revolutionary-ChatGPTRevolutionary-ChatGPT
Revolutionary-ChatGPT9 series
 
IoT Standards: The Next Generation
IoT Standards: The Next GenerationIoT Standards: The Next Generation
IoT Standards: The Next GenerationReadWrite
 
Solve for X with AI: a VC view of the Machine Learning & AI landscape
Solve for X with AI: a VC view of the Machine Learning & AI landscapeSolve for X with AI: a VC view of the Machine Learning & AI landscape
Solve for X with AI: a VC view of the Machine Learning & AI landscapeEd Fernandez
 
Final cyber physical system (1)
Final cyber physical system (1)Final cyber physical system (1)
Final cyber physical system (1)vanisre jaiswal
 
Jawad's presentation on GPT.pptx
Jawad's presentation on GPT.pptxJawad's presentation on GPT.pptx
Jawad's presentation on GPT.pptxJawadNadeem3
 
Todd Jacobucci presents ChatGPT
Todd Jacobucci presents ChatGPT Todd Jacobucci presents ChatGPT
Todd Jacobucci presents ChatGPT Todd Jacobucci
 
Sentiwordnet [IIT-Bombay]
Sentiwordnet [IIT-Bombay]Sentiwordnet [IIT-Bombay]
Sentiwordnet [IIT-Bombay]Sagar Ahire
 

What's hot (20)

ChatGPT ppt.pptx
ChatGPT  ppt.pptxChatGPT  ppt.pptx
ChatGPT ppt.pptx
 
Implications of GPT-3
Implications of GPT-3Implications of GPT-3
Implications of GPT-3
 
Transformers
TransformersTransformers
Transformers
 
Every Gartner hype cycle of emerging technologies since 2000
Every Gartner hype cycle of emerging technologies since 2000Every Gartner hype cycle of emerging technologies since 2000
Every Gartner hype cycle of emerging technologies since 2000
 
OpenAI’s GPT 3 Language Model - guest Steve Omohundro
OpenAI’s GPT 3 Language Model - guest Steve OmohundroOpenAI’s GPT 3 Language Model - guest Steve Omohundro
OpenAI’s GPT 3 Language Model - guest Steve Omohundro
 
2012 Velocity London: DevOps Patterns Distilled
2012 Velocity London: DevOps Patterns Distilled2012 Velocity London: DevOps Patterns Distilled
2012 Velocity London: DevOps Patterns Distilled
 
Natural language processing and transformer models
Natural language processing and transformer modelsNatural language processing and transformer models
Natural language processing and transformer models
 
Cloud native-apps-architectures
Cloud native-apps-architecturesCloud native-apps-architectures
Cloud native-apps-architectures
 
Smart Cities and IoT whitepaper
Smart Cities and IoT whitepaperSmart Cities and IoT whitepaper
Smart Cities and IoT whitepaper
 
Convolutional Neural Networks and Natural Language Processing
Convolutional Neural Networks and Natural Language ProcessingConvolutional Neural Networks and Natural Language Processing
Convolutional Neural Networks and Natural Language Processing
 
Revolutionary-ChatGPT
Revolutionary-ChatGPTRevolutionary-ChatGPT
Revolutionary-ChatGPT
 
IoT Standards: The Next Generation
IoT Standards: The Next GenerationIoT Standards: The Next Generation
IoT Standards: The Next Generation
 
Introduction to ChatGPT
Introduction to ChatGPTIntroduction to ChatGPT
Introduction to ChatGPT
 
Solve for X with AI: a VC view of the Machine Learning & AI landscape
Solve for X with AI: a VC view of the Machine Learning & AI landscapeSolve for X with AI: a VC view of the Machine Learning & AI landscape
Solve for X with AI: a VC view of the Machine Learning & AI landscape
 
Final cyber physical system (1)
Final cyber physical system (1)Final cyber physical system (1)
Final cyber physical system (1)
 
Nlp ambiguity presentation
Nlp ambiguity presentationNlp ambiguity presentation
Nlp ambiguity presentation
 
Jawad's presentation on GPT.pptx
Jawad's presentation on GPT.pptxJawad's presentation on GPT.pptx
Jawad's presentation on GPT.pptx
 
Todd Jacobucci presents ChatGPT
Todd Jacobucci presents ChatGPT Todd Jacobucci presents ChatGPT
Todd Jacobucci presents ChatGPT
 
Sentiwordnet [IIT-Bombay]
Sentiwordnet [IIT-Bombay]Sentiwordnet [IIT-Bombay]
Sentiwordnet [IIT-Bombay]
 
Cyber-Physical Systems
Cyber-Physical SystemsCyber-Physical Systems
Cyber-Physical Systems
 

Similar to State of the Machine Translation by Intento (November 2017)

State of the Machine Translation by Intento (March 2018)
State of the Machine Translation by Intento (March 2018)State of the Machine Translation by Intento (March 2018)
State of the Machine Translation by Intento (March 2018)Konstantin Savenkov
 
State of the Machine Translation by Intento (July 2018)
State of the Machine Translation by Intento (July 2018)State of the Machine Translation by Intento (July 2018)
State of the Machine Translation by Intento (July 2018)Konstantin Savenkov
 
Intento Machine Translation Benchmark, July 2017
Intento Machine Translation Benchmark, July 2017Intento Machine Translation Benchmark, July 2017
Intento Machine Translation Benchmark, July 2017Konstantin Savenkov
 
Topic 4: The Magician's Hat: Turning Data into Business Intelligence (3)
Topic 4: The Magician's Hat: Turning Data into Business Intelligence (3)Topic 4: The Magician's Hat: Turning Data into Business Intelligence (3)
Topic 4: The Magician's Hat: Turning Data into Business Intelligence (3)TAUS - The Language Data Network
 
State of the Machine Translation by Intento (stock engines, Jan 2019)
State of the Machine Translation by Intento (stock engines, Jan 2019)State of the Machine Translation by Intento (stock engines, Jan 2019)
State of the Machine Translation by Intento (stock engines, Jan 2019)Konstantin Savenkov
 
EVALUATION IN USE: NAVIGATING THE MT ENGINE LANDSCAPE WITH THE INTENTO EVALUA...
EVALUATION IN USE: NAVIGATING THE MT ENGINE LANDSCAPE WITH THE INTENTO EVALUA...EVALUATION IN USE: NAVIGATING THE MT ENGINE LANDSCAPE WITH THE INTENTO EVALUA...
EVALUATION IN USE: NAVIGATING THE MT ENGINE LANDSCAPE WITH THE INTENTO EVALUA...Konstantin Savenkov
 
CAT or TMS Implementation: Calculation of the Number of Licenses and the Tota...
CAT or TMS Implementation: Calculation of the Number of Licenses and the Tota...CAT or TMS Implementation: Calculation of the Number of Licenses and the Tota...
CAT or TMS Implementation: Calculation of the Number of Licenses and the Tota...ABBYY Language Serivces
 
TAUS Roundtable Moscow, CAT or TMS Implementation-Calculation of the Number o...
TAUS Roundtable Moscow, CAT or TMS Implementation-Calculation of the Number o...TAUS Roundtable Moscow, CAT or TMS Implementation-Calculation of the Number o...
TAUS Roundtable Moscow, CAT or TMS Implementation-Calculation of the Number o...TAUS - The Language Data Network
 
Tr8n for php - Michael Berkovich
Tr8n for php - Michael BerkovichTr8n for php - Michael Berkovich
Tr8n for php - Michael BerkovichCaroline_Rose
 
Welocalize Throughputs and Post-Editing Productivity Webinar Laura Casanellas
Welocalize Throughputs and Post-Editing Productivity Webinar Laura CasanellasWelocalize Throughputs and Post-Editing Productivity Webinar Laura Casanellas
Welocalize Throughputs and Post-Editing Productivity Webinar Laura CasanellasWelocalize
 
SDL BeGlobal The SDL Platform for Automated Translation
SDL BeGlobal The SDL Platform for Automated TranslationSDL BeGlobal The SDL Platform for Automated Translation
SDL BeGlobal The SDL Platform for Automated TranslationSDL Trados
 
User Empowered Machine Translation. Dion Wiggins, Asia Online
User Empowered Machine Translation. Dion Wiggins, Asia OnlineUser Empowered Machine Translation. Dion Wiggins, Asia Online
User Empowered Machine Translation. Dion Wiggins, Asia OnlineABBYY Language Serivces
 
Maximising Machine Translation Return on Investment (KantanMT/Medialocate)
Maximising Machine Translation Return on Investment (KantanMT/Medialocate)Maximising Machine Translation Return on Investment (KantanMT/Medialocate)
Maximising Machine Translation Return on Investment (KantanMT/Medialocate)kantanmt
 
What machine translation developers are doing to make post-editors happy
What machine translation developers are doing to make post-editors happyWhat machine translation developers are doing to make post-editors happy
What machine translation developers are doing to make post-editors happyIconic Translation Machines
 
State of the Domain-Adaptive Machine Translation by Intento (November 2018)
State of the Domain-Adaptive Machine Translation by Intento (November 2018)State of the Domain-Adaptive Machine Translation by Intento (November 2018)
State of the Domain-Adaptive Machine Translation by Intento (November 2018)Konstantin Savenkov
 
DevOps and 5G cloud native solutions supported by Computaris automated testin...
DevOps and 5G cloud native solutions supported by Computaris automated testin...DevOps and 5G cloud native solutions supported by Computaris automated testin...
DevOps and 5G cloud native solutions supported by Computaris automated testin...Computaris
 
11. manuel leiva & juanjo arevalillo (hermes) evaluation of machine translation
11. manuel leiva & juanjo arevalillo (hermes) evaluation of machine translation11. manuel leiva & juanjo arevalillo (hermes) evaluation of machine translation
11. manuel leiva & juanjo arevalillo (hermes) evaluation of machine translationRIILP
 
An MT Journey Intuit and Welocalize Localization World 2013
An MT Journey Intuit and Welocalize Localization World 2013An MT Journey Intuit and Welocalize Localization World 2013
An MT Journey Intuit and Welocalize Localization World 2013Welocalize
 
iMT Language Solutions
iMT Language SolutionsiMT Language Solutions
iMT Language SolutionsSDL
 

Similar to State of the Machine Translation by Intento (November 2017) (20)

State of the Machine Translation by Intento (March 2018)
State of the Machine Translation by Intento (March 2018)State of the Machine Translation by Intento (March 2018)
State of the Machine Translation by Intento (March 2018)
 
State of the Machine Translation by Intento (July 2018)
State of the Machine Translation by Intento (July 2018)State of the Machine Translation by Intento (July 2018)
State of the Machine Translation by Intento (July 2018)
 
Intento Machine Translation Benchmark, July 2017
Intento Machine Translation Benchmark, July 2017Intento Machine Translation Benchmark, July 2017
Intento Machine Translation Benchmark, July 2017
 
Topic 4: The Magician's Hat: Turning Data into Business Intelligence (3)
Topic 4: The Magician's Hat: Turning Data into Business Intelligence (3)Topic 4: The Magician's Hat: Turning Data into Business Intelligence (3)
Topic 4: The Magician's Hat: Turning Data into Business Intelligence (3)
 
State of the Machine Translation by Intento (stock engines, Jan 2019)
State of the Machine Translation by Intento (stock engines, Jan 2019)State of the Machine Translation by Intento (stock engines, Jan 2019)
State of the Machine Translation by Intento (stock engines, Jan 2019)
 
EVALUATION IN USE: NAVIGATING THE MT ENGINE LANDSCAPE WITH THE INTENTO EVALUA...
EVALUATION IN USE: NAVIGATING THE MT ENGINE LANDSCAPE WITH THE INTENTO EVALUA...EVALUATION IN USE: NAVIGATING THE MT ENGINE LANDSCAPE WITH THE INTENTO EVALUA...
EVALUATION IN USE: NAVIGATING THE MT ENGINE LANDSCAPE WITH THE INTENTO EVALUA...
 
CAT or TMS Implementation: Calculation of the Number of Licenses and the Tota...
CAT or TMS Implementation: Calculation of the Number of Licenses and the Tota...CAT or TMS Implementation: Calculation of the Number of Licenses and the Tota...
CAT or TMS Implementation: Calculation of the Number of Licenses and the Tota...
 
TAUS Roundtable Moscow, CAT or TMS Implementation-Calculation of the Number o...
TAUS Roundtable Moscow, CAT or TMS Implementation-Calculation of the Number o...TAUS Roundtable Moscow, CAT or TMS Implementation-Calculation of the Number o...
TAUS Roundtable Moscow, CAT or TMS Implementation-Calculation of the Number o...
 
Tr8n for php - Michael Berkovich
Tr8n for php - Michael BerkovichTr8n for php - Michael Berkovich
Tr8n for php - Michael Berkovich
 
Welocalize Throughputs and Post-Editing Productivity Webinar Laura Casanellas
Welocalize Throughputs and Post-Editing Productivity Webinar Laura CasanellasWelocalize Throughputs and Post-Editing Productivity Webinar Laura Casanellas
Welocalize Throughputs and Post-Editing Productivity Webinar Laura Casanellas
 
SDL BeGlobal The SDL Platform for Automated Translation
SDL BeGlobal The SDL Platform for Automated TranslationSDL BeGlobal The SDL Platform for Automated Translation
SDL BeGlobal The SDL Platform for Automated Translation
 
User Empowered Machine Translation. Dion Wiggins, Asia Online
User Empowered Machine Translation. Dion Wiggins, Asia OnlineUser Empowered Machine Translation. Dion Wiggins, Asia Online
User Empowered Machine Translation. Dion Wiggins, Asia Online
 
Maximising Machine Translation Return on Investment (KantanMT/Medialocate)
Maximising Machine Translation Return on Investment (KantanMT/Medialocate)Maximising Machine Translation Return on Investment (KantanMT/Medialocate)
Maximising Machine Translation Return on Investment (KantanMT/Medialocate)
 
What machine translation developers are doing to make post-editors happy
What machine translation developers are doing to make post-editors happyWhat machine translation developers are doing to make post-editors happy
What machine translation developers are doing to make post-editors happy
 
State of the Domain-Adaptive Machine Translation by Intento (November 2018)
State of the Domain-Adaptive Machine Translation by Intento (November 2018)State of the Domain-Adaptive Machine Translation by Intento (November 2018)
State of the Domain-Adaptive Machine Translation by Intento (November 2018)
 
Intento Enterprise MT Hub
Intento Enterprise MT HubIntento Enterprise MT Hub
Intento Enterprise MT Hub
 
DevOps and 5G cloud native solutions supported by Computaris automated testin...
DevOps and 5G cloud native solutions supported by Computaris automated testin...DevOps and 5G cloud native solutions supported by Computaris automated testin...
DevOps and 5G cloud native solutions supported by Computaris automated testin...
 
11. manuel leiva & juanjo arevalillo (hermes) evaluation of machine translation
11. manuel leiva & juanjo arevalillo (hermes) evaluation of machine translation11. manuel leiva & juanjo arevalillo (hermes) evaluation of machine translation
11. manuel leiva & juanjo arevalillo (hermes) evaluation of machine translation
 
An MT Journey Intuit and Welocalize Localization World 2013
An MT Journey Intuit and Welocalize Localization World 2013An MT Journey Intuit and Welocalize Localization World 2013
An MT Journey Intuit and Welocalize Localization World 2013
 
iMT Language Solutions
iMT Language SolutionsiMT Language Solutions
iMT Language Solutions
 

More from Konstantin Savenkov

GPT and other Text Transformers: Black Swans and Stochastic Parrots
GPT and other Text Transformers:  Black Swans and Stochastic ParrotsGPT and other Text Transformers:  Black Swans and Stochastic Parrots
GPT and other Text Transformers: Black Swans and Stochastic ParrotsKonstantin Savenkov
 
Dodging AI biases in future-proof Machine Translation solutions
Dodging AI biases in future-proof Machine Translation solutionsDodging AI biases in future-proof Machine Translation solutions
Dodging AI biases in future-proof Machine Translation solutionsKonstantin Savenkov
 
Building Multi-Purpose MT Portfolio
Building Multi-Purpose MT PortfolioBuilding Multi-Purpose MT Portfolio
Building Multi-Purpose MT PortfolioKonstantin Savenkov
 
Как выбрать и приручить машинный перевод / How to choose and tame the Machine...
Как выбрать и приручить машинный перевод / How to choose and tame the Machine...Как выбрать и приручить машинный перевод / How to choose and tame the Machine...
Как выбрать и приручить машинный перевод / How to choose and tame the Machine...Konstantin Savenkov
 
Progress in Commercial Machine Translation Systems
Progress in Commercial Machine Translation SystemsProgress in Commercial Machine Translation Systems
Progress in Commercial Machine Translation SystemsKonstantin Savenkov
 
Cloud Artificial Intelligence Landscape
Cloud Artificial Intelligence LandscapeCloud Artificial Intelligence Landscape
Cloud Artificial Intelligence LandscapeKonstantin Savenkov
 
Improving the Demand Side of the AI Economy (API World 2018)
Improving the Demand Side of the AI Economy (API World 2018)Improving the Demand Side of the AI Economy (API World 2018)
Improving the Demand Side of the AI Economy (API World 2018)Konstantin Savenkov
 
Сравнительный анализ систем машинного перевода
Сравнительный анализ систем машинного переводаСравнительный анализ систем машинного перевода
Сравнительный анализ систем машинного переводаKonstantin Savenkov
 
Cloud Sentiment Analysis - Vendor Overview (April 2018)
Cloud Sentiment Analysis - Vendor Overview (April 2018)Cloud Sentiment Analysis - Vendor Overview (April 2018)
Cloud Sentiment Analysis - Vendor Overview (April 2018)Konstantin Savenkov
 
NLU / Intent Detection Benchmark by Intento, August 2017
NLU / Intent Detection Benchmark by Intento, August 2017NLU / Intent Detection Benchmark by Intento, August 2017
NLU / Intent Detection Benchmark by Intento, August 2017Konstantin Savenkov
 
Управление бизнесом на основе данных
Управление бизнесом на основе данныхУправление бизнесом на основе данных
Управление бизнесом на основе данныхKonstantin Savenkov
 
Messengers, Bots and Personal Assistants
Messengers, Bots and Personal AssistantsMessengers, Bots and Personal Assistants
Messengers, Bots and Personal AssistantsKonstantin Savenkov
 
Рекомендательные системы: роль и оценка эффективности
Рекомендательные системы: роль и оценка эффективностиРекомендательные системы: роль и оценка эффективности
Рекомендательные системы: роль и оценка эффективностиKonstantin Savenkov
 
Measuring the agile process improvement
Measuring the agile process improvementMeasuring the agile process improvement
Measuring the agile process improvementKonstantin Savenkov
 
Driving Business Goals with Recommender Systems @ YAC/m 2015
Driving Business Goals with Recommender Systems @ YAC/m 2015Driving Business Goals with Recommender Systems @ YAC/m 2015
Driving Business Goals with Recommender Systems @ YAC/m 2015Konstantin Savenkov
 
The Economics of Recommender Systems
The Economics of Recommender SystemsThe Economics of Recommender Systems
The Economics of Recommender SystemsKonstantin Savenkov
 

More from Konstantin Savenkov (20)

GPT and other Text Transformers: Black Swans and Stochastic Parrots
GPT and other Text Transformers:  Black Swans and Stochastic ParrotsGPT and other Text Transformers:  Black Swans and Stochastic Parrots
GPT and other Text Transformers: Black Swans and Stochastic Parrots
 
Dodging AI biases in future-proof Machine Translation solutions
Dodging AI biases in future-proof Machine Translation solutionsDodging AI biases in future-proof Machine Translation solutions
Dodging AI biases in future-proof Machine Translation solutions
 
Building Multi-Purpose MT Portfolio
Building Multi-Purpose MT PortfolioBuilding Multi-Purpose MT Portfolio
Building Multi-Purpose MT Portfolio
 
Machine Translation Insights
Machine Translation InsightsMachine Translation Insights
Machine Translation Insights
 
Как выбрать и приручить машинный перевод / How to choose and tame the Machine...
Как выбрать и приручить машинный перевод / How to choose and tame the Machine...Как выбрать и приручить машинный перевод / How to choose and tame the Machine...
Как выбрать и приручить машинный перевод / How to choose and tame the Machine...
 
Progress in Commercial Machine Translation Systems
Progress in Commercial Machine Translation SystemsProgress in Commercial Machine Translation Systems
Progress in Commercial Machine Translation Systems
 
Cloud Artificial Intelligence Landscape
Cloud Artificial Intelligence LandscapeCloud Artificial Intelligence Landscape
Cloud Artificial Intelligence Landscape
 
Intento Enterprise MT Hub
Intento Enterprise MT HubIntento Enterprise MT Hub
Intento Enterprise MT Hub
 
Improving the Demand Side of the AI Economy (API World 2018)
Improving the Demand Side of the AI Economy (API World 2018)Improving the Demand Side of the AI Economy (API World 2018)
Improving the Demand Side of the AI Economy (API World 2018)
 
Сравнительный анализ систем машинного перевода
Сравнительный анализ систем машинного переводаСравнительный анализ систем машинного перевода
Сравнительный анализ систем машинного перевода
 
Cloud Sentiment Analysis - Vendor Overview (April 2018)
Cloud Sentiment Analysis - Vendor Overview (April 2018)Cloud Sentiment Analysis - Vendor Overview (April 2018)
Cloud Sentiment Analysis - Vendor Overview (April 2018)
 
NLU / Intent Detection Benchmark by Intento, August 2017
NLU / Intent Detection Benchmark by Intento, August 2017NLU / Intent Detection Benchmark by Intento, August 2017
NLU / Intent Detection Benchmark by Intento, August 2017
 
Building a Data Driven Business
Building a Data Driven BusinessBuilding a Data Driven Business
Building a Data Driven Business
 
Управление бизнесом на основе данных
Управление бизнесом на основе данныхУправление бизнесом на основе данных
Управление бизнесом на основе данных
 
Messengers, Bots and Personal Assistants
Messengers, Bots and Personal AssistantsMessengers, Bots and Personal Assistants
Messengers, Bots and Personal Assistants
 
Рекомендательные системы: роль и оценка эффективности
Рекомендательные системы: роль и оценка эффективностиРекомендательные системы: роль и оценка эффективности
Рекомендательные системы: роль и оценка эффективности
 
Measuring the agile process improvement
Measuring the agile process improvementMeasuring the agile process improvement
Measuring the agile process improvement
 
Lean production для SAAS
Lean production для SAASLean production для SAAS
Lean production для SAAS
 
Driving Business Goals with Recommender Systems @ YAC/m 2015
Driving Business Goals with Recommender Systems @ YAC/m 2015Driving Business Goals with Recommender Systems @ YAC/m 2015
Driving Business Goals with Recommender Systems @ YAC/m 2015
 
The Economics of Recommender Systems
The Economics of Recommender SystemsThe Economics of Recommender Systems
The Economics of Recommender Systems
 

Recently uploaded

Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 

Recently uploaded (20)

Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 

State of the Machine Translation by Intento (November 2017)

  • 1. State of the Machine Translation by Intento November 2017
  • 2. About • At Intento, we want to make Machine Intelligence services easy to discover, choose and use. • So far, evaluation is the most problematic part: to compare different services, one need to sign a lot of contracts and integrate a lot of APIs. • We deliver this overview report for FREE. To get the full report or evaluate on you own dataset, contact us. • Also, check out our Natural Language Understanding Benchmark. NLU may help you to automate workflows beyond the automated translation. November 2017© Intento, Inc.
  • 3. Overview © Intento, Inc. November 2017 11 Machine Translation Services 35 Language Pairs TRANSLATION QUALITY LANGUAGE COVERAGE DEVELOPER EXPERIENCE MISCELLANEOUS Get the full version of this report
  • 4. November 2017© Intento, Inc. Changes since July 2017 • +2 vendors: DeepL (beta), SAP (beta) • +21 language pair • Detailed performance analysis • Developer experience comparison
  • 5. Machine Translation Services* Compared * We have evaluated general purpose Cloud Machine Translation services with prebuilt translation models, provided via API. Some vendors also provide web-based, on-premise or custom MT engines, which may differ on all aspects from what we’ve evaluated. Baidu Translate API DeepL API (beta) Google Cloud Translation API GTCom YeeCloud MT IBM Watson Language Translator Microsoft Translator Text API PROMT Cloud API SAP Translation Hub (beta) SDL Cloud Machine Translation Systran REST Translation API Yandex Translate API
  • 6. Translation Quality November 2017© Intento, Inc. Evaluation Methodology Overall Performance Available MT Quality Price vs. Performance
  • 7. Evaluation methodology (I) • Translation quality is evaluated by computing LEPOR score between reference translations and the MT output (Slide 9). • Currently, our goal is to evaluate performance of translation between the most popular languages (Slide 10). • We use public datasets from StatMT/WMT and CASMACAT News Commentary (Slide 11). • We have performed LEPOR metric convergence analysis to identify minimal viable number of segments in the dataset. See Slide 12 for some details.
  • 8. Evaluation methodology (II) • We consider MT service A more performant than B for the language pair C if: - mean LEPOR score of A is greater than LEPOR of B for the pair C, and - lower bound of the LEPOR 95% confidence interval of A is greater than the upper bound of the LEPOR confidence interval of B for the pair C. See Slide 12 for an example. • Different language pairs (and different datasets) impose different translation complexity. In order to compare overall MT performance of different services we regularise LEPOR scores across all language pairs (See Appendix A for more details).
  • 9. LEPOR score • LEPOR: automatic machine translation evaluation metric considering the enhanced Length Penalty, n-gram Position difference Penalty and Recall • In our evaluation, we used hLEPORA v.3.1: • best metric from ACL-WMT 2013 contest https://www.slideshare.net/AaronHanLiFeng/lepor-an-augmented-machine-translation-evaluation-metric-thesis-ppt https://github.com/aaronlifenghan/aaron-project-lepor LIKE BLEU, BUT BETTER
  • 10. Language Pairs We focus on the en-P1, P1-en and P1-P1 (partially) * https://w3techs.com/technologies/overview/content_language/all Language groups by web popularity*: P1 - ≥ 2.0% websites P2 - 0.5%-2% websites P3 - 0.1-0.3% websites P4 - <0.1% websites en ru ja de es fr pt it zh cs tr fi ro en ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ru ✓ ✓ ✓ ✓ ✓ ja ✓ ✓ de ✓ ✓ es ✓ ✓ fr ✓ ✓ ✓ pt ✓ it ✓ ✓ zh ✓ ✓ cs ✓ tr ✓ fi ✓ ro ✓
  • 11. Datasets • WMT-2013 (translation task, news domain) - en-es, es-en • WMT-2015 (translation task, news domain) - fr-en, en-fr • WMT-2016 (translation task, news domain) - cs-en, en-cs, de-en, en-de, ro-en, en-ro, fi-en, en-fi, ru-en, en-ru, tr-en, en-tr • WMT-2017 (translation task, news domain) - zh-en, en-zh • NewsCommentary-2011 - en-ja, ja-en, en-pt, pt-en, en-it, it-en, ru-de, ru-es, ru-fr, ru-pt, ja-fr, de-ja, es-zh, fr-ru, fr-es, it-pt, zh-it
  • 12. LEPOR Convergence We used 1440 - 3000 sentences per language pair. In all cases it’s clear that the metric stabilises and adding more from the same domain won’t change the outcome. number of sentences regularisedhLEPORscores Aggregated across all language pairs Examples for individual language pairs: Aggre- gated mean Confi- dense interval Detailed data on each language pair provided in the full report
  • 13. Overall Performance 35 language pairs, 1440-3000 sentences per pair >70% <40% Variance among language pairs Detailed data on each language pair provided in the full report
  • 14. Available MT Quality en ru ja de es fr pt it zh cs tr fi ro en 4 7 3 2 7 1 2 6 2 4 5 2 ru 2 3 3 4 4 ja 1 4 de 8 2 es 7 4 fr 6 1 8 pt 4 it 7 2 zh 6 4 cs 2 tr 2 fi 1 ro 4 70 % 60 % 50 % 40 % 30 % Maximal Achieved hLEPOR score: No. of top-performing MT Providers Minimal price for this quality, per 1M char: $$$ ≥$20 $$ $10-15 $ <$10 $$ $ $ $$ $$ $ $ $$$ $$$ $ $$$ $ $ $$$ $$ $$ $ $$$ $ $ $$ $ $ $$ $ $$ $$ $ $$ $ $$$ $ $$ $ Detailed data on each language pair provided in the full report
  • 15. Sample pair analysis: en-pt LEPOR score Providers Price range (per 1M characters) 77 % Google $20 72 % Yandex, Microsoft $4.5-15 70 % Baidu, SDL, IBM $8-$21 62 % Systran, PROMT $3-$8 BEST QUALITY: BEST PRICE: PRICE&QUALITY: Google PROMT Microsoft ALL 35 PAIRS AVAILABLE IN THE FULL REPORT
  • 16. Price vs. Performance AFFORDABILITY PERFORMANCE As of November 2017 COST-EFFECTIVE ACCURATE FREE (BETA) NOT SET YET COST-EFFECTIVE Performance Regularized hLEPOR score aggregated across all language pairs in the dataset Affordability = 1/price Using public volume- based pricing tiers Legend • performance range: - regularised average - max across all pairs - min across all pairs • price range Detailed data on each language pair provided in the full report
  • 17. Language Coverage November 2017© Intento, Inc. Supported and Unique per Provider Coverage by Language Popularity
  • 18. Language coverage Unique language pairs - supported exclusively by one provider 1 100 10000 Google Yandex Microsoft Baidu Systran SDL MT PROMT SAP DeepLIBM WatsonGTCom 2 1 2 1 54 920 1 246 3 202 6 20 4247 88 104111 756 3 422 8 556 10 712 Total Unique Supported language pairs © Intento, Inc. July 2017Detailed data on the supported languages is provided in the full report
  • 19. Language popularity Language groups by web popularity*: P1 - ≥ 2.0% websites P2 - 0.5%-2% websites P3 - 0.1-0.3% websites P4 - <0.1% websites * https://w3techs.com/technologies/overview/content_language/all A total of 29070 pairs possible, 12989 are supported across all providers P1 en, ru, ja, de, es, fr, pt, it, zh P2 pl, fa, tr, nl, ko, cs, ar, vi, el, sv in, ro, hu P3 da, sk, fi, th, bg, he, lt, uk, hr, no, nb, sr, ca, sl, lv, et P4 hi, az, bs, ms, is, mk, bn, eu, ka, sq, gl, mn, kk, hy, se, uz, kr, ur, ta, nn, af, be, si, my, br, ne, sw, km, fil, ml, pa, …
  • 20. Language groups by web popularity*: P1 - ≥ 2.0% websites P2 - 0.5%-2% websites P3 - 0.1-0.3% websites P4 - <0.1% websites 100% 94% 63% 88% 31% P1 P2 P3 P4 P1 P2 P3 P4 59% 100% 100% 94% 63% 100% 94% 94% 63% 63% 59% Supported language pairs by popularity * up from 44% in July 2017 as we better distinguish variations of the Chinese language© Intento, Inc. Language coverage 45%* of possible language pairs July 2017
  • 21. by service provider Google Cloud Translation API Microsoft Translator Text API Yandex Translate API Systran REST Translation API SDL Cloud Machine Translation PROMT Cloud API IBM Watson Language Translation July 2017© Intento, Inc. Language coverage Baidu Translate API GTCom YeeCloud API DeepL API SAP Translation Hub Detailed data on the supported languages is provided in the full report
  • 22. Developer Experience (DX) November 2017© Intento, Inc. Evaluation Methodology DX Charts
  • 23. Evaluation methodology (I) Here we evaluate overall service organisation from the following angles: • Product - Support of Machine Translation features desired for using the API in various MT scenarios • Design - Overall API design and technical convenience • Documentation - How well the API is documented • Onboarding - How easy is to integrate and start using the API • Commercial - Flexibility of the commercial terms • Implementation - Important low-level features of the API • Maintenance - Convenience of getting information about the API changes for ongoing support • Reliability - Various technical issues we’ve encountered Some references: • http://talks.kinlane.com/apistrat/api101/index.html#/14 • https://mathieu.fenniak.net/the-api-checklist/ • https://www.slideshare.net/jmusser/ten-reasons-developershateyourapi • https://restfulapi.net/richardson-maturity-model/ • https://github.com/shieldfy/API-Security-Checklist • https://nordicapis.com/why-api-developer-experience-matters-more-than-ever/ • http://www.drdobbs.com/windows/measuring-api-usability/184405654?pgno=1
  • 24. Evaluation methodology (II) • Translation domains • Translation engines • Language autodetect • Glossaries • TM Support • Custom engines • Bulk mode • Formatted text • XLIFF support Product Design Documentation Onboarding Commercial Implementation Maintenance Reliability • Authentication • Use of SSL • Quota info • Domain info • Balance info • Self-sufficient • Intuitive • Versioning • Bulk mode • Task-invocation ratio • I/O Structure • List of endpoints • User documentation • Supported languages • Quotas • Response codes • Error codes • Error messages • API explorer • API console • Number of docs • HTML doc • Self-registration • Self-issued keys • Self-payment • Free / Trial plan • Sandbox • Test console • Github repo • Code libraries • SDK / PDK • Sample code • Direct support • Ticket system • Self-support • Tutorial • FAQ / KB • Starter package • Public pricing • Pay as you go • Post-paid • Volume discounts • Payment systems • Billing history • API spec • Data compression • Supports JSON • Negotiable content • Unicode support • Error codes • Error messages • News source • Subscription news • Versioning • Changelog • Release notes • Roadmap • Status dashboard • Developer dashboard • Exportable logs • Uptime • Sporadic errors • Bugs • Performance issues • Status dashboard • Outage alerts
  • 25. by service provider July 2017© Intento, Inc. Developer experience Available in the full report
  • 26. Miscellaneous November 2017© Intento, Inc. API change frequency
  • 27. API & Documentation changes since July 2017: Change frequency SDL Language Cloud 31 Google Translate API 14 IBM Watson Translator 10 Microsoft Translator API 7 SAP Translation Hub 6 July 2017© Intento, Inc.
  • 28. Detailed version of this report • We give this over view version for free. • The full evaluation report contains: - Detailed best-deal analysis for each of the 35 language pairs - Developer experience analysis for each of the 11 MT providers • Also, by ordering the full report you support our ongoing evaluation of the Cloud MT • To get the full report, reach us at hello@inten.to
  • 29. Discover the best service providers for your AI task Evaluate performance on your own data at a fraction of the potential cost saving Access any provider with no effort using to our Single API Intento Service Platform July 2017© Intento, Inc.
  • 31. Appendix A July 2017© Intento, Inc. Overall performance of the MT services across many language pairs is computed in the following way: 1. [Standardisation] We compute mean language-standardised LEPOR score (or z-score) for each provider. 2. [Scale adjustment] We restore the original scale by multiplying z-score for each MT provider by the global LEPOR standard deviation and adding the global mean LEPOR score.