Content Services
- Technical Writing
- Training & eLearning
- Financial Reports
- Digital Marketing
- SEO & Content Optimization
Translation Services
- Video Localization
- Software Localization
- Website Localization
- Translation for Regulated Companies
- Interpretation
- Instant Interpreter
- Live Events
- Language Quality Services
Testing Services
- Functional QA & Testing
- Compatibility Testing
- Interoperability Testing
- Performance Testing
- Accessibility Testing
- UX/CX Testing
Solutions
- Translation Service Models
- Machine Translation
- Smart Onboarding™
- Aurora AI Studio™
Our Knowledge Hubs
- Positive Patient Outcomes
- Modern Clinical Trial Solutions
- Future of Localization
- Innovation to Immunity
- COVID-19 Resource Center
- Disruption Series
- Patient Engagement
- Lionbridge Insights
Life Sciences
- Pharmaceutical
- Clinical
- Regulatory
- Post-Approval
- Corporate
- Medical Devices
- Validation and Clinical
- Regulatory
- Post-Authorization
- Corporate
Banking & Finance
Retail
Luxury
E-Commerce
Games
Automotive
Consumer Packaged Goods
Technology
Industrial Manufacturing
Legal Services
Travel & Hospitality
SELECT LANGUAGE:
As professionals and the general public increasingly use generic, freely available Machine Translation (MT) systems, it’s essential to be aware that these engines can produce flawed translations. Poor quality or catastrophic errors may have a detrimental impact on businesses. But there are ways to bolster MT quality, including executing MT terminology management effectively.
Due to the complex relationships between concepts and their representations through terminology units, terminology remains one of the biggest challenges for Machine Translation.
Using free MT systems for specific domains can especially cause undesired results from a terminological point of view. The impact can be particularly harmful to the medical and legal fields.
While you can train MT systems with a specific domain corpus to prevent poor results to some degree, the use of generic MT systems may not guarantee consistent or accurate terminology.
The quality of the translation by an MT engine depends on the quality of the bilingual training corpus, among other things. Therefore, an accurate translation of terminology can only be guaranteed if the corpus includes the relevant terms in both the source and the target part.
Although Neural Machine Translation (NMT) systems are based on probabilistic distributions of terms, their presence is necessary but insufficient to ensure high-quality translations. The frequency of terms in the corpus must be satisfactory for the decoder to obtain the exact equivalents. If the frequency of a particular term is inadequate, it will not get the sufficient weight to be considered a candidate for equivalents and will not be translated accurately.
Generic MT systems are often trained with large corpora of varied content. As a result, the most frequent candidate term and its possible equivalent may belong to a different domain than the term being translated. This situation can cause terms to be translated inaccurately into the target language.
For example, the Spanish term fósforo can be translated as a match (the object to light fire) or as phosphorus (a chemical element). A generic MT engine will be unable to differentiate the intent easily, and the translation may result in an error.
The solution to this problem is to train customized MT systems with domain-specific bilingual texts that include specialized terminology.
Still, accurate translations cannot be guaranteed when engines are trained with specialized texts if the terminology is not used consistently.
Research in this area proposes to inject linguistic information into NMT systems by means of annotation methods.
The implementation of manual or semi-automatic annotation depends on available resources, such as glossaries, and constraints, such as time, cost, and availability of human annotators.
Lionbridge’s Smart MT™ allows the application of linguistic rules to the source and target text, as well as the enforcement of terminology based on Do Not Translate (DNT) and glossary lists added to a specific profile.
We help our customers create and maintain glossaries, which are regularly refined to include new, relevant terms and retire obsolete terminology. When glossaries are created once in Smart MT, they can then be used for all the MT engines, saving time and money.
Using glossaries for MT projects is not as simple as it may seem. Glossaries, if used inappropriately, can negatively affect the overall quality of Machine Translation. The best way to follow terminology in MT is through MT training.
The combination of trained MT engines, glossary customization, and the identification of preprocessing and post-processing rules ensure MT output contains proper terminology and is similar in style to the customer's documentation.
When you assess terminology management features in a Machine Translation solution, look for the offering’s ability to:
Together, these capabilities will promote a higher-quality translation output.
Smart MT works with various third-party MT systems via a connector. Think of it as an “MT harness” that can:
Lionbridge also has other automations able to identify inconsistencies between the terminology in customer glossaries and how that terminology is used in the training corpus and in the output of MT. These automations help identify the cases in which the training corpus or, later, the MT outputs don’t follow the approved terminology and will correct them.
If we learn that MT does not include the required terminology, we suggest using a glossary with DNTs, product names, and specific key domain or brand terminology.
To ensure the desired Machine Translation outcome, we recommend you consider the following guidelines as you create your glossaries:
If you’d like to learn more about how Lionbridge can help you manage your terminology and get the most out of MT, contact us today.