Low-resource Neural Machine Translation is highly sensitive to hyperparameters and needs careful tuning to achieve the best results with small amounts of training data. We focus on exploring the impact of changes in the Transformer architecture on downstream translation quality, and propose a metric to score the computational efficiency of such changes. By experimenting on English-Akkadian, German-Lower Sorbian, English-Italian, and English-Manipuri, we confirm previous findings in low-resource machine translation optimization, and show that smaller, more parameter-efficient models can achieve the same translation quality as larger, unwieldier ones at a fraction of the computational cost. We compile a list of optimal ranges for each hyperparameter.
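The paper's actual efficiency metric is not reproduced here. As a rough illustration only, the hypothetical sketch below shows one way a score could relate translation quality to model size and training cost; the function name, fields, and formula are assumptions for demonstration, not the authors' definition.

```python
from dataclasses import dataclass

@dataclass
class RunStats:
    """Summary statistics for one Transformer training run (hypothetical fields)."""
    bleu: float          # downstream translation quality, e.g. SacreBLEU on the test set
    params: int          # number of trainable parameters
    train_hours: float   # wall-clock GPU hours used for training

def efficiency_score(run: RunStats, baseline: RunStats) -> float:
    """Toy efficiency score: quality retained per unit of relative compute.

    NOT the metric proposed in the paper; just an illustrative placeholder
    that rewards models matching the baseline's BLEU while using fewer
    parameters and less training time.
    """
    quality_ratio = run.bleu / baseline.bleu
    size_ratio = run.params / baseline.params
    time_ratio = run.train_hours / baseline.train_hours
    # Average the relative compute costs; cheaper models score higher.
    return quality_ratio / ((size_ratio + time_ratio) / 2)

# Example: a small model keeping ~98% of baseline BLEU at ~40% of the cost.
baseline = RunStats(bleu=25.0, params=60_000_000, train_hours=10.0)
small = RunStats(bleu=24.5, params=20_000_000, train_hours=5.0)
print(f"efficiency vs. baseline: {efficiency_score(small, baseline):.2f}")
```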
edoardosignoroni/eff_archs_lowre
About
Results and code for the paper "Efficient Architectures for Low-resource Machine Translation" (Workshop on Advancing NLP for Low-Resource Languages at RANLP 2025, Varna, Bulgaria, September 13).