Neural Machine Translation for Bangla-English Language Pair - Bangla Language Computing Research

Md. Arid Hasan, Firoj Alam, Shammur Absar Chowdhury, Naira Khan: Neural Machine Translation for Bangla-English Language Pair. In: 2019 22nd International Conference of Computer and Information Technology (ICCIT), IEEE 2019.

Abstract

Due to the rapid advancement of different neural network architectures, the task of automated translation from one language to another is now in a new era of Machine Translation (MT) research. In the last few years, Neural Machine Translation (NMT) architectures have proven to be successful for resource-rich languages, trained on a large dataset of translated sentences, with variations of NMT algorithms used to train the model. In this study, we explore different NMT algorithms-Bidirectional Long Short Term Memory (LSTM) and Transformer based NMT, to translate the Bangla to English language pair. For the experiments, we used different datasets and our experimental results outperform the existing performance by a large margin on different datasets. We also investigated the factors affecting the data quality and how they influence the performance of the models. It shows a promising research avenue to enhance NMT for the Bangla-English language pair.

BibTeX (Download)

@inproceedings{arid2019neural,
title = {Neural Machine Translation for Bangla-English Language Pair},
author = {Md. Arid Hasan and Firoj Alam and Shammur Absar Chowdhury and Naira Khan},
url = {https://www.researchgate.net/publication/338223294_Neural_Machine_Translation_for_the_Bangla-English_Language_Pair},
year  = {2019},
date = {2019-12-18},
booktitle = {2019 22nd International Conference of Computer and Information Technology (ICCIT)},
organization = {IEEE},
abstract = {Due to the rapid advancement of different neural network architectures, the task of automated translation from one language to another is now in a new era of Machine Translation (MT) research. In the last few years, Neural Machine Translation (NMT) architectures have proven to be successful for resource-rich languages, trained on a large dataset of translated sentences, with variations of NMT algorithms used to train the model. In this study, we explore different NMT algorithms-Bidirectional Long Short Term Memory (LSTM) and Transformer based NMT, to translate the Bangla to English language pair. For the experiments, we used different datasets and our experimental results outperform the existing performance by a large margin on different datasets. We also investigated the factors affecting the data quality and how they influence the performance of the models. It shows a promising research avenue to enhance NMT for the Bangla-English language pair.},
keywords = {Bangla-to-English, Bidirectional LSTM, Machine Translation, Neural Machine Translation, Transformer},
pubstate = {published},
tppubtype = {inproceedings}
}