Professional Documents
Culture Documents
ISSN No:-2456-2165
Abstract:- In this paper, Wolaytegna to Amharic In this research work the Bilingual dictionary which is
machine translation were conducted using dictionary used in the Wolaytegna to Amharic translation and vice
based machine translation approach. Machine versa is the core components of a machine translation of
translation system one of a key purpose Natural these two languages. There are many approaches for
Language processing and it is a process of translating developing the MT systems, each approach has their own
from one language to another. In this study the advantages and disadvantages. Out of these approaches
researcher were translated two Ethiopian languages one dictionary based machine translation the most
local language (Wolaytegna) and the other one is official recommended for linguistically less resourced language
language of the country (Amharic) by using dictionary like Wolaytegna. In Ethiopia there are about 80 different
based approach. This research is very important for the languages are available from which Wolaytegna is the 7th
development of the Wolaytegna language which is most spoken language which is spoken by around 7 million
spoken by around 7 million people in Wolaytta zone and people in the country specially by Wolaytta people in
other part of the Ethiopia. For this research we used SNNPR and one of a language with few resource published
Java, MYSQL database and 5400 word entries in electronically in internet and other different media.
dictionary were created in the database to create Oppositely Amharic is historically advantageous language
accurate translation. For all words of source language in Ethiopia because different regime at different period in
we defined meaning in target language in bilingual Ethiopia used the language as official language of the
dictionary. The proposed methodology uses dictionary country so that it is one of linguistically well resourced
for translating word by word without much because this languages compared to other Ethiopian languages.
kind of approach is very advisable for linguistically less
resourced language like Wolaytegna. So this research work will supports Amharic speakers
to use Wolaytegna and Vice-versa by using dictionary
Keyword:- Wolaytegna, Machine translation, Dictionary, based machine translation.
Bilingual, Multilingual, Natural language processing.
The biggest challenge for Statistical Machine
I. INTRODUCTION Translation is to get the high quality corpus because of
insufficient sources of the data for the language like
Translation systems plays a vital role in narrowing the Wolaytegna. Dictionary Based Machine Translation
communication barrier between human race from different (DBMT) approach is used when less number of linguistic
corner of the world. Natural Language Processing (NLP) is resources is available for the languages. In the dictionary
a core discipline in machine translation and it is field of based translation, a system is defined which contains set of
computer science devoted to the improvement of models source language word and corresponding target language
and technologies empowering computers to use human words. During the run time, dictionary based translation use
languages both as input and output [3]. One of the aim of bilingual corpus as its database which is defined in the form
NLP is to develop computational models that can have of dictionary. This database is stored in the translation
equal performance like in the task of reading, writing, memory. Since the two languages, Wolaytegna and
learning, speaking and understanding. Computational Amharic have the same grammatical sentence structure, so
models are useful to explore the nature of linguistic that when the system encounters the any sentence the
communication as well as for enabling effective human- system does not require any rearrange in the sentence
machine interaction. rather it translates directly by retrieving from the translation
memory.
This speedy growth of data on internet was
encouragement for the MT researchers to develop more
profitable MT systems to deliver a worldwide
communication.
We will explain basic details of those steps in C. Detecting contextual ambiguity from alternatives
architectural model in the following content. A word may have more than one meaning in different
sentence with same spelling and pronunciation.
A. Splitting sentence into words
In this research translation can be word based, phrase D. Retrieving target word
based or sentence based; so that if the input is phrase or In this stage of machine translation checking for the
sentence it must be break into words because the entry in availability of each words of source language in given
database is only word based in dictionary. sentence and storing to defined array.
V. EXPERMENTAL RESULTS
Fig 7:- The main interface before feeding texts of source language.
Fig 8:- The figure illustrated how bilingual machine translator works