doktorska disertacija
Jernej Vičič (Author), Igor Kononenko (Mentor), Tomaž Erjavec (Co-mentor)

Abstract

Hitra postavitev prevajalnih sistemov na osnovi pravil za sorodne naravne jezike

Keywords

strojno prevajanje;strojno prevajanje sorodnih naravnih jezikov;tehnologije hitrih postavitev sistemov za strojno prevajanje;računalništvo;disertacije;

Data

Language: Slovenian
Year of publishing:
Typology: 2.08 - Doctoral Dissertation
Organization: UL FRI - Faculty of Computer and Information Science
Publisher: [J. Vičič]
UDC: 004.85:81'322.4(043.3)
COBISS: 9339732 Link will open in a new window
Views: 50
Downloads: 2
Average score: 0 (0 votes)
Metadata: JSON JSON-RDF JSON-LD TURTLE N-TRIPLES XML RDFA MICRODATA DC-XML DC-RDF RDF

Other data

Secondary language: English
Secondary title: A fast implementation of rules based machine translation systems for similar natural languages
Secondary abstract: The work presents an overview of the systems and methods for the natural language machine translation. It focuses primarily on systems and methods for the translation of the related languages. Most of the presented systems belong to the Shallow Parse and Transfer Rule-Based Machine Translation paradigm, which is better suited for the implementation of a translation system for related languages. The major problem of the rule-based translation systems is costly manual production of dictionaries and translation rules in the case of a classical approach to building such systems. The work provides an overview over the collection of selected and new methods designed for automatic production of materials for the installation of systems based on translation rules. Methods were tested on a case study: the implementation of a fully functioning translation system for related languages. The following four systems were used as the basis: Slovenian-Serbian, Slovenian-Czech, Slovenian-English and Slovenian-Estonian. The evaluation process focused on the quality of the translations as well as the estimation of the time needed for the implementation of a new system. The dissertation presents a method that extends the basic Statistical Machine Translation by Parsing paradigm for languages with limited support of language technologies. The learning phase uses an aligned corpus instead of a full treebank. The dissertation describes a method for the automatic creation of morphologies, which includes automatic paradigm tagging, automatic paradigm construction for the highly inflected languages and automatic production of bilingual dictionaries. The dissertation presents a method for the selection and assessment of the rules for the structural transfer. Methods for the automatic construction of structural transfer rules often produce a large set of rules, which compete with each other (it is possible to use multiple rules on the same part of text). The best rules are chosen on the basis of the target language corpus.
Secondary keywords: machine translation;machine translation of related languages;speeding up the inplementation of machine translation systems;computer science;doctoral dissertations;theses;Strojno prevajanje;Disertacije;
File type: application/pdf
Type (COBISS): Dissertation
Thesis comment: Univ. v Ljubljani, Fak. za računalništvo in informatiko
Pages: 165 str.
ID: 24168141