doktorska disertacija
Jernej Vičič (Avtor), Igor Kononenko (Mentor), Tomaž Erjavec (Komentor)

Povzetek

Hitra postavitev prevajalnih sistemov na osnovi pravil za sorodne naravne jezike

Ključne besede

strojno prevajanje;strojno prevajanje sorodnih naravnih jezikov;tehnologije hitrih postavitev sistemov za strojno prevajanje;računalništvo;disertacije;

Podatki

Jezik: Slovenski jezik
Leto izida:
Tipologija: 2.08 - Doktorska disertacija
Organizacija: UL FRI - Fakulteta za računalništvo in informatiko
Založnik: [J. Vičič]
UDK: 004.85:81'322.4(043.3)
COBISS: 9339732 Povezava se bo odprla v novem oknu
Št. ogledov: 50
Št. prenosov: 2
Ocena: 0 (0 glasov)
Metapodatki: JSON JSON-RDF JSON-LD TURTLE N-TRIPLES XML RDFA MICRODATA DC-XML DC-RDF RDF

Ostali podatki

Sekundarni jezik: Angleški jezik
Sekundarni naslov: A fast implementation of rules based machine translation systems for similar natural languages
Sekundarni povzetek: The work presents an overview of the systems and methods for the natural language machine translation. It focuses primarily on systems and methods for the translation of the related languages. Most of the presented systems belong to the Shallow Parse and Transfer Rule-Based Machine Translation paradigm, which is better suited for the implementation of a translation system for related languages. The major problem of the rule-based translation systems is costly manual production of dictionaries and translation rules in the case of a classical approach to building such systems. The work provides an overview over the collection of selected and new methods designed for automatic production of materials for the installation of systems based on translation rules. Methods were tested on a case study: the implementation of a fully functioning translation system for related languages. The following four systems were used as the basis: Slovenian-Serbian, Slovenian-Czech, Slovenian-English and Slovenian-Estonian. The evaluation process focused on the quality of the translations as well as the estimation of the time needed for the implementation of a new system. The dissertation presents a method that extends the basic Statistical Machine Translation by Parsing paradigm for languages with limited support of language technologies. The learning phase uses an aligned corpus instead of a full treebank. The dissertation describes a method for the automatic creation of morphologies, which includes automatic paradigm tagging, automatic paradigm construction for the highly inflected languages and automatic production of bilingual dictionaries. The dissertation presents a method for the selection and assessment of the rules for the structural transfer. Methods for the automatic construction of structural transfer rules often produce a large set of rules, which compete with each other (it is possible to use multiple rules on the same part of text). The best rules are chosen on the basis of the target language corpus.
Sekundarne ključne besede: machine translation;machine translation of related languages;speeding up the inplementation of machine translation systems;computer science;doctoral dissertations;theses;Strojno prevajanje;Disertacije;
Vrsta datoteke: application/pdf
Vrsta dela (COBISS): Doktorska disertacija
Komentar na gradivo: Univ. v Ljubljani, Fak. za računalništvo in informatiko
Strani: 165 str.
ID: 24168141