Number of hits: 10
Undergraduate thesis
Tags:
sociology;history;Lovrenc na Pohorju;education;cultural societies;sports clubs;natural landmarks;cultural landmarks;economy;undergraduate theses;
Sveti Lovrenc na Pohorju: an overview of the cultural, economic, social, ecclesiastical, and educational development of Sveti Lovrenc na Pohorju
Year:
2009
Source:
Filozofska fakulteta (UM FF)
Master's thesis
Tags:
machine translation;statistical machine translation;adapting a machine translation system to a specific domain;adapting statistical machine translation to the pharmaceutical domain;factored model;Moses;phrase-based model;Cohen's kappa;Fleiss' kappa;inter-rater agreement;
Machine translation, and statistical machine translation in particular, has become widespread in recent years, owing mainly to the growing number of multilingual language resources. Most publicly available machine translation systems let us gain a basic understanding of content in a foreign language, whereas these ...
Year:
2016
Source:
Fakulteta za računalništvo in informatiko (UL FRI)
Research data
Tags:
Moses language model;probability language model
The ccGigafida ARPA language model was created from the ccGigafida written corpus of Slovenian (https://www.clarin.si/repository/xmlui/handle/11356/1035) using the KenLM algorithm in the Moses machine translation framework. It is a general language model of contemporary standard Slovenian language t ...
Year:
2017
Source:
CLARIN.si
Research data
Tags:
common words;frequent words;reference corpora;readability
The reference list of the most frequent common words in Slovene was prepared by selecting vocabulary at the intersection of the most frequent 10,000 lemmas of four Slovene text corpora: the balanced reference corpus of written Slovene Kres, the reference corpus of spoken Slovene GOS, the corpus of computer ...
Year:
2020
Source:
CLARIN.si
Research data
Tags:
benchmark;language understanding
SuperGLUE is a benchmark styled after GLUE with a new set of more difficult language understanding tasks, improved resources, and a public leaderboard. It comprises 8 corpora (BoolQ, CB, COPA, MultiRC, ReCoRD, RTE, WiC, WSC), which cover 4 different types of tasks (QA, NLI, WSD, coref.). Slove ...
Year:
2020
Source:
CLARIN.si
Research data
Tags:
terminology;manual annotation
The dataset contains 22,950 term candidates extracted from 15 Slovenian PhD theses. The term candidates are of length 1 to 4, extracted via morphosyntactic patterns and a frequency threshold of 3. The PhD theses are from the areas of chemistry, computer science and political science. Each of the t ...
Year:
2018
Source:
CLARIN.si
Research data
Tags:
corpus linguistics;text processing;extraction;characters;word parts;words;word sets;n-grams;morphology
The LIST corpus extraction tool is a Java program for extracting lists from text corpora on the levels of characters, word parts, words, and word sets. It supports VERT and TEI P5 XML formats and outputs .CSV files that can be imported into Microsoft Excel or similar statistical processing software.
Year:
2019
Source:
CLARIN.si
Research data
Tags:
corpus linguistics;text processing;extraction;characters;word parts;words;word sets;n-grams;morphology
The LIST corpus extraction tool is a Java program for extracting lists from text corpora on the levels of characters, word parts, words, and word sets. It supports VERT and TEI P5 XML formats and outputs .CSV files that can be imported into Microsoft Excel or similar statistical processing software. ...
Year:
2019
Source:
CLARIN.si
Research data
Tags:
morphology;inflection;word forms;derivation;LMF;lemmatisation;word accents;IPA;SAMPA
Sloleks is the reference morphological lexicon for the Slovenian language, developed to be used in NLP applications and language manuals. Encoded in LMF XML, the lexicon contains the approx. 100,000 most frequent Slovenian lemmas, their inflected or derivative word forms and the corresponding grammatical de ...
Year:
2019
Source:
CLARIN.si
Research data
Tags:
thesaurus
This is an automatically created Slovene thesaurus from Slovene data available in a comprehensive English–Slovenian dictionary, a monolingual dictionary, and a corpus. A network analysis on the bilingual dictionary word co-occurrence graph was used, together with additional information from the dist ...
Year:
2018
Source:
CLARIN.si