luščenje in prikaz podatkov o jezikovni rabi
Kaja Dobrovoljc (Author), Simon Krek (Author)

Abstract

V prispevku predstavljamo proces luščenja in prikazovanja korpusnih podatkov, kakršen je bil vzpostavljen pri pripravi demonstracijskih gesel na spletnem portalu Slogovni priročnik. Kot most med nevtralnimi korpusnimi podatki in vizualizacijo normativnih podatkov na portalu služi leksikon besednih oblik, njihovo pretakanje iz leksikona na portal pa usmerja mehanizem kratkega odgovora, ki omogoča, da se podatki na portalu avtomatsko prilagajajo spremembam v jeziku oz. referenčnem korpusu.

Keywords

spletni portal;jezikovni priročniki;standardizacija;pravopis;jezikovne tehnologije;luščenje podatkov;

Data

Language: Slovenian
Year of publishing:
Typology: 1.16 - Independent Scientific Component Part or a Chapter in a Monograph
Organization: UL FF - Faculty of Arts
UDC: 81'22:004.8
COBISS: 27588391 Link will open in a new window
Views: 9
Downloads: 0
Average score: 0 (0 votes)
Metadata: JSON JSON-RDF JSON-LD TURTLE N-TRIPLES XML RDFA MICRODATA DC-XML DC-RDF RDF

Other data

Secondary language: English
Secondary abstract: The paper presents the process of corpus data extraction and representation for the purpose of creating the Style Guide web portal for Slovene. The neutral corpus data and information about language codification are merged within a lexicon of inflected forms and subsequently visualised through the šshort answer’ system that enables the portal data to automatically adapt to any changes in the language or its reference corpus.
Secondary keywords: web portal;language reference books;standardisation;normative guide;language technologies;data extraction;
Type (COBISS): Article
Pages: Str. 101-107
ID: 19519219
Recommended works:
, luščenje in prikaz podatkov o jezikovni rabi
, scientific basis and inclusion of the public