Ana Zwitter Vitez (Author), Jana Zemljarič Miklavčič (Author), Marko Stabej (Author), Simon Krek (Author)

Abstract

V okviru projekta Sporazumevanje v slovenskem jeziku nastaja referenčni govorni korpus slovenskega jezika, ki bo govorni vir za nekatere jezikovne priročnike in različne jezikoslovne raziskave. Zaradi praktične namembnosti govornega korpusa so ključni cilji njegove gradnje čim bolj pregledne iskalne možnosti in čim lažja berljivost transkripcij. V prispevku predstavljamo načela označevanja posnetkov ter segmentiranja in transkribiranja govora.

Keywords

slovenščina;korpusna lingvisitka;govorni korpus;govorjeni jezik;transkribiranje;segmentiranje;

Data

Language: Slovenian
Year of publishing:
Typology: 1.08 - Published Scientific Conference Contribution
Organization: UL FF - Faculty of Arts
UDC: 811.163.6'271.16:003.035
COBISS: 43881570 Link will open in a new window
Views: 15
Downloads: 0
Average score: 0 (0 votes)
Metadata: JSON JSON-RDF JSON-LD TURTLE N-TRIPLES XML RDFA MICRODATA DC-XML DC-RDF RDF

Other data

Secondary language: English
Secondary abstract: The project Communication in Slovene includes the construction of a reference corpus of spoken Slovene, which will function as a resource for certain language guides and research projects. Due to its practical goals, key aims of the corpus are straightforward search options and easy-to-read transcription. This paper presents the method to be used for the mark-up of recordings, and for segmenting and transcribing speech samples.
Secondary keywords: Slovene language;corpus linguistics;spoken corpus;spoken language;transcribing;segmenting;
Type (COBISS): Article
Pages: Str. 437-442
ID: 20861400
Recommended works:
, no subtitle data available
, no subtitle data available
, zasnova vprašalnika, prvi rezultati