Undergraduate thesis
Gašper Žejn (Author), Dušan Kodek (Mentor)

Abstract

Speaker diarization of audio recordings

Keywords

speech analysis;diarization;speaker indexing;computer science;university studies;undergraduate theses;

Data

Language: Slovenian
Year of publishing:
Typology: 2.11 - Undergraduate Thesis
Organization: UL FRI - Faculty of Computer and Information Science
Publisher: [G. Žejn]
UDC: 004(043.2)
COBISS: 9963348

Other data

Secondary language: English
Secondary title: Speaker diarization of audio recordings
Secondary abstract: Speech analysis is a broad research area in computer science. Diarization is the process of answering the question "who spoke when" by analyzing speech and extracting speaker-specific information from it. This thesis evaluates freely available speaker diarization tools for use on Slovenian speech, with an emphasis on recordings of meetings. Two tools are evaluated: SHoUT and LIUM SpkDiarization. Both rely on similar theoretical foundations, which are explained in chapter 3. The tools, their use, and the test recordings are introduced in chapter 4. The results show that SHoUT is also useful for Slovenian speech, even though it was not evaluated on Slovenian speech during its development. LIUM SpkDiarization is less stable and shows peculiar anomalies, such as merging all speakers of the same gender into one, which indicates that further research and parameter tuning are needed before LIUM SpkDiarization is used on Slovenian speech.
Secondary keywords: speech analysis;diarization;diarisation;speaker indexing;computer science;diploma;
File type: application/pdf
Type (COBISS): Undergraduate thesis
Thesis comment: Univ. of Ljubljana, Fac. of Computer and Information Science
Pages: 33 f.
ID: 24168180