Povzetek
The incorporation of grammatical information into speech recognition systems is often used to increase performance in morphologically rich languages. However, this introduces demands for sufficiently large training corpora and proper methods of using the additional information. In this paper, we present a method for building factored language models that use data obtained by morphosyntactic tagging. The models use only relevant factors that help to increase performance and ignore data from other factors, thus also reducing the need for large morphosyntactically tagged training corpora. Which data is relevant is determined at run-time, based on the current text segment being estimated, i.e., the context. We show that using a context-dependent model in a two-pass recognition algorithm, the overall speech recognition accuracy in a Broadcast News application improved by 1.73% relatively, while simpler models using the same data achieved only 0.07% improvement. We also present a more detailed error analysis based on lexical features, comparing first-pass and second-pass results.
Ključne besede
govorne tehnologije;razpoznavanje govora;avtomatsko razpoznavanje govora;speech recognition;factored language model;dynamic backoff path;word context;inflectional language;morphosyntactic tags;
Podatki
Jezik: |
Angleški jezik |
Leto izida: |
2017 |
Tipologija: |
1.01 - Izvirni znanstveni članek |
Organizacija: |
UM FERI - Fakulteta za elektrotehniko, računalništvo in informatiko |
UDK: |
004.934 |
COBISS: |
20330774
|
ISSN: |
1687-4722 |
Št. ogledov: |
1139 |
Št. prenosov: |
319 |
Ocena: |
0 (0 glasov) |
Metapodatki: |
|
Ostali podatki
Sekundarni jezik: |
Slovenski jezik |
Sekundarne ključne besede: |
govorne tehnologije;razpoznavanje govora;avtomatsko razpoznavanje govora; |
URN: |
URN:SI:UM: |
Vrsta dela (COBISS): |
Znanstveno delo |
Strani: |
str. 1-16 |
Letnik: |
ǂVol. ǂ2017 |
Zvezek: |
ǂno. ǂ6 |
Čas izdaje: |
2017 |
DOI: |
10.1186/s13636-017-0104-6 |
ID: |
10844541 |