SciELO - Scientific Electronic Library Online

 
vol.111 número4Leveraging the Technology of Unmanned Aerial Vehicles for Developing Countries índice de autoresíndice de assuntospesquisa de artigos
Home Pagelista alfabética de periódicos  

Serviços Personalizados

Artigo

Indicadores

Links relacionados

  • Em processo de indexaçãoCitado por Google
  • Em processo de indexaçãoSimilares em Google

Compartilhar


SAIEE Africa Research Journal

versão On-line ISSN 1991-1696
versão impressa ISSN 0038-2221

Resumo

KIZITO, Ronald; OKELLO, Wayne S.  e  KAGUMIRE, Sulaiman. Design and Implementation of a Luganda Text Normalization Module for a Speech Synthesis Software Program. SAIEE ARJ [online]. 2020, vol.111, n.4, pp.149-154. ISSN 1991-1696.

This paper describes a Luganda text normalization module, a crucial component needed for a Luganda Text to Speech system. We describe the use of a rule-based approach for detection, classification and verbalization of Luganda text. At the core of this module are the Luganda grammar rules that were hand-built to normalize Non-Standard Words (NSWs) from different semiotic and noun classes. Input text is first analyzed, matched against handcrafted patterns developed using regular expressions to detect any NSWs. Upon detection, NSWs are tokenized and classified into one of the semiotic classes and then if necessary, into one of the Luganda noun classes. These are subsequently verbalized, each according to its semiotic as well as noun class, and a new text file is produced. We tested the module with 7 datasets and achieved average detection and normalization rates of 82% and 77.7% respectively.

Palavras-chave : Automatic Speech Recognition; Detection-conversion; Luganda; Machine Translation; NLP; Number system; Speech Synthesis; Text Normalization; Text-to-speech; TTS.

        · texto em Inglês     · Inglês ( pdf )

 

Creative Commons License Todo o conteúdo deste periódico, exceto onde está identificado, está licenciado sob uma Licença Creative Commons