Loading [MathJax]/extensions/MathMenu.js
The Indonesian Language speech synthesizer based on the Hidden Markov Model | IEEE Conference Publication | IEEE Xplore

The Indonesian Language speech synthesizer based on the Hidden Markov Model


Abstract:

Speech synthesizer is a technology which gives the computer a capability to speech text sequences. In this research, we develop a speech synthesizer for Indonesian Langua...Show More

Abstract:

Speech synthesizer is a technology which gives the computer a capability to speech text sequences. In this research, we develop a speech synthesizer for Indonesian Language based on the Hidden Markov Model (HMM). The Speech synthesizer using the HMM can produce more appropriate result than based on the syllable concatenation. There are some studies of speech synthesizers using HMM for Indonesian Language. However, it still has some problems such as it still cannot distinguish between vowel "e" ("e" in "get" is different from "e" in "apple"); It cannot handle abbreviation, numbers, special characters, and foreign (English) terms widely. In this research, we also proposed some methods to solve those problems. To solve "e" problem, this research divided the HMM for the 2 "e" vowel. To solve the other problems, the "e" rules, the abbreviation rules, the number rules, the special character rules, and the foreign term rules are made. To evaluate the synthesizer, we employ two methods: the Mean Opinion Score (MOS) to measure the naturalness of synthesized speech; and the Semantically Unpredictable Sentence (SUS) to measure the accuracy of the synthesized speech. Result shows that the developed speech synthesizer improved the naturalness of synthesized speech. It achieves 4.1 for MOS point and 96,07 % word accuracy.
Date of Conference: 24-25 November 2014
Date Added to IEEE Xplore: 19 February 2015
ISBN Information:
Conference Location: Kuta, Bali, Indonesia
No metrics found for this document.

I. Introduction

Nowadays, we interact with computer using keyboard, mouse, and touch screen. However, typing for computer input is not natural for human interaction. We also usually see the computer screen if the output is visual. Some research focused on how to increase the naturalness of interaction. Speech as our most of natural means communication can be used when interacting with computer. Some of the technologies employing speech such as voice recognizer and automatic speech synthesizer have been developed for many languages, especially English [1].

Usage
Select a Year
2024

View as

Total usage sinceMar 2015:110
00.511.522.53JanFebMarAprMayJunJulAugSepOctNovDec200010000002
Year Total:5
Data is updated monthly. Usage includes PDF downloads and HTML views.
Contact IEEE to Subscribe

References

References is not available for this document.