LAUSR.org creates dashboard-style pages of related content for over 1.5 million academic articles. Sign Up to like articles & get recommendations!

Diacritizing Arabic Text Using a Single Hidden Markov Model

Photo by miklevasilyev from unsplash

The main obstacle in the development of a robust Arabic text-to-speech system is the ability to place diacritics, that is, small marks above and below the raw text that indicate… Click to show full abstract

The main obstacle in the development of a robust Arabic text-to-speech system is the ability to place diacritics, that is, small marks above and below the raw text that indicate the correct pronunciation. In this paper, we propose a system that can retrieve the diacritics that match a given Arabic text. First, the system injects the raw text into an engine that is based on a single hidden Markov model. The engine then generates an optimal path through its states. The system finally matches the state sequence with its equivalent diacritics and sets them in place within the text. Experiential results on diverse data sets demonstrate the robustness of the proposed system, even with samples that are novel to the system.

Keywords: hidden markov; system; single hidden; markov model; arabic text; text

Journal Title: IEEE Access
Year Published: 2018

Link to full text (if available)


Share on Social Media:                               Sign Up to like & get
recommendations!

Related content

More Information              News              Social Media              Video              Recommended



                Click one of the above tabs to view related content.