LAUSR.org creates dashboard-style pages of related content for over 1.5 million academic articles. Sign Up to like articles & get recommendations!

DKNet: Deep Kuzushiji Characters Recognition Network

Photo by patrickltr from unsplash

Kuzushiji, a cursive writing style, had been extensively utilized in Japan for over a thousand years starting from the $8^{th}$ century. In 1900, Kuzushiji was not included in regular school… Click to show full abstract

Kuzushiji, a cursive writing style, had been extensively utilized in Japan for over a thousand years starting from the $8^{th}$ century. In 1900, Kuzushiji was not included in regular school curricula due to the change in the Japanese writing system. Nowadays Japanese natives are unable to read historical books that were written using Kuzushiji language. Therefore, libraries and museums have decided to build digital copies of the documents and books that were written in Kuzushiji language. Due to a limited number of trained experts, researchers have utilized machine and deep learning models to convert historical documents and books into a modern script that can be easily read by human beings. However, the existing deep learning techniques suffer from over-fitting and gradient vanishing problems. To overcome these problems, an efficient deep Kuzushiji characters recognition network (DKNet) is proposed. Initially, to remove noise from the training images, a trilateral joint filter is applied. Contrast limited adaptive histogram equalization (CLAHE) is then applied to enhance the visibility of filtered images. Thereafter, a pre-trained MobileNet is utilized to extract the features of Kuzushiji characters. MobileNet’s final layers are removed, including the fully connected layer and softmax. The flatten layer is then applied to the input. A fully connected classification layer is then used with Rectified linear units (ReLUs) and dropouts. Dropouts are used to generalize the model, thus preventing the over-fitting problem. Finally, the softmax activation function is employed to provide the recognition results. To test the proposed model, actual documents are first segmented by using the proposed Maximally stable extremal regions (MSERs) and convexhull-based segmentation approach. Segmented characters are then recognized using the trained DKNet. Extensive comparative analyses reveal that DKNet achieves better performance than the competitive models in terms of various performance metrics. An efficient Application Programming Interface (API) is also designed for Japanese Kuzushiji ancient heritage character recognition to help the end-users.

Keywords: kuzushiji characters; characters recognition; deep kuzushiji; recognition network; dknet

Journal Title: IEEE Access
Year Published: 2022

Link to full text (if available)


Share on Social Media:                               Sign Up to like & get
recommendations!

Related content

More Information              News              Social Media              Video              Recommended



                Click one of the above tabs to view related content.