
Newspaper text recognition of Gurumukhi script using random forest classifier

Newspapers contain crucial information about both current and memorable events, so newspaper text needs to be preserved in a computer-processable form that supports headline indexing and search over the text. Accurate text recognition depends on appropriate classification of the text based on extracted features. The random forest classifier is widely used in pattern recognition and computer vision applications. In this paper, we present recognition results obtained with a random forest classifier for newspaper text printed in Gurumukhi script. Different feature extraction techniques are used to extract character features, which are then fed to the random forest classifier. Standard k-fold cross-validation and a dataset partitioning strategy are used for the experimental work. The proposed method achieves maximum recognition accuracies of 96.9% with 5-fold cross-validation and 96.4% with the dataset partitioning strategy.
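
The two evaluation protocols named in the abstract can be sketched roughly as follows. This is a minimal, standard-library-only illustration, not the authors' code: the function names, the 80/20 default partition ratio, the random seeds, and the `fit_and_score` callback (standing in for training a random forest on character features and returning test accuracy) are all assumptions, since the abstract gives none of these details.

```python
import random
from typing import Callable, List, Sequence, Tuple

Split = Tuple[List[int], List[int]]  # (train indices, test indices)


def k_fold_splits(n_samples: int, k: int = 5, seed: int = 0) -> List[Split]:
    """Shuffle the sample indices once, then use each of the k folds as the test set in turn."""
    if not 2 <= k <= n_samples:
        raise ValueError("k must be between 2 and n_samples")
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    folds = [indices[i::k] for i in range(k)]  # fold sizes differ by at most one
    splits = []
    for i, test in enumerate(folds):
        train = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        splits.append((train, test))
    return splits


def partition_split(n_samples: int, train_fraction: float = 0.8, seed: int = 0) -> Split:
    """Single shuffled train/test partition; the 80/20 default is an assumption."""
    if not 0.0 < train_fraction < 1.0:
        raise ValueError("train_fraction must be strictly between 0 and 1")
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    cut = int(n_samples * train_fraction)
    return indices[:cut], indices[cut:]


def mean_accuracy(
    splits: Sequence[Split],
    fit_and_score: Callable[[List[int], List[int]], float],
) -> float:
    """Average the accuracy returned by a hypothetical fit_and_score callback over all splits."""
    scores = [fit_and_score(train, test) for train, test in splits]
    return sum(scores) / len(scores)
```

In practice, `fit_and_score` would train a classifier on the training indices and report accuracy on the test indices; a library such as scikit-learn provides `RandomForestClassifier` and `KFold` for this purpose. The 5-fold figure would correspond to `mean_accuracy(k_fold_splits(n, k=5), fit_and_score)`, and the partitioning figure to a single call on the output of `partition_split`.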

Keywords: newspaper text; recognition; forest classifier; text; random forest

Journal Title: Multimedia Tools and Applications
Year Published: 2019
