LAUSR.org creates dashboard-style pages of related content for over 1.5 million academic articles. Sign Up to like articles & get recommendations!

Toward multi-lingual information retrieval system based on internet linguistic diversity measurement

Photo by alterego_swiss from unsplash

Abstract We introduce a method for measuring the quantity of online content of a set of languages at domain level. This measurement is used for building a Multi-Lingual Information Retrieval… Click to show full abstract

Abstract We introduce a method for measuring the quantity of online content of a set of languages at domain level. This measurement is used for building a Multi-Lingual Information Retrieval (MLIR) system that identifies which languages are strongly represented on the internet about a specific query topic. The system architecture includes two modules; the off-line module builds a linguistic diversity index for languages at topic level and the on-line module, where the suitable language for search is identified based the index for retrieving the relevant documents to the user query in that language. The conducted experiments explore the usefulness of building such an index and its usage effect on both of monolingual and traditional MLIR system. From the obtained results, it has been proven that the more internet resources, the better the accuracy of the retrieved results, and therefore the better the system performance.

Keywords: system; information retrieval; multi lingual; linguistic diversity; measurement; lingual information

Journal Title: Ain Shams Engineering Journal
Year Published: 2019

Link to full text (if available)


Share on Social Media:                               Sign Up to like & get
recommendations!

Related content

More Information              News              Social Media              Video              Recommended



                Click one of the above tabs to view related content.