BACKGROUND Processing free-text medical documents is difficult because of the typographical errors, synonyms, and abbreviations that occur in them. METHODS In this study, the applicability of the most common string similarity measures to keyword-based medical text search was analyzed and compared. RESULTS The usefulness of the similarity measures was studied on a set of medical documents containing more than 20,000 echocardiography reports. Experimental results showed that the Jaro-Winkler dissimilarity measure was the most capable of the measures for exploring the content of the medical texts.
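To make the idea of keyword search with a string similarity measure concrete, the following is a minimal Python sketch of the standard Jaro-Winkler similarity together with a simple token-level search. It is not the authors' implementation: the function names, the 0.9 threshold, the tokenization, and the sample report text are all assumptions chosen for illustration. The abstract speaks of a dissimilarity measure; here that is assumed to be one minus the similarity.

```python
import re


def jaro(s1: str, s2: str) -> float:
    """Jaro similarity in [0, 1]; 1.0 means the strings are identical."""
    if s1 == s2:
        return 1.0
    len1, len2 = len(s1), len(s2)
    if len1 == 0 or len2 == 0:
        return 0.0
    # Characters count as matching only within this distance of each other.
    window = max(max(len1, len2) // 2 - 1, 0)
    matched1 = [False] * len1
    matched2 = [False] * len2
    matches = 0
    for i, ch in enumerate(s1):
        lo = max(0, i - window)
        hi = min(len2, i + window + 1)
        for j in range(lo, hi):
            if not matched2[j] and s2[j] == ch:
                matched1[i] = matched2[j] = True
                matches += 1
                break
    if matches == 0:
        return 0.0
    # Count matched characters that appear in a different order.
    k = 0
    out_of_order = 0
    for i in range(len1):
        if matched1[i]:
            while not matched2[k]:
                k += 1
            if s1[i] != s2[k]:
                out_of_order += 1
            k += 1
    t = out_of_order / 2  # number of transpositions
    return (matches / len1 + matches / len2 + (matches - t) / matches) / 3


def jaro_winkler(s1: str, s2: str, p: float = 0.1, max_prefix: int = 4) -> float:
    """Jaro similarity boosted for a shared prefix of up to max_prefix characters."""
    j = jaro(s1, s2)
    prefix = 0
    for a, b in zip(s1, s2):
        if a != b or prefix == max_prefix:
            break
        prefix += 1
    return j + prefix * p * (1 - j)


def jaro_winkler_dissimilarity(s1: str, s2: str) -> float:
    """Assumed definition: one minus the Jaro-Winkler similarity."""
    return 1.0 - jaro_winkler(s1, s2)


def fuzzy_find(keyword: str, text: str, threshold: float = 0.9):
    """Return (token, similarity) pairs for tokens close to the keyword.

    The threshold and the lowercase alphabetic tokenization are
    illustrative assumptions, not values taken from the study.
    """
    keyword = keyword.lower()
    hits = []
    for token in re.findall(r"[a-z]+", text.lower()):
        score = jaro_winkler(keyword, token)
        if score >= threshold:
            hits.append((token, score))
    return hits


# Hypothetical report fragment containing a typo of "stenosis".
print(fuzzy_find("stenosis", "Mild aortic stenossi noted"))
```

In this sketch the misspelled token "stenossi" is still retrieved for the keyword "stenosis", while unrelated tokens fall below the threshold. A real search over clinical reports would also need to handle synonyms and abbreviations, which character-level similarity alone does not capture.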
               