Handwritten word recognition by new feature and lexicon reduction

Document Type : Research Paper

Authors

1 Department of Computer engineering, Mobarakeh Branch, Islamic Azad University

2 Department of Computer engineering and IT, Payam Noor University

Abstract

Handwritten word recognition (HWR) is very important in document analysis and retrieval. In this paper, an off-line handwritten recognition system for Persian manuscript is introduced. For feature extraction, SIFT descriptors extracted densely from the block of word image and enriched by appending the normalized x and y coordinates and the scale they were extracted at. Linear discriminate analysis (LDA) is used for feature reduction. All words in the dictionary were hierarchically clustered by ISOCLUSE algorithm. In order to recognize the word images, multiple-class and two-class SVM classifiers methods were used. The experimental results showed a better performance in terms of speed and precision of two-class SVM method on the Iranshahr data set. The accuracy of proposed system by select 5 top cluster is shown 93.37% by 76.65% reduction of lexicon.

Keywords