Study and Analysis of Multilingual Handwritten Characters Recognition using SVM Classifier

Ujwal Singh Vohra, ShriPrakash Dwivedi and H. L. Mandoria

Department of Information Technology, G. B. Pant University of Agriculture and Technology, Pantnagar, India

DOI : http://dx.doi.org/10.13005/ojcst/9.02.07

Article Publishing History
Article Received on :
Article Accepted on :
Article Published : 08 Aug 2016
ABSTRACT:

Researchers continue to work on character recognition systems that can detect the writing and language of individuals. Multilingual handwritten character recognition plays a vital role here, because it recognizes characters written in different languages and in different styles. The research work presented in this thesis studies and analyzes the recognition of multilingual handwritten characters with a high level of accuracy, using a support vector machine (SVM) as the classifier. We use Hindi and English, together with special characters and numerals, and attempt to recognize them with our system.

KEYWORDS: OCR; Handwritten Characters; Recognition; SVM

Cite this article:

Vohra U. S, Dwivedi S. P, Mandoria H. L. Study and Analysis of Multilingual Handwritten Characters Recognition using SVM Classifier. Orient. J. Comp. Sci. and Technol;9(2). Available from: http://www.computerscijournal.org/?p=3767

Introduction

Multilingual Handwritten Characters Recognition

When a document to be processed contains handwritten characters from more than one language, the task is referred to as Multilingual Handwritten Characters Recognition [11]. In figure 1.1, for example, Hindi and English are taken as the input languages, and figure 1.2 shows the Multilingual Handwritten Characters Recognition process.

Figure 1


Previous Work

According to some research papers, character recognition began in the late 1970s, and a great deal of research has since been done on character recognition and handwritten character recognition. Many researchers have put forward approaches and supported them with results. Deshpande et al. (2008) proposed a Gaussian filter to produce a feature vector of dimension 200 (5x5x8), and achieved an accuracy of 94% with a support vector machine (SVM) as the classifier [5]. Dogra and Prakash (2012) suggested an automatic recognition method for OCR. They attempted handwritten Hindi character recognition using an SVM classifier with diagonal feature extraction, and achieved an accuracy of 93.06% [6]. Bansal and Khan (2014) proposed an efficient method for recognizing Hindi handwritten numerals using energy and chain codes, with an SVM for classification. An average recognition rate of 90.1% was achieved using four segment methods [4]. Singh and Maring (2015) obtained a result of 97.61% using SVM and ANN for handwritten Devanagari character recognition, and reported the recognition rates obtained by the ANN and SVM classifiers for different image sizes [10]. Farkya et al. (2015) reported that when a whole document was scanned and passed to the OCR, the recognition rate decreased slightly to 96% [7].

Work flow of Multilingual Handwritten Characters recognition system:

Figure 1.3: Pictorial representation of Multilingual Handwritten Characters recognition system


Results and Discussions

Various handwriting samples are collected and captured or scanned for further processing. By clicking on the upload image option, the user can select an image in one of several formats (such as .png, .jpg, .jpeg) from the desired location. Various operations are then performed on the selected image so that the handwritten characters it contains can be extracted and recognized by the system. We consider four different cases of multilingual handwritten character images, and the output of each case is shown in a Notepad file.
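The format check implied by the upload step can be sketched as follows. This is only an illustration: the helper name `is_supported_image` is hypothetical, and the set of extensions is limited to the three named in the text, although the actual system may accept more.

```python
from pathlib import Path

# Extensions named in the text; the real system may accept others (assumption).
SUPPORTED_EXTENSIONS = {".png", ".jpg", ".jpeg"}


def is_supported_image(filename: str) -> bool:
    """Return True if the file name ends in one of the supported extensions."""
    # Compare case-insensitively so that "SCAN.PNG" is treated like "scan.png".
    return Path(filename).suffix.lower() in SUPPORTED_EXTENSIONS
```

A call such as `is_supported_image("sample.PNG")` returns `True`, while a file without one of the listed extensions is rejected before any further processing.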

Figure 3


Case 1: Multilingual Handwritten Characters Image

Figure 4


Accuracy(%) = 96.96

Case 2: Multilingual Handwritten Characters With Skew

Figure 5


Accuracy(%) = 94.28

Case 3: Multilingual Handwritten Characters Large Noisy Image

Figure 6


Accuracy(%) = 92.0

Case 4: Large Handwritten Document Image

Figure 7


Accuracy(%) = 96.95

The graph in figure 1.12 plots the accuracy values obtained for the images taken in the different cases. The x axis shows the number of images taken for recognition, and the y axis shows the accuracy for each image as a percentage.
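The text does not state how the per-image accuracy is computed. A common definition, assumed here, is the percentage of characters recognized correctly. The sketch below uses a hypothetical helper `recognition_accuracy` and made-up strings, not data from the study.

```python
def recognition_accuracy(expected: str, recognized: str) -> float:
    """Percentage of positions where the recognized character matches the expected one."""
    if not expected:
        raise ValueError("expected text must not be empty")
    if len(expected) != len(recognized):
        raise ValueError("texts must have the same length for a position-wise comparison")
    # Count the positions where both strings hold the same character.
    correct = sum(e == r for e, r in zip(expected, recognized))
    return 100.0 * correct / len(expected)


# Illustrative strings only: four of the five characters match.
print(recognition_accuracy("AB12?", "AB12!"))  # 80.0
```

A position-wise comparison is the simplest choice and assumes the system outputs exactly one symbol per input character. If characters can be dropped or inserted, an edit-distance measure would be more appropriate.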

Conclusion

While processing handwritten characters, various degradations are found in scanned or captured images of multilingual handwritten characters, such as broken lines, incomplete characters, and differing writing styles. These degradations may be due to defects in the paper, the writer's handwriting, or the capture or digitization of the image. While processing characters (such as अं ! ; : “ ? = % अःङ etc.), we found that the system treated some unknown characters as noise, which it depicts as ¤. The study found an average accuracy of 95.048% for Hindi, English, numerals, and special characters in a single captured or scanned image. We took 50 samples per character from different writers for training and 50 samples per character for testing with the SVM classifier. This study could be extended by using HOG features with an SVM for handwritten script recognition of different languages combined in a single document image.
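The reported average appears to be the unweighted mean of the four per-case accuracies in the Results and Discussions section. The text does not say so explicitly, so this is an inference. A short check, with `Decimal` used to avoid binary floating-point rounding:

```python
from decimal import Decimal, ROUND_HALF_UP

# Per-case accuracies (%) as reported in the Results and Discussions section.
case_accuracies = {
    "Case 1: multilingual image": Decimal("96.96"),
    "Case 2: image with skew": Decimal("94.28"),
    "Case 3: large noisy image": Decimal("92.0"),
    "Case 4: large document image": Decimal("96.95"),
}

# Unweighted mean over the four cases (assumed to be how the average was obtained).
average = sum(case_accuracies.values()) / len(case_accuracies)
rounded = average.quantize(Decimal("0.001"), rounding=ROUND_HALF_UP)

print(average)  # 95.0475
print(rounded)  # 95.048
```

The exact mean is 95.0475, which rounds to the 95.048% stated above.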

References

  1. Arica, N. and Yarman-Vural, F. T. 2001. An overview of character recognition focused on off-line handwriting. IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews, 31:2.
  2. Arora, S. 2010. Performance comparison of SVM and ANN for handwritten Devnagari character recognition. International Journal of Computer Science, IJCSI, 7, 3.
  3. Arya, S. C., Singh, R. S. and Mandoria, H. L. 2015. Image Denoising in Hand Written Document for Degraded Documents using Wiener Filter Algorithm. International Journal for Research in Emerging Science and Technology, 2, 7.
  4. Bansal, C. and Khan, A. 2014. Handwritten numeral recognition using SVM and chain code. IJARET, 2: VII.
  5. Deshpande, P. S., Malik, L. and Arora, S. 2008. Recognition of hand written Devnagari characters with percentage component regular expression matching and classification tree. TENCON, IEEE, 2159-3442, 1–4.
  6. Dogra, S. and Prakash, C. 2012. Pehchaan: Hindi handwritten character recognition system based on SVM. International Journal on Computer Science and Engineering, IJCSE, 4.
  7. Farkya, S., Surampudi, G. and Kothari, A. 2015. Hindi Speech Synthesis by concatenation of recognized Hand written Devnagri Script using Support Vector Machines Classifier. IEEE ICCSP conference. 2, 4, 0893–0898.
  8. Jawahar, C. V., Kumar, P. and Kiran, S. S. K. 2003. A bilingual OCR for Hindi-Telugu documents and its applications. Seventh International Conference on Document Analysis and Recognition (ICDAR), 1, 408–412.
  9. Otsu, N. 1979. A threshold selection method from gray-level histograms. IEEE Transactions on Systems, Man, and Cybernetics, SMC-9, 1, 62–66.
  10. Singh, A. and Maring, K. A. 2015. Handwritten Devanagari character recognition using SVM and ANN. International Journal of Advanced Research in Computer and Communication Engineering, 4, 8.
  11. Vohra, U. S., Dwivedi, S. P. and Mandoria, H. L. 2016. An analytical study of handwritten character recognition. i-manager’s Journal on Pattern Recognition, 2, 4.
  12. Paliwal, S., Singh, R. S. and Mandoria, H. L. A survey on various text detection and extraction techniques from videos and images. International Journal of Computer Science Engineering and Information Technology Research (IJCSEITR), 6, 3, 1-10.

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.