International Journal of Innovative Research in Engineering and Management
Year: 2015, Volume: 3, Issue: 4
First page: (13) Last page: (18)
Online ISSN : 2350-0557.
Vikas K. Yeotikar, Manish T. Wanjari
Text extraction from document images is an important research area. Extracting information in the form of text involves detection, localization, tracking, extraction, enhancement, and recognition of the text in a given document image. A large number of techniques have been proposed to address this problem. This paper proposes a novel method that uses three feature extraction techniques, namely Gabor, Wavelet, and Hough, to detect text objects in document images. The performance of the proposed method is tested on the NIST document image dataset.
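To make the three feature families named in the abstract concrete, the following is a minimal NumPy sketch of a Gabor filter response, a one-level Haar wavelet decomposition, and a Hough line accumulator. It is an illustration only and not the authors' implementation: the abstract does not give kernel sizes, the wavelet family, or how the features are combined, so every function name and parameter value here (`gabor_kernel`, `filter_valid`, `haar_level1`, `hough_accumulator`, `size=15`, `sigma=3.0`, and so on) is an assumption chosen for the example.

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view


def gabor_kernel(size=15, sigma=3.0, theta=0.0, wavelength=6.0, gamma=0.5):
    """Real (cosine) Gabor kernel; all parameter defaults are illustrative."""
    half = size // 2
    y, x = np.mgrid[-half:half + 1, -half:half + 1]
    # Rotate coordinates so the carrier wave runs along direction theta.
    xr = x * np.cos(theta) + y * np.sin(theta)
    yr = -x * np.sin(theta) + y * np.cos(theta)
    envelope = np.exp(-(xr ** 2 + (gamma * yr) ** 2) / (2.0 * sigma ** 2))
    return envelope * np.cos(2.0 * np.pi * xr / wavelength)


def filter_valid(img, kernel):
    """Correlate img with kernel, keeping only fully overlapping positions."""
    windows = sliding_window_view(np.asarray(img, dtype=float), kernel.shape)
    return np.einsum("ijkl,kl->ij", windows, kernel)


def haar_level1(img):
    """One level of the orthonormal 2D Haar transform: (LL, LH, HL, HH)."""
    img = np.asarray(img, dtype=float)
    img = img[: img.shape[0] // 2 * 2, : img.shape[1] // 2 * 2]  # even size
    a, b = img[0::2, 0::2], img[0::2, 1::2]
    c, d = img[1::2, 0::2], img[1::2, 1::2]
    ll = (a + b + c + d) / 2.0  # local average (approximation band)
    lh = (a + b - c - d) / 2.0  # difference between rows
    hl = (a - b + c - d) / 2.0  # difference between columns
    hh = (a - b - c + d) / 2.0  # diagonal difference
    return ll, lh, hl, hh


def hough_accumulator(edges, n_theta=180):
    """Vote for lines rho = x*cos(theta) + y*sin(theta) over a binary edge map."""
    h, w = edges.shape
    diag = int(np.ceil(np.hypot(h, w)))
    thetas = np.deg2rad(np.arange(n_theta) * (180.0 / n_theta))
    ys, xs = np.nonzero(edges)
    rhos = np.rint(xs[:, None] * np.cos(thetas)
                   + ys[:, None] * np.sin(thetas)).astype(int)
    acc = np.zeros((2 * diag + 1, n_theta), dtype=int)
    cols = np.broadcast_to(np.arange(n_theta), rhos.shape)
    np.add.at(acc, (rhos + diag, cols), 1)  # row index is rho shifted by diag
    return acc, thetas, diag


# Synthetic stand-in for a document image: vertical stripes with period 6.
page = np.tile(np.cos(2.0 * np.pi * np.arange(36) / 6.0), (32, 1))
gabor_energy = np.mean(filter_valid(page, gabor_kernel(theta=0.0)) ** 2)
ll, lh, hl, hh = haar_level1(page)
votes, thetas, diag = hough_accumulator(np.abs(hl) > 0.5)
```

In a text detector, responses such as `gabor_energy`, the energies of the Haar detail bands, and peaks in `votes` could be collected per image block and passed to a classifier. That combination step is likewise an assumption, since the abstract does not describe it.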
Department of Computer Science, SSESA’s, Science College, Congress Nagar, Nagpur (MH), India, 9405436996.