Mixed-Myanmar and English Character Recognition with Formatting
This paper proposes a system for recognizing Myanmar and English typeface characters together with their formatting. The system converts Portable Document Format (.pdf) files into machine-editable Word documents (.doc). It consists of two parts: recognition and formatting. Myanmar and English characters are recognized by MICR (Myanmar Intelligent Character Recognition), which is a kind of ICR. MICR uses both statistical and semantic information, and the final decision is made by a voting system. MICR has been successful in the character recognition area in recent years and can recognize characters with a high accuracy rate and at high speed. Table classification is used to recognize table formats, and the Hough Transformation is used to detect lines during table recognition. The system recovers not only paragraph formatting but also text formatting. Paragraph formatting includes alignment (left, right and center). Text formatting includes font color, font size, bold, etc. The system is implemented using image processing techniques and Matlab programming.
Keywords: Character Recognition, MICR, Table Format, Hough Transformation, Text Format, Paragraph Format
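
The abstract states that the Hough Transformation is used to detect lines in table recognition. The following is a minimal sketch of that general technique, not the authors' implementation. It is written in Python rather than Matlab, uses only the standard library, and the function name `hough_lines` and its parameters (`n_theta`, `threshold`) are hypothetical names chosen for illustration. It assumes the page has already been binarized and that the foreground (ink) pixels are given as (x, y) coordinates.

```python
import math

def hough_lines(points, width, height, n_theta=180, threshold=5):
    """Return (rho, theta_degrees, votes) for lines supported by >= threshold points.

    Lines are in normal form: rho = x*cos(theta) + y*sin(theta).
    `points` is an iterable of (x, y) foreground pixel coordinates
    (assumed to come from a binarized page image).
    """
    # Largest possible |rho| is the image diagonal.
    diag = int(math.ceil(math.hypot(width, height)))
    thetas = [math.pi * t / n_theta for t in range(n_theta)]
    # Accumulator: rows index rho in [-diag, diag], columns index theta.
    acc = [[0] * n_theta for _ in range(2 * diag + 1)]

    # Each foreground pixel votes for every line that could pass through it.
    for x, y in set(points):
        for t, theta in enumerate(thetas):
            rho = int(round(x * math.cos(theta) + y * math.sin(theta)))
            acc[rho + diag][t] += 1

    # Keep accumulator cells with enough votes.
    lines = []
    for r, row in enumerate(acc):
        for t, votes in enumerate(row):
            if votes >= threshold:
                lines.append((r - diag, t * 180.0 / n_theta, votes))
    return lines

# Toy "table": one horizontal ruling (y = 3) and one vertical ruling (x = 5).
horizontal = [(x, 3) for x in range(10)]
vertical = [(5, y) for y in range(10)]
table_lines = hough_lines(horizontal + vertical, width=10, height=10, threshold=10)
```

In this sketch a horizontal ruling appears as a peak near theta = 90 degrees and a vertical ruling as a peak near theta = 0 degrees. Neighbouring angles may also reach the threshold, so a practical system would typically apply some form of peak suppression before treating the result as table borders.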








