Junction point detection and identification of broken character in touching Arabic handwritten text using overlapping set theory


Bibliographic Details
Main Authors: Ullah, Inam, Azmi, Mohd Sanusi, Desa, Mohammad Ishak
Format: Article
Language: English
Published: Science and Information Organization 2019
Online Access: http://eprints.utem.edu.my/id/eprint/24350/2/002-PAPER_36-JUNCTION_POINT_DETECTION_AND_IDENTIFICATION.PDF
http://eprints.utem.edu.my/id/eprint/24350/
https://thesai.org/Downloads/Volume10No6/Paper_36-Junction_Point_Detection_and_Identification.pdf
Description
Summary: Touching characters are formed when two or more characters share the same space. Segmenting these touching characters is therefore a challenging research topic, especially for degraded handwritten Arabic documents, and it is one of the key issues in recognizing handwritten Arabic text. To make recognition systems more effective, segmentation of touching handwritten Arabic characters is considered an important research area. In this research, a new method is proposed that identifies the junction (common) point of an Arabic touching-word image by applying the set-theoretic overlapping (intersection) operation. This helps to trace the correct boundary of the touching characters, identify broken characters, and segment the touching handwritten text efficiently. The proposed method has been evaluated on Arabic touching handwritten characters taken from handwritten datasets. The results show the efficiency of the proposed method. The proposed method is applicable to both degraded handwritten documents and printed documents.
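
The summary does not give the authors' algorithm, so the following is only a minimal sketch of the general idea it names: if each stroke is treated as a set of pixel coordinates, the pixels shared by two touching strokes are their set intersection. The grid encoding, the `'*'` marker for a shared pixel, and the function names `stroke_pixels` and `junction_points` are all assumptions made for illustration, not part of the published method.

```python
from typing import List, Set, Tuple

Pixel = Tuple[int, int]  # (row, column)

# Hypothetical toy image: 'A' and 'B' mark pixels of two strokes,
# '*' marks a pixel that both strokes pass through, '.' is background.
GRID: List[str] = [
    "A....",
    ".A...",
    "..*BB",
    ".A...",
]


def stroke_pixels(rows: List[str], label: str) -> Set[Pixel]:
    """Return the coordinates belonging to the stroke `label`.

    A cell marked '*' is counted as part of every stroke (assumption).
    """
    return {
        (r, c)
        for r, row in enumerate(rows)
        for c, ch in enumerate(row)
        if ch in (label, "*")
    }


def junction_points(a: Set[Pixel], b: Set[Pixel]) -> Set[Pixel]:
    """Pixels common to both strokes, i.e. the set intersection."""
    return a & b


stroke_a = stroke_pixels(GRID, "A")
stroke_b = stroke_pixels(GRID, "B")
shared = junction_points(stroke_a, stroke_b)
print(sorted(shared))
```

In a real touching-word image the two strokes are not labelled in advance, so obtaining the two pixel sets is the hard part; this sketch only illustrates the intersection step once such sets are available.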