Chinese character recognition using neural network / Low Poh Tian

This project explores to the application of neural networks to the problem of identifying handwritten Chinese Characters in an automated manner. In particular, a backpropagation net is trained on a 5 Chinese Character fonts. The scope originally was to recognize five characters; it could easily be t...

Full description

Saved in:
Bibliographic Details
Main Author: Low, Poh Tian
Format: Thesis
Published: 2003
Subjects:
Online Access:http://studentsrepo.um.edu.my/12628/1/Low_Poh_Tian.pdf
http://studentsrepo.um.edu.my/12628/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1831436121074040832
author Low, Poh Tian
author_facet Low, Poh Tian
author_sort Low, Poh Tian
building UM Library
collection Institutional Repository
content_provider Universiti Malaya
content_source UM Student Repository
continent Asia
country Malaysia
description This project explores to the application of neural networks to the problem of identifying handwritten Chinese Characters in an automated manner. In particular, a backpropagation net is trained on a 5 Chinese Character fonts. The scope originally was to recognize five characters; it could easily be trained to recognize Chinese fonts. In process of training data, an original set of chine font was first generated. From this set, 4 other set were derived with different rotation angles. This provide a total of 20 images for the training data. In this system, each character is captured in a 320*240 pixel, black and white BITMAP file. During image processing a Bitmap file is broken up into a 1OO components, each being a 32*24 pixel in dimension. If the "on" pixels in this region is more than 10% of the total pixels in tire region, the component is set to "1" otherwise, it is set to "0". According to the implementation of this system, the input layer consists of 100 nodes and the output layer 5 nodes, each representing a character. Experiment has been carried out to determine the hidden layer size be/ ore it is set to 25 nodes. The training parameters of the network i.e. the learning rate and momentum has both fixed to 0.6. The extracted image data is then feed into the network, the weights of which are modified by error-backpropagation algorithm. The network has successfully been train to the error tolerance of 0.0001. As a result, it is able to recognize all the characters with which it is trained. However, other free-form (handwritten) cannot be recognize due to the limited training data.
format Thesis
id my.um.stud-12628
institution Universiti Malaya
publishDate 2003
record_format eprints
spelling my.um.stud-126282021-11-23T22:54:28Z Chinese character recognition using neural network / Low Poh Tian Low, Poh Tian QA75 Electronic computers. Computer science This project explores to the application of neural networks to the problem of identifying handwritten Chinese Characters in an automated manner. In particular, a backpropagation net is trained on a 5 Chinese Character fonts. The scope originally was to recognize five characters; it could easily be trained to recognize Chinese fonts. In process of training data, an original set of chine font was first generated. From this set, 4 other set were derived with different rotation angles. This provide a total of 20 images for the training data. In this system, each character is captured in a 320*240 pixel, black and white BITMAP file. During image processing a Bitmap file is broken up into a 1OO components, each being a 32*24 pixel in dimension. If the "on" pixels in this region is more than 10% of the total pixels in tire region, the component is set to "1" otherwise, it is set to "0". According to the implementation of this system, the input layer consists of 100 nodes and the output layer 5 nodes, each representing a character. Experiment has been carried out to determine the hidden layer size be/ ore it is set to 25 nodes. The training parameters of the network i.e. the learning rate and momentum has both fixed to 0.6. The extracted image data is then feed into the network, the weights of which are modified by error-backpropagation algorithm. The network has successfully been train to the error tolerance of 0.0001. As a result, it is able to recognize all the characters with which it is trained. However, other free-form (handwritten) cannot be recognize due to the limited training data. 2003 Thesis NonPeerReviewed application/pdf http://studentsrepo.um.edu.my/12628/1/Low_Poh_Tian.pdf Low, Poh Tian (2003) Chinese character recognition using neural network / Low Poh Tian. Undergraduates thesis, Universiti Malaya. http://studentsrepo.um.edu.my/12628/
spellingShingle QA75 Electronic computers. Computer science
Low, Poh Tian
Chinese character recognition using neural network / Low Poh Tian
title Chinese character recognition using neural network / Low Poh Tian
title_full Chinese character recognition using neural network / Low Poh Tian
title_fullStr Chinese character recognition using neural network / Low Poh Tian
title_full_unstemmed Chinese character recognition using neural network / Low Poh Tian
title_short Chinese character recognition using neural network / Low Poh Tian
title_sort chinese character recognition using neural network / low poh tian
topic QA75 Electronic computers. Computer science
url http://studentsrepo.um.edu.my/12628/1/Low_Poh_Tian.pdf
http://studentsrepo.um.edu.my/12628/
url_provider http://studentsrepo.um.edu.my/