Staff View: Information Extraction from Heterogeneous WWW Resources

Information Extraction from Heterogeneous WWW Resources

The information available on the WWW is growing very fast. However, a fundamental problem with the information on the WWW is its lack of structure, making its exploitation very difficult. As a result, the desired information is getting more difficult to retrieve and extract. To overcome this problem...

Bibliographic Details
Main Authors:	Sulong, Muhammad Suhaizan; Meziane, Farid
Format:	Conference or Workshop Item
Language:	English
Published:	2004
Subjects:	QA75 Electronic computers. Computer science
Online Access:	http://eprints.utem.edu.my/id/eprint/1867/1/wwcs2004.pdf http://eprints.utem.edu.my/id/eprint/1867/ http://www.damai-sciences.com/cd_wwcs.html

id	my.utem.eprints.1867
record_format	eprints
spelling	my.utem.eprints.1867 2015-05-28T02:25:44Z http://eprints.utem.edu.my/id/eprint/1867/ Information Extraction from Heterogeneous WWW Resources Sulong, Muhammad Suhaizan Meziane, Farid QA75 Electronic computers. Computer science The information available on the WWW is growing very fast. However, a fundamental problem with the information on the WWW is its lack of structure making its exploitation very difficult. As a result, the desired information is getting more difficult to retrieve and extract. To overcome this problem many tools and techniques are being developed and used for locating the web pages of interest and extracting the desired information from these pages. In this paper we present the first prototype of an Information Extraction (IE) system that attempts to extract information on different Computer Science related courses offered by British Universities. 2004-06 Conference or Workshop Item PeerReviewed application/pdf en http://eprints.utem.edu.my/id/eprint/1867/1/wwcs2004.pdf Sulong, Muhammad Suhaizan and Meziane, Farid (2004) Information Extraction from Heterogeneous WWW Resources. In: 7th International Conference on Work with Computing Systems (WWCS), 29 Jun - 2 Jul 2004, Kuala Lumpur, Malaysia. http://www.damai-sciences.com/cd_wwcs.html
institution	Universiti Teknikal Malaysia Melaka
building	UTEM Library
collection	Institutional Repository
continent	Asia
country	Malaysia
content_provider	Universiti Teknikal Malaysia Melaka
content_source	UTEM Institutional Repository
url_provider	http://eprints.utem.edu.my/
language	English
topic	QA75 Electronic computers. Computer science
spellingShingle	QA75 Electronic computers. Computer science Sulong, Muhammad Suhaizan Meziane, Farid Information Extraction from Heterogeneous WWW Resources
description	The information available on the WWW is growing very fast. However, a fundamental problem with the information on the WWW is its lack of structure, making its exploitation very difficult. As a result, the desired information is getting more difficult to retrieve and extract. To overcome this problem, many tools and techniques are being developed and used for locating the web pages of interest and extracting the desired information from these pages. In this paper we present the first prototype of an Information Extraction (IE) system that attempts to extract information on different Computer Science-related courses offered by British Universities.
format	Conference or Workshop Item
author	Sulong, Muhammad Suhaizan Meziane, Farid
author_facet	Sulong, Muhammad Suhaizan Meziane, Farid
author_sort	Sulong, Muhammad Suhaizan
title	Information Extraction from Heterogeneous WWW Resources
title_short	Information Extraction from Heterogeneous WWW Resources
title_full	Information Extraction from Heterogeneous WWW Resources
title_fullStr	Information Extraction from Heterogeneous WWW Resources
title_full_unstemmed	Information Extraction from Heterogeneous WWW Resources
title_sort	information extraction from heterogeneous www resources
publishDate	2004
url	http://eprints.utem.edu.my/id/eprint/1867/1/wwcs2004.pdf http://eprints.utem.edu.my/id/eprint/1867/ http://www.damai-sciences.com/cd_wwcs.html
_version_	1665905254036668416
score	13.211869
