Information Extraction from Heterogeneous WWW Resources
The information available on the WWW is growing very fast. However, a fundamental problem with the information on the WWW is its lack of structure making its exploitation very difficult. As a result, the desired information is getting more difficult to retrieve and extract. To overcome this problem...
Main Authors: | Sulong, Muhammad Suhaizan; Meziane, Farid |
---|---|
Format: | Conference or Workshop Item |
Language: | English |
Published: | 2004 |
Subjects: | QA75 Electronic computers. Computer science |
Online Access: | http://eprints.utem.edu.my/id/eprint/1867/1/wwcs2004.pdf http://eprints.utem.edu.my/id/eprint/1867/ http://www.damai-sciences.com/cd_wwcs.html |
id | my.utem.eprints.1867 |
---|---|
record_format | eprints |
spelling | my.utem.eprints.1867 2015-05-28T02:25:44Z http://eprints.utem.edu.my/id/eprint/1867/ Information Extraction from Heterogeneous WWW Resources Sulong, Muhammad Suhaizan Meziane, Farid QA75 Electronic computers. Computer science The information available on the WWW is growing very fast. However, a fundamental problem with the information on the WWW is its lack of structure making its exploitation very difficult. As a result, the desired information is getting more difficult to retrieve and extract. To overcome this problem many tools and techniques are being developed and used for locating the web pages of interest and extracting the desired information from these pages. In this paper we present the first prototype of an Information Extraction (IE) system that attempts to extract information on different Computer Science related courses offered by British Universities. 2004-06 Conference or Workshop Item PeerReviewed application/pdf en http://eprints.utem.edu.my/id/eprint/1867/1/wwcs2004.pdf Sulong, Muhammad Suhaizan and Meziane, Farid (2004) Information Extraction from Heterogeneous WWW Resources. In: 7th International Conference on Work with Computing Systems (WWCS), 29 Jun - 2 Jul 2004, Kuala Lumpur, Malaysia. http://www.damai-sciences.com/cd_wwcs.html |
institution | Universiti Teknikal Malaysia Melaka |
building | UTEM Library |
collection | Institutional Repository |
continent | Asia |
country | Malaysia |
content_provider | Universiti Teknikal Malaysia Melaka |
content_source | UTEM Institutional Repository |
url_provider | http://eprints.utem.edu.my/ |
language | English |
topic | QA75 Electronic computers. Computer science |
spellingShingle | QA75 Electronic computers. Computer science Sulong, Muhammad Suhaizan Meziane, Farid Information Extraction from Heterogeneous WWW Resources |
description | The information available on the WWW is growing very fast. However, a fundamental problem with the information on the WWW is its lack of structure making its exploitation very difficult. As a result, the desired information is getting more difficult to retrieve and extract. To overcome this problem many tools and techniques are being developed and used for locating the web pages of interest and extracting the desired information from these pages. In this paper we present the first prototype of an Information Extraction (IE) system that attempts to extract information on different Computer Science related courses offered by British Universities. |
format | Conference or Workshop Item |
author | Sulong, Muhammad Suhaizan; Meziane, Farid |
author_facet | Sulong, Muhammad Suhaizan; Meziane, Farid |
author_sort | Sulong, Muhammad Suhaizan |
title | Information Extraction from Heterogeneous WWW Resources |
title_short | Information Extraction from Heterogeneous WWW Resources |
title_full | Information Extraction from Heterogeneous WWW Resources |
title_fullStr | Information Extraction from Heterogeneous WWW Resources |
title_full_unstemmed | Information Extraction from Heterogeneous WWW Resources |
title_sort | information extraction from heterogeneous www resources |
publishDate | 2004 |
url | http://eprints.utem.edu.my/id/eprint/1867/1/wwcs2004.pdf http://eprints.utem.edu.my/id/eprint/1867/ http://www.damai-sciences.com/cd_wwcs.html |