Information Extraction from Heterogeneous WWW Resources

The information available on the WWW is growing very fast. However, a fundamental problem with the information on the WWW is its lack of structure making its exploitation very difficult. As a result, the desired information is getting more difficult to retrieve and extract. To overcome this problem...

Full description

Saved in:
Bibliographic Details
Main Authors: Sulong, Muhammad Suhaizan, Meziane, Farid
Format: Conference or Workshop Item
Language:English
Published: 2004
Subjects:
Online Access:http://eprints.utem.edu.my/id/eprint/1867/1/wwcs2004.pdf
http://eprints.utem.edu.my/id/eprint/1867/
http://www.damai-sciences.com/cd_wwcs.html
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:The information available on the WWW is growing very fast. However, a fundamental problem with the information on the WWW is its lack of structure making its exploitation very difficult. As a result, the desired information is getting more difficult to retrieve and extract. To overcome this problem many tools and techniques are being developed and used for locating the web pages of interest and extracting the desired information from these pages. In this paper we present the first prototype of an Information Extraction (IE) system that attempts to extract information on different Computer Science related courses offered by British Universities.