Open Access Repository

An evaluation study of web monitoring : web monitoring vs. web crawling


Downloads per month over past year

Kim, YS 2009 , 'An evaluation study of web monitoring : web monitoring vs. web crawling', PhD thesis, University of Tasmania.

[img] PDF (Whole thesis)
whole_KimYangso...pdf | Request a copy
Full text restricted
Available under University of Tasmania Standard License.


Nowadays people use web search engines to find information. Even though these engines
endeavour to provide information in a complete and timely manner, there are significant
delays and under-coverage in their services. However, people sometimes want to obtain
new information from personally selected web pages without missing anything and with
little delay. Web monitoring tries to fulfil this goal by revisiting the selected web pages
frequently. Initially, web monitoring focused on the monitoring method, but then the
research emphasis changed in order to address the problem of information overload and
scheduling under limited resources.
This dissertation focuses on the following research problems to improve the efficiency of
web monitoring systems. Firstly, it analyses how efficiently a document classification
system that uses an incremental knowledge acquisition method, called MCRDR (Multiple
Classification Ripple-Down Rules), was used to resolve individual information overload
problems. Secondly, it discusses how MCRDR knowledge bases, standard web search
engines, and appropriate· web page locating heuristics can be employed in unison to locate
relevant monitoring web pages. Thirdly, it demonstrates that the web monitoring system
exhibits better performance in respect of service coverage and delay than commercial web
search engines. Lastly, it proposes a monitoring web page prioritization method that
decides the orders of monitoring sequence using the estimated service coverage and delay
of web search engines obtained by using various predictor variables identified from the web
crawling policies and statistical regression methods.

Item Type: Thesis - PhD
Authors/Creators:Kim, YS
Keywords: Web search engines, Internet searching, Search engines
Copyright Holders: The Author
Copyright Information:

Copyright 2009 the Author - The University is continuing to endeavour to trace the copyright
owner(s) and in the meantime this item has been reproduced here in good faith. We
would be pleased to hear from the copyright owner(s).

Additional Information:

Available for use in the Library but NOT for copying until 16th February 2011. After that date, available for use in the Library and copying in accordance with the Copyright Act 1968, as amended. Thesis (PhD)--University of Tasmania, 2009. Includes bibliographical references. Ch. 1. Introduction -- Ch. 2. Web monitoring - possibilities and limitations -- Ch. 3. Managing information overload using MCRDR document -- Ch. 4. Relevant web page retrieval from search engines reusing MCRDR knowledge bases -- Ch. 5. Monitoring web page location huristics -- Ch. 6. Service coverage and delay in search engines -- Ch. 7. Modelling prioritization -- Ch. 8. Study conclusions

Item Statistics: View statistics for this item

Actions (login required)

Item Control Page Item Control Page