creators_name: Everts, TJ
type: thesis
datestamp: 2004-12-16
lastmod: 2008-07-18 09:38:07
metadata_visibility: show
title: Using Formal Concept Analysis with a Push-based Web Document Management System
ispublished: unpub
subjects: 280100
full_text_status: public
monograph_type: NULL
keywords: MCRDR, lattice-based browsing, classification, web documents, ripple-down rules
abstract: The significant increase in the amount of information readily available on the World Wide Web (WWW) makes it difficult for users to locate the information they desire in a timely manner. Modern information gathering and retrieval methods focus on simplifying this task by enabling the user to retrieve only a small subset of information that is more relevant and manageable. However, the majority of users will often not find an immediate use for this information. Therefore, it is necessary to provide a method to store it effectively so it can be utilised as a future knowledge resource. A commonly adopted approach is to classify the retrieved information based on its content. A technique that has been found to be suitable for this purpose is Multiple Classification Ripple Down Rules (MCRDR). MCRDR constructs a classification knowledge base over time using an incremental learning process. This incremental method of acquiring classification knowledge suits the nature of Web information because it is constantly evolving and being updated. However, despite this advantage, the classification knowledge of MCRDR is not often utilised for browsing the classified information. This is because MCRDR does not directly organise the knowledge in a way that is suitable for browsing. As a result, an alternative structure is often utilised for browsing the information, usually one based on a user's abstract understanding of the information domain.
This study investigated the feasibility of utilising the classification knowledge acquired through the use of MCRDR as a resource for browsing information retrieved from the WWW. A system was implemented that used the concept lattice-based browsing scheme of Formal Concept Analysis (FCA) to support the browsing of documents based on MCRDR classification knowledge. The feasibility of utilising classification knowledge as a resource for browsing documents was evaluated statistically. This was achieved by comparing the concept lattice-based browsing approach to a standard one that utilises abstract knowledge of a domain as a resource for browsing the same documents.
date: 2004-11
date_type: published
institution: University of Tasmania
department: School of Computing
thesis_type: honours
refereed: FALSE
referencetext: Boyapati, V., Chevrier, K., Finkel, A., Glance, N., Pierce, T., Stockton, R. and Whitmer, C. 2002, 'ChangeDetector: a site-level monitoring tool for the WWW', Eleventh International Conference on World Wide Web, Honolulu, Hawaii, USA.
Buchwitz, L. 1997, 'Monitoring Competitive Intelligence using Internet Push Technology', Competitive Intelligence Review.
Chakravarthy, S., Sanka, A., Jacob, J. and Pandrangi, N. 2004, 'A Learning-Based Approach for Fetching Pages in WebVigiL', 2004 ACM Symposium on Applied Computing, ACM, Nicosia, Cyprus, pp. 1725-1729.
Chin, P. 2003, Push Technology: Still Relevant After All These Years?, Intranet Journal, viewed October 8,
Cho, W. C. 2003, 'Use of Cache Mechanism for Web Information Search', Masters thesis, University of Tasmania.
Compton, P. and Jansen, R. 1989, 'A Philosophical Basis for Knowledge Acquisition', 3rd European Knowledge Acquisition for Knowledge-Based Systems Workshop, Paris, pp. 75-89.
Correia, J. H., Stumme, G., Wille, R. and Wille, U. 2003, 'Conceptual Knowledge Discovery - A Human-Centred Approach', Applied Artificial Intelligence, vol. 17, pp. 281-302.
Dazeley, R. and Kang, B. H. 2003, 'Weighted MCRDR: Deriving Information about Relationships between Classifications in MCRDR', Australian Conference on Artificial Intelligence, pp. 245-255.
Dumais, S., Cutrell, E., Cadiz, J., Jancke, G., Sarin, R. and Robbins, D. C. 2003, 'Stuff I've Seen: A System for Personal Information Retrieval and Re-use', 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, ACM, Toronto, Canada, pp. 72-79.
Edwards, G., Compton, P., Malor, R., Srinivasan, A. and Lazarus, L. 1993, 'PEIRS: a pathologist maintained expert system for the interpretation of chemical pathology reports', Pathology, vol. 25, pp. 27-34.
Furnas, G. W. 1986, 'Generalised Fisheye Views', Proceedings of CHI'86, ACM, pp. 16-23.
Ganter, B. and Wille, R. 1997, 'Applied Lattice Theory: Formal Concept Analysis', Preprints, http://wwwbib.mathematik.tu-darmstadt.de/Math-Net/Preprints/Listen/pp97.html.
Glance, N., Meunier, J.-L., Bernard, P. and Arregui, D. 2001, 'Collaborative Document Monitoring', 2001 International ACM SIGGROUP Conference on Supporting Group Work, ACM, Boulder, Colorado, USA.
Godin, R., Missaoui, R. and Alaoui, H. 1995, 'Incremental concept formation algorithms based on Galois (concept) lattices', Computational Intelligence, vol. 11, no. 2, pp. 246-267.
Godin, R., Pichet, C. and Gecsei, J. 1989, 'Design of a browsing interface for information retrieval', 12th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, ACM Press, pp. 32-39.
Greenspan, R. 2002, Search Engine Usage Ranks High, viewed 9 June,
InternetWorldStats 2004, Internet Usage Statistics - The Big Picture: World Internet Users and Population Stats, viewed October 8, 2004,
Kang, B. H. 1996, 'Multiple Classification Ripple Down Rules', PhD thesis, University of New South Wales.
Kang, B. H., Compton, P. and Preston, P. 1995, 'Multiple Classification Ripple Down Rules: Evaluation and Possibilities', 9th Banff Knowledge Acquisition for Knowledge Based Systems Workshop, Banff, pp. 17.11-17.20.
Kang, B. H., Yoshida, K., Motoda, H. and Compton, P. 1997, 'Help Desk System with Intelligent Interface', Applied Artificial Intelligence, vol. 11, pp. 611-631.
Kim, M. 2003, 'Document Management and Retrieval for Specialised Domains: An Evolutionary User-Based Approach', PhD thesis, University of New South Wales.
Kim, M. and Compton, P. 2000, 'Developing a domain-specific Document Retrieval Mechanism', 6th Pacific Knowledge Acquisition Workshop, Sydney, Australia, pp. 189-206.
Kim, M. and Compton, P. 2001, 'A Web-based Browsing Mechanism Based on Conceptual Structures', 9th International Conference on Conceptual Structures, Stanford University, California, USA, pp. 47-60.
Kim, M. and Compton, P. 2004, 'Evolutionary document management and retrieval for specialized domains on the web', International Journal of Human-Computer Studies, vol. 60, no. 2, pp. 201-241.
Kim, Y. S., Park, S. S., Deards, E. and Kang, B. H. 2004a, 'Adaptive Web Document Classification with MCRDR', International Conference on Information Technology (ITCC), Las Vegas, NV, USA.
Kim, Y. S., Park, S. S., Kang, B. H. and Choi, Y. J. 2004b, 'Incremental Knowledge Management of Web Community Groups on Web Portals', 5th International Conference on Practical Aspects of Knowledge Management, Vienna, Austria.
Kobayashi, M. and Takeda, K. 2000, 'Information Retrieval on the Web', ACM Computing Surveys, vol. 32, no. 2, pp. 144-173.
Lam, S. K. S. and Ozsu, M. T. 2002, 'Querying Web data - the WebQA approach', Third International Conference on Web Information Systems Engineering, IEEE, Singapore, pp. 139-148.
Mladenic, D. 1999, 'Text-learning and related intelligent agents', Application of Intelligent Information Retrieval.
Park, S. S., Kim, Y. S. and Kang, B. H. 2003, 'Web Information Management System: Personalization and Generalization', IADIS International Conference WWW/Internet 2003, Algarve, Portugal.
Preston, P., Compton, P., Edwards, G. and Kang, B. H. 1996, 'An Implementation of Multiple Classification Ripple Down Rules', Tenth Knowledge Acquisition for Knowledge-Based Systems Workshop.
Rajapakse, R. K. and Denham, M. 2003, 'A Reinforcement Learning Strategy for (formal) Concept and Keyword Weight Learning for Adaptive Information Retrieval', 9th International Conference on User Modeling, Johnstown, Pennsylvania, USA, pp. 29-39.
Richards, D. C. 1998, 'The Reuse in Ripple Down Rule Knowledge Based Systems', Doctorate thesis, University of New South Wales.
Richards, D. C. 2001, 'Combining Cases and Rules to Provide Contextualised Knowledge Based Systems', Third International Conference on Modelling and Using Context, Coff's Harbour, Australia, pp. 85-94.
Richards, D. C. and Compton, P. 1997, 'Combining Formal Concept Analysis and Ripple Down Rules to Support the Reuse of Knowledge', Ninth International Conference on Software Engineering Knowledge Engineering SEKE'97, Springer Verlag, Madrid, Spain.
Sebastiani, F. 2002, 'Machine Learning in Automated Text Categorization', ACM Computing Surveys, vol. 34, no. 1, pp. 1-47.
Tam, G. K. T. 2004, 'FOCAS - Formal Concept Analysis and Text Similarity', Honours thesis, Monash University.
Wille, R. 1982, 'Restructuring lattice theory: an approach based on hierarchies of concepts', in I. Rival (ed.), Ordered Sets, Reidel, Dordrecht-Boston, pp. 445-470.
Wille, R. 1989, 'Lattices in Data Analysis: How to Draw them with a Computer', in I. Rival (ed.), Algorithms and Order, Kluwer, Dordrecht-Boston, pp. 33-58.
Wille, R. and Ganter, B. 1999, Formal Concept Analysis: Mathematical Foundations, Springer, Berlin-Heidelberg.
citation: Everts, TJ (2004) Using Formal Concept Analysis with a Push-based Web Document Management System. Honours thesis, University of Tasmania.
document_url: http://eprints.utas.edu.au/116/1/EvertsT_Hons_Thesis2004.pdf