Some experiments in Web usage mining and XML Document mining

Dr Anne-Marie Vercoustre
Research Scientist
INRIA-Rocquencourt, France

*** NOTE UNUSUAL DAY AND TIME ***

Thursday 20th January 2005 at 2pm

Abstract

Anne-Marie Vercoustre has recently joined the AxIS research group at INRIA. AxIS is a multidisciplinary team (Artificial Intelligence, Data Mining, Data Analysis, Software Engineering). It aims to devise usage-driven methods and tools that assist with the design and analysis of knowledge and/or information systems (IS).

I will present some of AxIS's research activities, focusing on Web usage mining and XML document mining. In both cases we use clustering methods to infer information that is not explicitly represented in the document collection (or site). Log-based Web usage mining tries to characterise users' behavior when browsing a site. The goal is to identify classes of behavior in order to assist those users, or to help redesign the site if necessary.

Our work in XML document mining measured the impact of using different parts of the documents (selected according to the XML structure) when clustering the collection. More generally, we are interested in developing technology for clustering XML collections based on their content and/or structure.

Short resume

Dr Anne-Marie Vercoustre is Research Director at INRIA Rocquencourt. A former member of the Verso project, she took leave without pay and moved to CSIRO-CMIS, Melbourne, Australia (1997 and Sept. 2000 to March 2004), where she was a member of the TED group (Technologies for Electronic Documents), before becoming the leader of the Knowledge Sharing group in the new CSIRO ICT Centre.

Anne-Marie has since returned to INRIA-Rocquencourt, where she has joined the AxIS research group, working on Web usage mining.
