Compact Representation of XML

Dr Raymond Wong
Project Leader
National ICT Australia

Tuesday 27th September 2005 at 11am

 

Abstract

As XML database sizes grow, the amount of space used for storing the data and auxiliary supporting data structures becomes a major factor in query and update performance. This paper presents a new storage scheme for XML data that supports all navigational operations in near constant time. In addition to supporting efficient queries, the space requirement of the proposed scheme is within a constant factor of the information theoretic minimum, while insertions and deletions can be performed in near constant time as well. As a result, the proposed structure features a small memory footprint that increases cache locality, whilst still supporting standard APIs, such as DOM, and necessary database operations, such as queries and updates, efficiently. Both formal analysis and experimental evidence demonstrate that the proposed structure is space and time efficient. We believe that the proposed scheme is extremely useful for handling XML in handheld and embedded devices.

Short resume

Dr. Raymond Wong is a Project Leader at National ICT Australia, a Senior Lecturer at University of New South Wales and the founder of Green Pea Software. In 2001, he led his development team in building one of the first native XML database management systems, supporting full transaction capabilities and multiple XML query languages including XPath and XQuery. He is currently a supervisor of 8 PhD students and has published more than 80 research manuscripts in the database field. He received his BSc from ANU, MPhil and PhD from HKUST. After his PhD, he was a postdoc at Stanford University and a visiting scholar at UCLA. Before joining UNSW, he worked at the computer science department of the following universities: Chinese University of Hong Kong, Macquarie University, and University of Sydney.

Back to HAIL Home Page