Automata-based Static Analysis of XML Document Adaptations

Alessandro Solimando

Équipe OAK, LRI/INRIA Paris Saclay

Friday 29th January 2016, 10h00

Amphithéâtre F107, Inria Grenoble Rhône-Alpes, Montbonnot


The structure of an XML document can optionally be specified by means of an XML Schema, which makes it possible to exploit structural information for efficient document handling. Upon schema evolution, or when exchanging documents among collections that use related but not identical schemas, the need may arise to adapt a document, known to be valid for a given schema S, to a target schema S'. The adaptation may require knowledge of the element semantics and cannot always be derived automatically. We present an automata-based method for the static analysis of user-defined XML document adaptations, expressed as sequences of XQuery Update primitives. The key feature of the method is an automatic inference technique that extracts the type, expressed as a Hedge Automaton, of a sequence of document updates. The type is computed starting from the original schema S and from rewriting rules that formally define the operational semantics of a sequence of document updates. Type inclusion can then be used as a conformance test w.r.t. the type extracted from the target schema S'.
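
As a rough illustration of checking conformance on types rather than on individual documents, the following toy sketch in Python may help. It is not the method presented in the talk: it assumes finite, DTD-like content models in place of Hedge Automata, supports a single rename primitive, and uses a per-label subset test as a sufficient condition for type inclusion. All names (`rename_type`, `included`, the `book` schemas) are hypothetical.

```python
from typing import Dict, FrozenSet, Set, Tuple

# Assumption: a "type" maps each element label to a finite set of
# allowed child-label sequences (a stand-in for a hedge automaton).
ContentModel = FrozenSet[Tuple[str, ...]]
Schema = Dict[str, ContentModel]


def rename_type(schema: Schema, old: str, new: str) -> Schema:
    """Apply a rename update at the type level: every `old` becomes `new`."""
    def r(label: str) -> str:
        return new if label == old else label

    out: Schema = {}
    for label, seqs in schema.items():
        renamed = frozenset(tuple(r(c) for c in seq) for seq in seqs)
        # Merge content models if `new` already existed in the schema.
        out[r(label)] = out.get(r(label), frozenset()) | renamed
    return out


def reachable(schema: Schema, root: str) -> Set[str]:
    """Labels that can occur in a document rooted at `root`."""
    seen: Set[str] = set()
    stack = [root]
    while stack:
        label = stack.pop()
        if label in seen:
            continue
        seen.add(label)
        for seq in schema.get(label, frozenset()):
            stack.extend(seq)
    return seen


def included(inferred: Schema, target: Schema, root: str) -> bool:
    """Sufficient inclusion test: each reachable label's content model
    must be a subset of the target's content model for that label."""
    return all(
        label in target and inferred.get(label, frozenset()) <= target[label]
        for label in reachable(inferred, root)
    )


EMPTY: ContentModel = frozenset({()})  # element with no children

# Hypothetical source schema S and target schema S'.
S: Schema = {
    "book": frozenset({("title", "writer")}),
    "title": EMPTY,
    "writer": EMPTY,
}
S_PRIME: Schema = {
    "book": frozenset({("title", "author"), ("title", "author", "year")}),
    "title": EMPTY,
    "author": EMPTY,
    "year": EMPTY,
}

# Type of the adapted documents, inferred without touching any document.
inferred = rename_type(S, "writer", "author")
conforms = included(inferred, S_PRIME, "book")
```

In this sketch the adaptation is accepted because the inferred type is included in the target type, while the unmodified schema S fails the same test. No document instance needs to be inspected.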


Alessandro Solimando received his BSc, MSc and PhD from the University of Genova, where he defended his thesis on change management for the traditional and Semantic Web, under the supervision of Prof. Giovanna Guerrini and Dr. Ernesto Jimenez-Ruiz. In 2011 he was an intern at Inria-Saclay and Université Paris-Sud XI, working on optimization for XQuery Update query processing. He has also been a visiting student at the University of Rome "La Sapienza" and at the University of Oxford, working on approximation for Ontology-Based Data Access (OBDA) systems and Ontology-to-ontology Alignment Debugging. In October 2015 he joined Inria-Saclay as a postdoctoral fellow. His current research interests lie at the intersection of data management and knowledge representation, spanning from technologies related to the Semantic Web to distributed data processing, and from query optimization to efficient data storage and querying.
