Indexing XML Data for Efficient Twig Pattern Matching

Indexing XML Data for Efficient Twig Pattern Matching
Title Indexing XML Data for Efficient Twig Pattern Matching PDF eBook
Author Praveen Rao
Publisher
Pages 316
Release
Genre
ISBN

Download Indexing XML Data for Efficient Twig Pattern Matching Book in PDF, Epub and Kindle

The Extensible Markup Language XML has become the de facto standard for information representation and interchange on the Internet. In this dissertation, I address the problem of indexing and querying XML in two environments, namely, (a) a traditional environment where data is centrally stored and (b) a growingly popular peer-to-peer (P2P) environment. In a traditional environment, the index built over XML data is typicallycentralized. On the other hand, due to the distributed nature of the data in a P2P system, the index is also distributed. Due to the different models of storing data in these two environments, I propose two different XML indexing schemes for efficient query processing. In a traditional environment, a core operation is tofind all occurrences of a given query pattern in the database. I propose a new way of indexing XML documents and processing query patterns. Every XML document in the database is transformed into a sequence of labels by Prơ̧fer's method that constructs a one-to-one correspondence between trees and sequences. During query processing, a query pattern is also transformed into its Prơ̧fer sequence. By performing subsequence matching on the set of sequences in the database, and performing a series of refinement phasesthat I have developed, all the occurrences of a query pattern can be found in the database. Furthermore, I show that all correct answers are found without any false dismissals or false alarms. I present the design, implementation, and experimental evaluation of the PRIX system that I have developed for this purpose. Coupled with the growing popularity of P2P systems, XML is commonly used as an underlying data model for P2P applications to handle the heterogeneity of the data and limited expressiveness of queries. Locating relevant data sources across a large number of participating peers is an important challenge. In this environment, the challenge is to quickly test the existence ofa query pattern in XML documents published by usersrather than finding all their occurrences. PRIX finds all occurrences of a query pattern and hence is not the best solution. Moreover, in a P2P environment, a distributed and decentralized index is necessary. Therefore, I propose a distributed indexing scheme for XML documents to quickly test for existence of query patterns based on polynomial signatures. In this scheme, each XML document is mapped into an algebraic signature that captures the structural summary of the document. The participating peers in the network collectively maintain a distributed and hierarchical index over the signatures. By virtue of the signature index, the signatures of documents with similar structural characteristics tend to be stored together at the same peer, and a search for document sources is resolved quickly. I present the design, implementation, and empirical evaluation of the psiX system that I have developed for this purpose. The signature scheme proposed in psiX can be applied to querying heterogeneous XML databases.

Database Systems for Advanced Applications

Database Systems for Advanced Applications
Title Database Systems for Advanced Applications PDF eBook
Author Kian Lee Tan
Publisher Springer
Pages 940
Release 2006-03-11
Genre Computers
ISBN 354033338X

Download Database Systems for Advanced Applications Book in PDF, Epub and Kindle

This book constitutes the refereed proceedings of the 11th International Conference on Database Systems for Advanced Applications, DASFAA 2006, held in Singapore in April 2006. 46 revised full papers and 16 revised short papers presented were carefully reviewed and selected from 188 submissions. Topics include sensor networks, subsequence matching and repeating patterns, spatial-temporal databases, data mining, XML compression and indexing, xpath query evaluation, uncertainty and streams, peer-to-peer and distributed networks and more.

Computational Science - ICCS 2007

Computational Science - ICCS 2007
Title Computational Science - ICCS 2007 PDF eBook
Author Yong Shi
Publisher Springer Science & Business Media
Pages 1310
Release 2007-05-18
Genre Computers
ISBN 3540725830

Download Computational Science - ICCS 2007 Book in PDF, Epub and Kindle

Annotation The four-volume set LNCS 4487-4490 constitutes the refereed proceedings of the 7th International Conference on Computational Science, ICCS 2007, held in Beijing, China in May 2007. More than 2400 submissions were made to the main conference and its 35 topical workshops. The 80 revised full papers and 11 revised short papers of the main track were carefully reviewed and selected from 360 submissions and are presented together with 624 accepted workshop papers in four volumes. According to the ICCS 2007 theme "Advancing Science and Society through Computation" the papers cover a large volume of topics in computational science and related areas, from multiscale physics, to wireless networks, and from graph theory to tools for program development. The papers are arranged in topical sections on efficient data management, parallel monte carlo algorithms, simulation of multiphysics multiscale systems, dynamic data driven application systems, computer graphics and geometric modeling, computer algebra systems, computational chemistry, computational approaches and techniques in bioinformatics, computational finance and business intelligence, geocomputation, high-level parallel programming, networks theory and applications, collective intelligence for semantic and knowledge grid, collaborative and cooperative environments, tools for program development and analysis in CS, intelligent agents in computing systems, CS in software engineering, computational linguistics in HCI, internet computing in science and engineering, workflow systems in e-science, graph theoretic algorithms and applications in cs, teaching CS, high performance data mining, mining text, semi-structured, Web, or multimedia data, computational methods in energy economics, risk analysis, advances in computational geomechanics and geophysics, meta-synthesis and complex systems, scientific computing in electronics engineering, wireless and mobile systems, high performance networked media and services, evolution toward next generation internet, real time systems and adaptive applications, evolutionary algorithms and evolvable systems.

Database and XML Technologies

Database and XML Technologies
Title Database and XML Technologies PDF eBook
Author Mong Li Lee
Publisher Springer Science & Business Media
Pages 163
Release 2010-09
Genre Computers
ISBN 3642156835

Download Database and XML Technologies Book in PDF, Epub and Kindle

This book constitutes the refereed proceedings of the 7th International XML Database Symposium, XSym 2010, held in Singapore, in September 2010. The 11 papers were carefully reviewed and selected from 20 submissions. The papers are organized in topical sections on XML query processing; XML update and applications; and XML modeling.

Networked Digital Technologies, Part I

Networked Digital Technologies, Part I
Title Networked Digital Technologies, Part I PDF eBook
Author Filip Zavoral
Publisher Springer Science & Business Media
Pages 599
Release 2010-06-30
Genre Language Arts & Disciplines
ISBN 3642142915

Download Networked Digital Technologies, Part I Book in PDF, Epub and Kindle

On behalf of the NDT 2010 conference, the Program Committee and Charles University in Prague, Czech Republic, we welcome you to the proceedings of the Second International Conference on ‘Networked Digital Technologies’ (NDT 2010). The NDT 2010 conference explored new advances in digital and Web technology applications. It brought together researchers from various areas of computer and information sciences who addressed both theoretical and applied aspects of Web technology and Internet applications. We hope that the discussions and exchange of ideas that took place will contribute to advancements in the technology in the near future. The conference received 216 papers, out of which 85 were accepted, resulting in an acceptance rate of 39%. These accepted papers are authored by researchers from 34 countries covering many significant areas of Web applications. Each paper was evaluated by a minimum of two reviewers. Finally, we believe that the proceedings document the best research in the studied areas. We express our thanks to the Charles University in Prague, Springer, the authors and the organizers of the conference.

Advanced Applications and Structures in XML Processing: Label Streams, Semantics Utilization and Data Query Technologies

Advanced Applications and Structures in XML Processing: Label Streams, Semantics Utilization and Data Query Technologies
Title Advanced Applications and Structures in XML Processing: Label Streams, Semantics Utilization and Data Query Technologies PDF eBook
Author Li, Changqing
Publisher IGI Global
Pages 500
Release 2010-02-28
Genre Social Science
ISBN 1615207287

Download Advanced Applications and Structures in XML Processing: Label Streams, Semantics Utilization and Data Query Technologies Book in PDF, Epub and Kindle

"This book is for professionals and researchers working in the field of XML in various disciplines who want to improve their understanding of the XML data management technologies, such as XML models, XML query and update processing, XML query languages and their implementations, keywords search in XML documents, database, web service, publish/subscribe, medical information science, and e-business"--Provided by publisher.

Database Systems for Advanced Applications

Database Systems for Advanced Applications
Title Database Systems for Advanced Applications PDF eBook
Author Lei Chen
Publisher Springer
Pages 383
Release 2009-08-27
Genre Computers
ISBN 3642042058

Download Database Systems for Advanced Applications Book in PDF, Epub and Kindle

DASFAA is an annual international database conference, located in the Asia- Paci?cregion,whichshowcasesstate-of-the-artR & Dactivities in databases- tems and their applications. It provides a forum for technical presentations and discussions among database researchers, developers and users from academia, business and industry. DASFAA 2009, the 14th in the series, was held during April 20-23, 2009 in Brisbane, Australia. In this year, we carefully selected six workshops, each focusing on speci?c research issues that contribute to the main themes of the DASFAA conference. Thisvolumecontainsthe?nalversionsofpapersacceptedforthesesixworkshops that were held in conjunction with DASFAA 2009. They are: – First International Workshop on Benchmarking of XML and Semantic Web Applications (BenchmarX 2009) – Second International Workshop on Managing Data Quality in Collaborative Information Systems (MCIS 2009) – First International Workshop on Data and Process Provenance (WDPP 2009) – First International Workshop on Privacy-Preserving Data Analysis (PPDA 2009) – FirstInternationalWorkshoponMobileBusinessCollaboration(MBC2009) – DASFAA 2009 PhD Workshop All the workshops were selected via a public call-for-proposals process. The workshop organizers put a tremendous amount of e?ort into soliciting and - lecting papers with a balance of high quality, new ideas and new applications. We asked all workshops to follow a rigid paper selection process, including the procedure to ensure that any Program Committee members are excluded from the paper review process of any paper they are involved with. A requirement about the overall paper acceptance rate of no more than 50% was also imposed on all the workshops.