
Author:

Li, Yeting | Chen, Haiming | Zhang, Xiaolan | Zhang, Lingqi

Indexed by:

CPCI-S EI Scopus

Abstract:

The advantages offered by the presence of a schema are numerous. However, many XML documents in practice are not accompanied by a (valid) schema, making schema inference an attractive research problem. The fundamental task in XML schema learning is inferring restricted subclasses of regular expressions. Most previous work either lacks support for interleaving or supports it only in a limited form. In this paper, we first propose a new subclass, Single Occurrence Regular Expressions with Interleaving (SOIRE), which has unrestricted support for interleaving. Then, based on single occurrence automata and maximum independent sets, we propose an algorithm, iSOIRE, to infer SOIREs. Finally, we conduct a series of experiments on real datasets to evaluate the effectiveness of our work, comparing it with both current learning algorithms from academia and industrial tools used in practice. The results reveal the practicability of SOIRE and the effectiveness of iSOIRE, showing the high precision and conciseness of our work.
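As a rough illustration of one ingredient named in the abstract, the sketch below builds a single occurrence automaton (SOA) from sample strings in the way the term is commonly used in the schema-inference literature: one state per distinct symbol, and an edge wherever one symbol directly follows another in a sample. This is not the iSOIRE algorithm itself, whose details are not given in this record. The function name `build_soa` and the marker names `<src>` and `<snk>` are assumptions made for the example.

```python
from collections import defaultdict

# Hypothetical marker names for the automaton's start and end states.
SRC, SNK = "<src>", "<snk>"


def build_soa(samples):
    """Build a single occurrence automaton as an adjacency map.

    Each distinct symbol gets exactly one state. An edge a -> b is added
    whenever b immediately follows a in at least one sample string.
    """
    edges = defaultdict(set)
    for word in samples:
        prev = SRC
        for sym in word:
            edges[prev].add(sym)
            prev = sym
        edges[prev].add(SNK)  # the last symbol of each sample may end a word
    return dict(edges)


# Samples in which "a" may appear anywhere relative to the sequence "b c",
# the kind of ordering freedom that interleaving is meant to describe.
soa = build_soa(["abc", "bac", "bca"])
for state in sorted(soa):
    print(state, "->", sorted(soa[state]))
```

In this sample, `a` can precede, split, or follow the pair `b`, `c`, so the SOA has edges in both directions between `a` and each of the other two symbols. An inference algorithm would have to recognize such patterns as interleaving rather than as plain sequence or choice.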

Keyword:

interleaving; XML; schema inference; learning expressions

Author Community:

  • [ 1 ] [Li, Yeting]Chinese Acad Sci, Inst Software, State Key Lab Comp Sci, Beijing, Peoples R China
  • [ 2 ] [Chen, Haiming]Chinese Acad Sci, Inst Software, State Key Lab Comp Sci, Beijing, Peoples R China
  • [ 3 ] [Zhang, Xiaolan]Chinese Acad Sci, Inst Software, State Key Lab Comp Sci, Beijing, Peoples R China
  • [ 4 ] [Li, Yeting]Univ Chinese Acad Sci, Beijing, Peoples R China
  • [ 5 ] [Zhang, Xiaolan]Univ Chinese Acad Sci, Beijing, Peoples R China
  • [ 6 ] [Zhang, Lingqi]Beijing Univ Technol, Beijing, Peoples R China

Source:

IDEAS '19: PROCEEDINGS OF THE 23RD INTERNATIONAL DATABASE APPLICATIONS & ENGINEERING SYMPOSIUM (IDEAS 2019)

ISSN: 1098-8068

Year: 2019

Page: 189-198
