Author:

Li, Yeting | Chen, Haiming | Zhang, Lingqi | Huang, Bo | Zhang, Jianzhao

Indexed by:

CPCI-S EI Scopus

Abstract:

A schema for XML documents offers numerous advantages. Unfortunately, many XML documents in practice are not accompanied by a schema, or the schema they come with is not valid. It is therefore essential to devise algorithms that infer schemas. The fundamental task in XML schema inference is learning regular expressions. In this paper, we focus on learning a subclass of RE(&) called SIREs (the subclass of regular expressions with interleaving). Previous work in this direction lacks algorithms that support inference from both positive and negative examples. We provide an algorithm, based on genetic algorithms and parallel techniques, that learns SIREs from positive and negative examples. Our algorithm is also more extensible: it supports learning not only from positive and negative examples together, but also from positive examples only or from negative examples only. Experimental results demonstrate the effectiveness of our algorithm.

Keyword:

Interleaving; Learning expressions; XML; Positive and negative examples; Schema inference

Author Community:

  • [ 1 ] [Li, Yeting]Chinese Acad Sci, Inst Software, State Key Lab Comp Sci, Beijing, Peoples R China
  • [ 2 ] [Chen, Haiming]Chinese Acad Sci, Inst Software, State Key Lab Comp Sci, Beijing, Peoples R China
  • [ 3 ] [Zhang, Jianzhao]Chinese Acad Sci, Inst Software, State Key Lab Comp Sci, Beijing, Peoples R China
  • [ 4 ] [Li, Yeting]Univ Chinese Acad Sci, Beijing, Peoples R China
  • [ 5 ] [Zhang, Jianzhao]Univ Chinese Acad Sci, Beijing, Peoples R China
  • [ 6 ] [Zhang, Lingqi]Beijing Univ Technol, Beijing, Peoples R China
  • [ 7 ] [Huang, Bo]Northwestern Polytech Univ, Xian, Peoples R China

Source:

ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2020, PT II

ISSN: 0302-9743

Year: 2020

Volume: 12085

Page: 769-781
