Author:

Li, Yeting | Chen, Haiming | Zhang, Lingqi | Huang, Bo | Zhang, Jianzhao

Indexed by:

EI

Abstract:

The presence of a schema for XML documents has numerous advantages. Unfortunately, many XML documents in practice are not accompanied by a schema or a valid schema. Therefore, it is essential to devise algorithms to infer schemas. The fundamental task in XML schema inference is to learn regular expressions. In this paper, we focus on learning the subclass of RE(&) called SIREs (the subclass of regular expressions with interleaving). Previous work in this direction lacks inference algorithms that support inference from positive and negative examples. We provide an algorithm to learn SIREs from positive and negative examples based on genetic algorithms and parallel techniques. Our algorithm also has better expansibility, which means that our algorithm not only supports learning with positive and negative examples, but also supports learning with positive or negative examples only. Experimental results demonstrate the effectiveness of our algorithm. © Springer Nature Switzerland AG 2020.

Keyword:

Pattern matching; Inference engines; XML; Data mining; Genetic algorithms

Author Affiliations:

  • [ 1 ] [Li, Yeting] State Key Laboratory of Computer Science, Institute of Software, Chinese Academy of Sciences, Beijing; 100190, China
  • [ 2 ] [Li, Yeting] University of Chinese Academy of Sciences, Beijing, China
  • [ 3 ] [Chen, Haiming] State Key Laboratory of Computer Science, Institute of Software, Chinese Academy of Sciences, Beijing; 100190, China
  • [ 4 ] [Zhang, Lingqi] Beijing University of Technology, Beijing, China
  • [ 5 ] [Huang, Bo] Northwestern Polytechnical University, Xi’an, China
  • [ 6 ] [Zhang, Jianzhao] State Key Laboratory of Computer Science, Institute of Software, Chinese Academy of Sciences, Beijing; 100190, China
  • [ 7 ] [Zhang, Jianzhao] University of Chinese Academy of Sciences, Beijing, China

Reprint Author's Address:

  • [Chen, Haiming] State Key Laboratory of Computer Science, Institute of Software, Chinese Academy of Sciences, Beijing; 100190, China

Source:

ISSN: 0302-9743

Year: 2020

Volume: 12085 LNAI

Page: 769-781

Language: English

Cited Count:

WoS CC Cited Count: 0

SCOPUS Cited Count: 1

ESI Highly Cited Papers on the List: 0
