
Author:

Mao, Guojun | Gao, Mingxia | Yao, Wenji

Indexed by:

CPCI-S

Abstract:

This paper proposes an algorithm for clustering XML data streams using a sliding window. It is a dynamic clustering algorithm based on XML structure. First, we represent each XML document with a level structure, which is based on a temporal clustering feature. This structure is suitable for extracting information from the XML document structure and for calculating the similarity between XML documents. Second, we use the sliding window technique, which adopts an exponential histogram of XML cluster features as its micro-cluster. Using this model, we can dynamically accept new data and discard old data, thereby obtaining a better distribution feature for the current window. Finally, experimental results on real and synthetic XML datasets show that our algorithm not only meets the real-time requirements of online clustering, but also achieves better clustering quality and faster processing speed.
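
The abstract does not define the level structure or the similarity measure, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes a level structure is the list of tag-name sets found at each depth of the document tree, and it assumes a depth-weighted Jaccard overlap as the similarity. The names `level_structure` and `level_similarity` and the weight `1 / (depth + 1)` are hypothetical choices made for illustration.

```python
import xml.etree.ElementTree as ET


def level_structure(xml_text):
    """Return a list whose i-th entry is the set of tag names at depth i.

    Assumption of this sketch: only element names are kept; text and
    attributes are ignored.
    """
    root = ET.fromstring(xml_text)
    levels = []
    frontier = [root]
    while frontier:
        levels.append({node.tag for node in frontier})
        # Move one level down: collect the children of every current node.
        frontier = [child for node in frontier for child in node]
    return levels


def level_similarity(a, b):
    """Depth-weighted Jaccard overlap of two level structures, in [0, 1].

    Assumption of this sketch: shallower levels weigh more, using the
    hypothetical weight 1 / (depth + 1).
    """
    depth = max(len(a), len(b))
    total = 0.0
    weight_sum = 0.0
    for i in range(depth):
        tags_a = a[i] if i < len(a) else set()
        tags_b = b[i] if i < len(b) else set()
        weight = 1.0 / (i + 1)
        # The union is never empty: at least one document reaches depth i.
        total += weight * len(tags_a & tags_b) / len(tags_a | tags_b)
        weight_sum += weight
    return total / weight_sum


doc_a = level_structure("<book><title/><author/></book>")
doc_b = level_structure("<book><title/><year/></book>")
score = level_similarity(doc_a, doc_b)
```

Under these assumptions, two documents with identical level structures score 1.0, and documents sharing no tags at any level score 0.0. A streaming clusterer could compare each incoming document against per-cluster summaries with such a measure, though how the paper combines it with the exponential histogram is not stated in the abstract.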

Keyword:

XML; data stream; sliding window

Author Affiliations:

  • [ 1 ] [Mao, Guojun] Cent Univ Finance & Econ, Coll Informat, Beijing, Peoples R China
  • [ 2 ] [Gao, Mingxia] Beijing Univ Technol, Sch Comp Sci, Beijing, Peoples R China
  • [ 3 ] [Yao, Wenji] Beijing Univ Technol, Sch Comp Sci, Beijing, Peoples R China

Reprint Author's Address:

  • [Mao, Guojun] Cent Univ Finance & Econ, Coll Informat, Beijing, Peoples R China

Source:

DBKDA 2011: THE THIRD INTERNATIONAL CONFERENCE ON ADVANCES IN DATABASES, KNOWLEDGE, AND DATA APPLICATIONS

Year: 2011

Page: 96-101

Language: English

Cited Count:

WoS CC Cited Count: 1

ESI Highly Cited Papers on the List: 0
