• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Wang, Xiaoyu (Wang, Xiaoyu.) | Zhai, Yujia (Zhai, Yujia.) | Lin, Yuanhai (Lin, Yuanhai.) | Wang, Fang (Wang, Fang.)

Indexed by:

SSCI EI Scopus SCIE

Abstract:

Tech mining is the application of text mining tools to science and technology information resources. The ever-increasing volume of scientific outputs is a boom to technological innovation, but it also complicates efforts to obtain useful and concise information for problem solving. This challenge extends to tech mining, where the development of techniques compatible with big data is an urgent issue. This article introduces a semi-supervised method for extracting layered technological information from scientific papers in order to extend the reach of tech mining. Our method starts with several pre-set seed patterns used to extract candidate phrases by matching the dependency tree of each sentence. Then, after a series of judgements, phrases are divided into two categories: 'main technique' and 'tech-component'. (A technique, for the purposes of this study, is a method or tool used in the article being analysed.) In order to generate new patterns for subsequent iterations, a weighted pattern learning method is also adopted. Finally, multiple iterations of the method are applied to extract technological information from each paper. A dataset from the field of optical switcher is used to verify the method's effectiveness. Our findings are that (1) by two loops of extraction process in each iteration, our method realises the layered technological information extraction, which contains the 'part-whole' relationships between main techniques and tech-components; (2) the recall rate for main techniques is superior to the baseline after iterating 23 rounds; (3) when layering is disregarded, in the aspect of the precision and the volume of techniques, the new method is higher than that for the baseline; and (4) adjusting another two parameters can optimise the efficiency - however, the effect is neither pronounced nor straightforward.

Keyword:

semi-supervised learning information extraction Dependency tree tech mining

Author Community:

  • [ 1 ] [Wang, Xiaoyu]Nankai Univ, Business Sch, Dept Informat Resource Management, 94 Weijin Rd, Tianjin 300071, Peoples R China
  • [ 2 ] [Wang, Fang]Nankai Univ, Business Sch, Dept Informat Resource Management, 94 Weijin Rd, Tianjin 300071, Peoples R China
  • [ 3 ] [Wang, Xiaoyu]CETC Big Data Res Inst Co Ltd, Guiyang, Guizhou, Peoples R China
  • [ 4 ] [Zhai, Yujia]Tianjin Normal Univ, Sch Management, Dept Informat Resource Management, Tianjin, Peoples R China
  • [ 5 ] [Lin, Yuanhai]Beijing Univ Technol, Inst Informat Photon Technol, Beijing, Peoples R China
  • [ 6 ] [Lin, Yuanhai]Beijing Univ Technol, Coll Appl Sci, Beijing, Peoples R China

Reprint Author's Address:

  • [Wang, Fang]Nankai Univ, Business Sch, Dept Informat Resource Management, 94 Weijin Rd, Tianjin 300071, Peoples R China

Show more details

Related Keywords:

Source :

JOURNAL OF INFORMATION SCIENCE

ISSN: 0165-5515

Year: 2019

Issue: 6

Volume: 45

Page: 779-793

2 . 4 0 0

JCR@2022

ESI Discipline: SOCIAL SCIENCES, GENERAL;

ESI HC Threshold:84

JCR Journal Grade:3

Cited Count:

WoS CC Cited Count: 2

SCOPUS Cited Count: 5

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 3

Affiliated Colleges:

Online/Total:832/10657434
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.