A deep learning based method for extracting semantic information from patent documents - Details

Author：

Indexed by：

SSCI Scopus SCIE

Abstract：

The　text-based　patent　analysis　is　grounded　in　information　extraction　technique.　However,　such　technique　suffers　from　obvious　defects　such　as　low　degree　of　automation　and　unsatisfactory　extraction　accuracy.　To　deal　with　these　problems,　after　an　information　schema　is　pre-defined,　which　contains　17　types　of　entities　and　15　types　of　semantic　relations,　a　dataset　of　1010　patent　abstracts　is　annotated　and　opened　freely　to　the　research　community.　Then,　a　novel　patent　information　extraction　framework　is　proposed,　in　which　two　deep-learning　models,　BiLSTM-CRF　and　BiGRU-HAN,　are　respectively　used　for　entity　identification　and　semantic　relation　extraction.　Finally,　to　demonstrate　the　advantages　of　the　new　framework,　extensive　experiments　are　conducted,　and　the　SAO　method　and　PCNNs　model　are　taken　as　respective　baselines　on　the　framework　and　module　levels.　Experimental　results　show　that　our　framework　out-performs　the　traditional　one　in　terms　of　automation　and　accuracy,　and　is　capable　of　extracting　fine-grained　structured　information　from　patent　texts.

Keyword：

BiGRU-HAN BiLSTM-CRF PCNNs SAO Patent analysis Entity identification Thin film head Deep learning Relation extraction

Author Community：

[ 1 ] [Chen, Liang]Inst Sci & Tech Informat China, Beijing 100038, Peoples R China
[ 2 ] [Zhu, Lijun]Inst Sci & Tech Informat China, Beijing 100038, Peoples R China
[ 3 ] [Zhang, Jing]Inst Sci & Tech Informat China, Beijing 100038, Peoples R China
[ 4 ] [Lei, Xiaoping]Inst Sci & Tech Informat China, Beijing 100038, Peoples R China
[ 5 ] [Xu, Shuo]Beijing Univ Technol, Res Base Beijing Modern Mfg Dev, Coll Econ & Management, Beijing 100124, Peoples R China
[ 6 ] [Yang, Guancan]Renmin Univ China, Sch Informat Resource Management, Beijing 100872, Peoples R China

Reprint Author's Address：

徐硕
[Xu, Shuo]Beijing Univ Technol, Res Base Beijing Modern Mfg Dev, Coll Econ & Management, Beijing 100124, Peoples R China

Email：

Show more details

Related Keywords：

A Traditional Chinese Medicine Terminology Recognition Model Based on Deep Learning: A TCM Terminology Recognition Model
2021，6th International Conference on Big Data and Computing, ICBDC 2021
A novel approach for patent similarity measurement based on sequence alignment
2020，
An improved patent similarity measurement based on entities and semantic relations
2021，Journal of Informetrics
A novel approach for patent similarity measurement based on sequence alignment
2020，1st Workshop on Extraction and Evaluation of Knowledge Entities from Scientific Documents, EEKE 2020

Source ：

SCIENTOMETRICS

ISSN： 0138-9130

Year： 2020

Issue： 1

Volume： 125

Page： 289-312

3 . 9 0 0

JCR@2022

ESI Discipline： SOCIAL SCIENCES, GENERAL;

ESI HC Threshold：79

Cited Count：

WoS CC Cited Count： 47

SCOPUS Cited Count： 75

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 9

Affiliated Colleges：

经济与管理学院本学院/部未明确归属的数据

Get Fulltext

DOI Library Discovery Baidu Scholar Search Web of Science

Type
Departments

All Years Choose Year From to