Author:

Zhu, Zhichao | Zhao, Qing | Li, Jianjiang | Ge, Yanhu | Ding, Xingjian | Gu, Tao | Zou, Jingchen | Lv, Sirui | Wang, Sheng | Yang, Ji-Jiang

Indexed by:

Scopus SCIE

Abstract:

The emergence of large language models (LLMs) has provided robust support for application tasks across various domains, such as named entity recognition (NER) in the general domain. However, owing to the particularity of the medical domain, research on understanding and improving the effectiveness of LLMs on biomedical named entity recognition (BNER) tasks remains relatively limited, especially for Chinese text. In this study, we extensively evaluate several typical LLMs, including ChatGLM2-6B, GLM-130B, GPT-3.5, and GPT-4, on the Chinese BNER task using a real-world Chinese electronic medical record (EMR) dataset and a public dataset. The experimental results demonstrate the promising yet limited performance of LLMs with zero-shot and few-shot prompt designs on Chinese BNER tasks. More importantly, instruction fine-tuning significantly enhances the performance of LLMs. The fine-tuned offline ChatGLM2-6B surpasses the task-specific model BiLSTM+CRF (BC) on the real-world dataset. The best fine-tuned model, GPT-3.5, outperforms all other LLMs on the publicly available CCKS2017 dataset and even surpasses half of the baselines; however, it remains challenging for it to surpass the state-of-the-art task-specific model, i.e., the Dictionary-guided Attention Network (DGAN). To our knowledge, this study is the first attempt to evaluate the performance of LLMs on Chinese BNER tasks, and it emphasizes the prospective and transformative implications of utilizing LLMs for these tasks. Furthermore, we summarize our findings into a set of actionable guidelines for future researchers on how to effectively leverage LLMs to become experts in specific tasks.
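
To illustrate what the zero-shot and few-shot prompt designs mentioned in the abstract might look like in practice, here is a minimal sketch. It is not the authors' prompt: the entity types, the English instruction wording, the output format, and the names `ENTITY_TYPES`, `format_entities`, and `build_prompt` are all assumptions made for illustration, since this record does not give those details.

```python
from typing import List, Sequence, Tuple

# Hypothetical label set; the record does not list the entity types used in the study.
ENTITY_TYPES = ["disease", "symptom", "drug"]

# A demonstration is (sentence, [(entity text, entity type), ...]).
Example = Tuple[str, List[Tuple[str, str]]]


def format_entities(entities: Sequence[Tuple[str, str]]) -> str:
    """Render entities one per line as 'type: text', or 'none' if there are none."""
    if not entities:
        return "none"
    return "\n".join(f"{etype}: {text}" for text, etype in entities)


def build_prompt(sentence: str, examples: Sequence[Example] = ()) -> str:
    """Build a zero-shot prompt (no examples) or a few-shot prompt (with examples)."""
    parts = [
        "Extract the biomedical entities from the input. "
        f"Allowed types: {', '.join(ENTITY_TYPES)}. "
        "Answer with one 'type: entity' pair per line, or 'none'."
    ]
    # Few-shot: each labeled demonstration is placed before the query sentence.
    for demo_text, demo_entities in examples:
        parts.append(f"Text: {demo_text}\nEntities:\n{format_entities(demo_entities)}")
    # The query is left open after 'Entities:' for the model to complete.
    parts.append(f"Text: {sentence}\nEntities:")
    return "\n\n".join(parts)
```

Calling `build_prompt(sentence)` yields a zero-shot prompt, and passing a list of labeled demonstrations yields a few-shot prompt. The returned string would then be sent to whichever model is being evaluated.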

Keyword:

large language model; biomedical named entity recognition; electronic medical record

Author Community:

  • [ 1 ] [Zhu, Zhichao]Beijing Univ Technol, Coll Comp Sci, Beijing 100124, Peoples R China
  • [ 2 ] [Zhao, Qing]Beijing Univ Technol, Coll Comp Sci, Beijing 100124, Peoples R China
  • [ 3 ] [Li, Jianjiang]Beijing Univ Technol, Coll Comp Sci, Beijing 100124, Peoples R China
  • [ 4 ] [Ding, Xingjian]Beijing Univ Technol, Coll Comp Sci, Beijing 100124, Peoples R China
  • [ 5 ] [Gu, Tao]Beijing Univ Technol, Coll Comp Sci, Beijing 100124, Peoples R China
  • [ 6 ] [Zou, Jingchen]Beijing Univ Technol, Coll Comp Sci, Beijing 100124, Peoples R China
  • [ 7 ] [Lv, Sirui]Beijing Univ Technol, Coll Comp Sci, Beijing 100124, Peoples R China
  • [ 8 ] [Ge, Yanhu]Capital Med Univ, Beijing Anzhen Hosp, Dept Anesthesiol, Beijing 100013, Peoples R China
  • [ 9 ] [Wang, Sheng]Capital Med Univ, Beijing Anzhen Hosp, Dept Anesthesiol, Beijing 100013, Peoples R China
  • [ 10 ] [Yang, Ji-Jiang]Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China

Reprint Author's Address:

  • [Wang, Sheng]Capital Med Univ, Beijing Anzhen Hosp, Dept Anesthesiol, Beijing 100013, Peoples R China
  • [Yang, Ji-Jiang]Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China

Source:

BIOENGINEERING-BASEL

Year: 2024

Issue: 10

Volume: 11

4.600 (JCR@2022)
