• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Azam, Muhammad (Azam, Muhammad.) | Ahmed, Tanvir (Ahmed, Tanvir.) | Sabah, Fahad (Sabah, Fahad.) | Hussain, Muhammad Iftikhar (Hussain, Muhammad Iftikhar.)

Abstract:

Scientific publications has been increasing enormously, with this increase classification of scientific publications is becoming challenging task. The core objective of this research is to analyze the performance of classification algorithms using Scopus dataset. In text classification, classification and feature extraction from the document using extracted features are the major issues for decreasing the performances in different algorithms. In this paper, performances of classification algorithms such as Naive Bayes (NB) and K-Nearest Neighbor (K-NN) shown better improvement using Bayesian boost and bagging. The performance results were analyzed through selected classification algorithms over 10K documents from Scopus examined using F-measure and produced comparison matrices to estimate accuracy, precision and recall using NB and KNN classifier. Further, data preprocessing and cleaning steps are induced on the selected dataset and class imbalance issues are analyzed to increase the performance of text classification algorithms. Experimental results showed performances over 7% using K-NN and revealed better as compared to NB.

Keyword:

naive bayes K-NN feature extraction rapid miner text classification

Author Community:

  • [ 1 ] [Azam, Muhammad]Super Univ Lahore, Dept Comp Sci & Informat Technol, Lahore, Pakistan
  • [ 2 ] [Ahmed, Tanvir]Super Univ Lahore, Dept Comp Sci & Informat Technol, Lahore, Pakistan
  • [ 3 ] [Sabah, Fahad]Super Univ Lahore, Dept Comp Sci & Informat Technol, Lahore, Pakistan
  • [ 4 ] [Hussain, Muhammad Iftikhar]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
  • [ 5 ] [Hussain, Muhammad Iftikhar]Beijing Univ Technol, Beijing Engn Res Ctr IoT Software & Syst, Beijing 100124, Peoples R China

Reprint Author's Address:

Email:

Show more details

Related Keywords:

Related Article:

Source :

INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY

ISSN: 1738-7906

Year: 2018

Issue: 12

Volume: 18

Page: 95-101

ESI Discipline: COMPUTER SCIENCE;

ESI HC Threshold:161

Cited Count:

WoS CC Cited Count: 0

SCOPUS Cited Count:

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 4

Affiliated Colleges:

Online/Total:302/10509246
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.