Towards using visual, semantic and structural features to improve code readability classification - Details

Author：

Mi, Qing (Mi, Qing.) | Hao, Yiqun (Hao, Yiqun.) | Ou, Liwei (Ou, Liwei.) | Ma, Wei (Ma, Wei.)

Indexed by：

EI Scopus SCIE

Abstract：

Context:　Code　readability,　which　correlates　strongly　with　software　quality,　plays　a　critical　role　in　software　maintenance　and　evolvement.　Although　existing　deep　learning-based　code　readability　models　have　reached　a　rather　high　classification　accuracy,　only　structural　features　are　utilized　which　inevitably　limits　their　model　performance.　Objective:　To　address　this　problem,　we　propose　to　extract　readability-related　features　from　visual,　semantic,　and　structural　aspects　from　source　code　in　an　attempt　to　further　improve　code　readability　classification.　Method:　First,　we　convert　a　code　snippet　into　a　RGB　matrix　(for　visual　feature　extraction),　a　token　sequence　(for　semantic　feature　extraction)　and　a　character　matrix　(for　structural　feature　extraction).　Then,　we　input　them　into　a　hybrid　neural　network　that　is　composed　of　BERT,　CNN,　and　BiLSTM　for　feature　extraction.　Finally,　the　extracted　features　are　concatenated　and　input　into　a　classifier　to　make　a　code　readability　classification.　Result:　A　series　of　experiments　are　conducted　to　evaluate　our　method.　The　results　show　that　the　average　accuracy　could　reach　85.3%,　which　outperforms　all　existing　models.　Conclusion:　As　an　innovative　work　of　extracting　readability-related　features　automatically　from　visual,　semantic,　and　structural　aspects,　our　method　is　proved　to　be　effective　for　the　task　of　code　readability　classification.　(c)　2022　Elsevier　Inc.　All　rights　reserved.

Keyword：

Code readability classification Program comprehension Software analysis Neural networks Code representation

Author Community：

[ 1 ] [Mi, Qing]Beijing Univ Technol, Beijing, Peoples R China
[ 2 ] [Hao, Yiqun]Beijing Univ Technol, Beijing, Peoples R China
[ 3 ] [Ou, Liwei]Beijing Univ Technol, Beijing, Peoples R China
[ 4 ] [Ma, Wei]Beijing Univ Technol, Beijing, Peoples R China

Reprint Author's Address：

Email：

miqing@bjut.edu.cn |
yiqun.hao@ucdconnect.ie |
liwei.ou@ucdconnect.ie |
mawei@bjut.edu.cn

Show more details

Related Keywords：

A graph-based code representation method to improve code readability classification
2023，EMPIRICAL SOFTWARE ENGINEERING
Improving Multi-Class Code Readability Classification with An Enhanced Data Augmentation Approach (130)
2022，INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING
The effectiveness of data augmentation in code readability classification
2021，INFORMATION AND SOFTWARE TECHNOLOGY
DeepWukong: Statically Detecting Software Vulnerabilities Using Deep Graph Neural Network
2021，ACM Transactions on Software Engineering and Methodology

Source ：

JOURNAL OF SYSTEMS AND SOFTWARE

ISSN： 0164-1212

Year： 2022

Volume： 193

3 . 5

JCR@2022

3 . 5 0 0

JCR@2022

ESI Discipline： COMPUTER SCIENCE;

ESI HC Threshold：46

JCR Journal Grade：2

CAS Journal Grade：2

Cited Count：

WoS CC Cited Count：

SCOPUS Cited Count：

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 0

Affiliated Colleges：

Get Fulltext

DOI Library Discovery Baidu Scholar Search Web of Science

Type
Departments

All Years Choose Year From to