• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Zheng, MeiXia (Zheng, MeiXia.) | Jia, XiBin (Jia, XiBin.) (Scholars:贾熹滨)

Indexed by:

EI Scopus

Abstract:

The paper aims to establish a effective feature form of visual speech to realize the Chinese viseme recognition. We propose and discuss a representation model of the visual speech which bases on the local binary pattern (LBP) and the discrete cosine transform (DCT) of mouth images. The joint model combines the advantages of the local and global texture information together, which shows better performance than using the global feature only. By computing LBP and DCT of each mouth frame capturing during the subject speaking, the Hidden Markov Model (HMM) is trained based on the training dataset and is employed to recognize the new visual speech. The experiments show this visual speech feature model exhibits good performance in classifying the difference speaking states. © 2012 Springer-Verlag GmbH.

Keyword:

Speech recognition Hidden Markov models Speech Intelligent computing Discrete cosine transforms Textures

Author Community:

  • [ 1 ] [Zheng, MeiXia]Beijing University of Technology, Beijing, China
  • [ 2 ] [Jia, XiBin]Beijing University of Technology, Beijing, China
  • [ 3 ] [Jia, XiBin]Multimedia and Intelligent Software Technology, Beijing Municipal Key Laboratory, Beijing, China

Reprint Author's Address:

Show more details

Related Keywords:

Related Article:

Source :

ISSN: 1867-5662

Year: 2012

Volume: 137 AISC

Page: 101-107

Language: English

Cited Count:

WoS CC Cited Count:

SCOPUS Cited Count: 1

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 6

Online/Total:608/10701104
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.