• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Mou, L. (Mou, L..) | Zhou, C. (Zhou, C..) | Xie, P. (Xie, P..) | Zhao, P. (Zhao, P..) | Jain, R. (Jain, R..) | Gao, W. (Gao, W..) | Yin, B. (Yin, B..)

Indexed by:

EI Scopus SCIE

Abstract:

Driverdrowsiness is an important cause of traffic accidents. Many studies using computer vision techniques to detect driver drowsiness states, such as slow blinking, yawning, and nodding, have demonstrated excellent potential. Although existing studies have made significant progress, the number of samples in the training corpora is small, which makes it difficult for a model to learn effective drowsiness representations from images or videos. To address this issue, we develop an isotropic self-supervised learning (IsoSSL) approach to learn powerful representations of images without relying on human-provided annotations and propose an IsoSSL-MoCo model by combining IsoSSL with momentum contrast (MoCo). To exploit the complementarity of multimodal data, an attention-based multimodal fusion model is also proposed to fuse features from the eye, mouth, and optical flow of the head. Specifically, we first use the IsoSSL-MoCo model to pretrain the image encoders for the three modalities in other datasets. Then, these encoders are fine-tuned and integrated into the proposed fusion model. The feature vectors generated by the image encoders of the three modalities are fed into the recursive layer to extract temporal information. To capture the importance degrees of the effects of temporal features from the three modalities on drowsiness detection, an attention mechanism is introduced to automatically weigh the feature vectors from the recursive layer to improve detection accuracy. Finally, a vector representation is generated by the attention layer and is used to detect driver drowsiness states. Experimental results based on two challenging datasets show that our method outperforms the baseline methods and the latest existing methods. © 1999-2012 IEEE.

Keyword:

driver drowsiness detection isotropic self-supervised learning (IsoSSL) momentum contrast (MoCo) multimodal fusion model Attention

Author Community:

  • [ 1 ] [Mou L.]Beijing University of Technology, Beijing Institute of Artificial Intelligence, Faculty of Information Technology, Beijing Key Laboratory of Multimedia and Intelligent Software Technology, Beijing, 100124, China
  • [ 2 ] [Zhou C.]Beijing University of Technology, Beijing Institute of Artificial Intelligence, Faculty of Information Technology, Beijing Key Laboratory of Multimedia and Intelligent Software Technology, Beijing, 100124, China
  • [ 3 ] [Xie P.]University of California, San Diego, 92093, CA, United States
  • [ 4 ] [Zhao P.]Beijing University of Technology, Beijing Institute of Artificial Intelligence, Faculty of Information Technology, Beijing Key Laboratory of Multimedia and Intelligent Software Technology, Beijing, 100124, China
  • [ 5 ] [Jain R.]University of California, Bren School of Information and Computer Sciences, Institute for Future Health, Irvine, 92697, CA, United States
  • [ 6 ] [Gao W.]Peking University, Institute of Digital Media, Beijing, 100871, China
  • [ 7 ] [Gao W.]Peking University Shenzhen Graduate School, School of Electronic and Computer Engineering, Shenzhen, 518055, China
  • [ 8 ] [Yin B.]Beijing University of Technology, Beijing Institute of Artificial Intelligence, Faculty of Information Technology, Beijing Key Laboratory of Multimedia and Intelligent Software Technology, Beijing, 100124, China

Reprint Author's Address:

Email:

Show more details

Related Keywords:

Source :

IEEE Transactions on Multimedia

ISSN: 1520-9210

Year: 2023

Volume: 25

Page: 529-542

7 . 3 0 0

JCR@2022

ESI Discipline: COMPUTER SCIENCE;

ESI HC Threshold:19

Cited Count:

WoS CC Cited Count: 0

SCOPUS Cited Count: 27

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 10

Affiliated Colleges:

Online/Total:1044/10531216
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.