Matrix-variate variational auto-encoder with applications to image process - Details

Author：

Indexed by：

EI Scopus SCIE

Abstract：

Variational　Auto-Encoder　(VAE)　is　an　important　probabilistic　technology　to　model　1D　vectorial　data.　However,　when　applying　VAE　model　to　2D　image,　vectorization　is　necessary.　Vectorization　process　may　lead　to　dimension　curse　and　lose　valuable　spatial　information.　To　avoid　these　problems,　we　propose　a　novel　VAE　model　based　on　matrix　variables　named　as　Matrix-variate　Variational　Auto-Encoder　(MVVAE).　In　this　model,　input,　hidden　and　latent　variables　are　all　in　matrix　form,　therefore　inherent　spatial　structure　of　2D　images　can　be　maintained　and　utilized　better.　Especially,　the　latent　variable　is　assumed　to　follow　matrix　Gaussian　distribution　which　is　more　suitable　for　describing　2D　images.　To　solve　the　weights　and　the　posterior　of　latent　variable,　the　variational　inference　process　is　given.　The　experiments　are　designed　for　three　real-world　application:　reconstruction,　denoising　and　completion.　The　experimental　results　demonstrate　that　MVVAE　shows　better　performance　than　VAE　and　other　probabilistic　methods　for　modeling　and　processing　2D　data.　(C)　2020　Elsevier　Inc.　All　rights　reserved.

Keyword：

Image denoising Face completion Variational inference Variational autoencoder Matrix Gaussian distribution

Author Community：

[ 1 ] [Li, Jinghua]Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing, Peoples R China
[ 2 ] [Yan, Huixia]Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing, Peoples R China
[ 3 ] [Kong, Dehui]Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing, Peoples R China
[ 4 ] [Wang, Lichun]Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing, Peoples R China
[ 5 ] [Wang, Shaofan]Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing, Peoples R China
[ 6 ] [Yin, Baocai]Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing, Peoples R China
[ 7 ] [Gao, Junbin]Univ Sydney, Univ Sydney Business School, Discipline Business Analyt, Sydney, NSW 2006, Australia
[ 8 ] [Yin, Baocai]Dalian Univ Technol, Fac Elect Informat & Elect Engn, Dalian, Peoples R China

Reprint Author's Address：

[Li, Jinghua]Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing, Peoples R China

Email：

lijinghua@bjut.edu.cn

Show more details

Related Keywords：

Nonlocal total variation regularization with Shape Adaptive Patches for image denoising via Split Bregman method
2021，Journal of Physics:Conference Series
Low-rank matrix recovery with total generalized variation for defending adversarial examples
2024，Frontiers of Information Technology and Electronic Engineering
Enhancing the Spatial Resolution of Galaxy Images Using an Advanced Conditional Denoising Diffusion Probabilistic Model
2024，2024 International Conference on New Trends in Computational Intelligence, NTCI 2024
Zero-shot industrial image denoising with lightweight network
2024，2024 International Workshop on Automation, Control, and Communication Engineering, IWACCE 2024

Source ：

JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION

ISSN： 1047-3203

Year： 2020

Volume： 67

2 . 6 0 0

JCR@2022

ESI Discipline： COMPUTER SCIENCE;

ESI HC Threshold：132

Cited Count：

WoS CC Cited Count： 3

SCOPUS Cited Count： 8

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 3

Affiliated Colleges：

信息科学技术学院本学院/部未明确归属的数据

建筑与城市规划学院本学院/部未明确归属的数据

Get Fulltext

DOI Library Discovery Baidu Scholar Search Web of Science

Type
Departments

All Years Choose Year From to