Indexed by:
Abstract:
Recently emerged RGB-D sensors provide great promise for indoor scene understanding, which is a fundamental and challenging problem in computer vision. We present a discriminative model in this paper to semantically label indoor scenes from RGB-D images Unlike previous work which only labels pre-determined superpixels, we characterize the scenes with a set of planes and compose them into objects. The optimal way to composition and corresponding labels are inferred simultaneously using a greedy algorithm. Our model considers unary features and pairwise and co-occurrence context, as well as latent variables that account for multi-mode distributions of each object category. We train the model with latent structural SVM learning framework. Our approach achieves state-of-the-art performance on the Cornell RGB-D indoor scene dataset [1].
Keyword:
Reprint Author's Address:
Source :
SIXTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2013)
ISSN: 0277-786X
Year: 2013
Volume: 9067
Language: English
Cited Count:
WoS CC Cited Count: 0
SCOPUS Cited Count:
ESI Highly Cited Papers on the List: 0 Unfold All
WanFang Cited Count:
Chinese Cited Count:
30 Days PV: 7
Affiliated Colleges: