TY - JOUR
AU - Ohbuchi, Ryutarou
AB - Unsupervised representation learning of unlabeled multimedia data is an important yet challenging problem for their indexing, clustering, and retrieval. There have been many attempts to learn representations from a collection of unlabeled 2D images. In contrast, less attention has been paid to unsupervised representation learning for unordered sets of high-dimensional feature vectors, which are often used to describe multimedia data. One such example is a set of local visual features that describes a 2D image. This paper proposes a novel algorithm called Feature Set Aggregator (FSA) for accurate and efficient comparison among sets of high-dimensional features. FSA learns a representation, or embedding, of unordered feature sets via optimization using a combination of two training objectives, namely set reconstruction and set embedding, carefully designed for set-to-set comparison. Experimental evaluation under three multimedia information retrieval scenarios using 3D shapes, 2D images, and text documents demonstrates the efficacy as well as the generality of the proposed algorithm.
TI - Feature set aggregator: unsupervised representation learning of sets for their comparison
JO - Multimedia Tools and Applications
DO - 10.1007/s11042-019-08078-y
DA - 2019/08/20
UR - https://www.deepdyve.com/lp/springer-journals/feature-set-aggregator-unsupervised-representation-learning-of-sets-8gpAd5i8kR
SP - 35157
EP - 35178
VL - 78
IS - 24
DP - DeepDyve
ER -