Title: Unified Binary Generative Adversarial Network for Image Retrieval and Compression

Author:
Song, Jingkuan (University of Electronic Science and Technology of China)
He, Tao (Monash University)
Gao, Lianli (University of Electronic Science and Technology of China)
Xu, Xing (University of Electronic Science and Technology of China)
Hanjalic, A. (TU Delft Intelligent Systems)
Shen, Heng Tao (University of Electronic Science and Technology of China)

Department: Intelligent Systems
Date: 2020

Abstract:
Binary codes have often been deployed to facilitate large-scale retrieval tasks, but less often for image compression. In this paper, we propose a unified framework, BGAN+, that restricts the input noise variable of generative adversarial networks to be binary and conditioned on the features of each input image, and simultaneously learns two binary representations per image: one for image retrieval and the other for image compression. Compared to related methods that attempt to learn a single binary code serving both purposes, we demonstrate that using two codes leads to more effective representations because fewer concessions are needed when balancing the requirements. The added value of using a unified framework compared to two separate frameworks lies in the synergy in data representation that is beneficial for both learning processes. When devising this framework, we also address another challenge in learning binary codes, namely that of learning supervision. While the most striking successes in image retrieval using binary codes have mostly involved discriminative models requiring labels, the proposed BGAN+ framework learns the binary codes in an unsupervised fashion, yet more effectively than the state-of-the-art supervised approaches.
The proposed BGAN+ framework is evaluated on three benchmark datasets for image retrieval and two datasets for image compression. The experimental results show that BGAN+ outperforms the existing retrieval methods by significant margins and achieves promising performance for image compression, especially at low bit rates.

Subject: Binary codes; Generative adversarial network; Image compression; Image retrieval
To reference this document use: http://resolver.tudelft.nl/uuid:e95be336-afd4-447f-8455-4e55b1af134a
DOI: https://doi.org/10.1007/s11263-020-01305-2
Embargo date: 2021-02-18
ISSN: 0920-5691
Source: International Journal of Computer Vision, 128 (8-9), 2243-2264
Bibliographical note: Accepted author manuscript
Part of collection: Institutional Repository
Document type: journal article
Rights: © 2020 Jingkuan Song, Tao He, Lianli Gao, Xing Xu, A. Hanjalic, Heng Tao Shen
Files: ijcv2020_binary.pdf (PDF, 1.46 MB)