Abstract
Selecting an optimal set of icons is a crucial step in the pipeline of visual design to structure and navigate through content. However, designing the icons sets is usually a difficult task for which expert knowledge is required. In this work, to ease the process of icon set selection to the users, we propose a similarity metric which captures the properties of style and visual identity. We train a Siamese Neural Network with an on-line dataset of icons organized in visually coherent collections that are used to adaptively sample training data and optimize the training process. As the dataset contains noise, we further collect human-rated information on the perception of icon’s similarity which will be used for evaluating and testing the proposed model. We present several results and applications based on searches, kernel visualizations and optimized set proposals that can be helpful for designers and non-expert users while exploring large collections of icons.
Similar content being viewed by others
References
Airey D (2016) Logo design love a guide to creating iconic brand identities. Peachpit Press, Berkeley
Bai X, Yang X, Latecki LJ, Liu W, Tu Z (2010) Learning context-sensitive shape similarity by graph transduction. IEEE Trans Pattern Anal Mach Intell 32(5):861–874
Barnard M (2013) Graphic design as communication. Routledge, Abingdon
Bates CL, Cragun BJ, Day PR (2002) Automatic icon generation, US Patent 6,456,307
Bell S, Bala K (2015) Learning visual similarity for product design with convolutional neural networks. ACM Trans Graph (Proc SIGGRAPH) 34:4
Belongie S, Malik J, Puzicha J (2002) Shape matching and object recognition using shape contexts. IEEE Trans Pattern Anal Mach Intell 24(4):509–522
Bernstein GL, Li W (2015) Lillicon: using transient widgets to create scale variations of icons. ACM Trans Graph (TOG) 34(4):144
Bober M (2001) Mpeg-7 visual shape descriptors. IEEE Trans Circ Syst Video Technol 11(6):716–719
Bromley J, Guyon I, Lecun Y, Säckinger E, Shah R (1994) Signature verification using a ”siamese” time delay neural network. In: NIPS Proceedings
Crandall DJ, Huttenlocher DP (2006) Weakly supervised learning of part-based spatial models for visual object recognition. In: Leonardis A, Bischof H, Pinz A (eds) Computer Vision - ECCV 2006, pp 16–29
Demiralp Ç, Bernstein MS, Heer J (2014) Learning perceptual kernels for visualization design. IEEE Trans Vis Comput Graph 20(12):1933–1942
Doulamis A, Doulamis N (2004) Generalized nonlinear relevance feedback for interactive content-based retrieval and organization. IEEE Trans Circ Syst Video Technol 14(5):656–671
El-Naqa I, Yang Y, Galatsanos NP, Nishikawa RM, Wernick MN (2004) A similarity learning approach to content-based image retrieval: application to digital mammography. IEEE Trans Med Imaging 23(10):1233–1244
Farnung-Laursen L, Koyama Y, Chen H-T, Garces E, Gutierrez D, Harper R, Igarashi T (2016) Icon Set Selection via Human Computation. In: Pacific Graphics Short Papers
Fried O, Avidan S, Cohen-Or D (2017) Patch2vec: Globally consistent image patch representation. Pac Graph 36:7
Garces E, Agarwala A, Gutierrez D, Hertzmann A (2014) A similarity measure for illustration style. ACM Trans Graph (Proc SIGGRAPH) 33:4
Garces E, Agarwala A, Hertzmann A, Gutierrez D (2017) Style-based Exploration of Illustration Datasets. Multimed Tools Appl 76:11
Glorot X, Bengio Y (2010) Understanding the difficulty of training deep feedforward neural networks. In: Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, vol 9 of Proceedings of Machine Learning Research, PMLR, pp 249–256
Goodfellow I, Bengio Y, Courville A (2016) Deep Learning. MIT Press, Cambridge. http://www.deeplearningbook.org
Gramazio CC, Laidlaw DH, Schloss KB (2017) Colorgorical Creating discriminable and preferable color palettes for information visualization. IEEE Trans Vis Comput Graph 23(1):521–530
Horton WK (1994) The icon book: Visual symbols for computer systems and documentation. Wiley, NY
Hu M-K (1962) Visual pattern recognition by moment invariants. IRE Trans Inf Theory 8(2):179–187
Ioffe S, Szegedy C (2015) Batch normalization Accelerating deep network training by reducing internal covariate shift. CoRR arXiv:1502.03167
Khotanzad A, Hong YH (1990) Invariant image recognition by zernike moments. IEEE Trans Pattern Anal Mach Intell 12(5):489–497
Kingma DP, Ba J (2014) Adam: A method for stochastic optimization. CoRR arXiv:1412.6980
Kleiman Y, van Kaick O, Sorkine-Hornung O, Cohen-Or D (2015) Shed: shape edit distance for fine-grained shape similarity. ACM Trans Graph (TOG) 34(6):235
Kwan KC, Sinn LT, Han C, Wong T-T, Fu C-W (2016) Pyramid of arclength descriptor for generating collage of shapes. ACM Trans Graph (Proc SIGGRAPH Asia) 35(6):1–12
Latecki LJ, Lakamper R, Eckhardt T (2000) Shape descriptors for non-rigid shapes with a single closed contour. In: IEEE Conference on Computer Vision and Pattern Recognition. IEEE, vol 1, pp 424–429
LeCun Y, Boser B, Denker JS, Henderson D, Howard RE, Hubbard W, Jackel LD (1989) Backpropagation applied to handwritten zip code recognition. Neural Comput 1(4):541–551
Lewis JP, Rosenholtz R, Fong N, Neumann U (2004) VisualIDs Automatic Distinctive Icons for Desktop Interfaces. ACM Trans Graph (Proc SIGGRAPH) 1(212):416–423
Liu T, Hertzmann A, Li W, Funkhouser T (2015) Style compatibility for 3D furniture models. ACM Trans Graph (Proc SIGGRAPH) 34(4):1–9
Liu Y, Agarwala A, Lu J, Rusinkiewicz S (2016) Data-driven iconification. In: Proceedings of Computational Aesthetics and Sketch Based Interfaces and Modeling and Non-Photorealistic Animation and Rendering. Eurographics Association, pp 113–124
Lun Z, Kalogerakis E, Sheffer A (2015) Elements of style learning perceptual shape style similarity. ACM Trans Graph (TOG) 34 4:84:1–14
Lupton E (2004) Thinking with type Critical Guide for Designers, Writers, Editors & Students
Lupton E, Phillips JC (2015) Graphic design: The New Basics: Revised and Expanded. Chronicle Books
Mohler DS, Vick JH (2015) Screen icon manipulation by context and frequency of use, US Patent 8,972,878
O’Donovan P, Agarwala A, Hertzmann A (2011) Color Compatibility From Large Datasets. ACM Trans Graph 30:4
O’Donovan P, Lı̄beks J, Agarwala A, Hertzmann A (2014) Exploratory font selection using crowdsourced attributes. ACM Trans Graph (TOG) 33(4):92
Osada R, Funkhouser T, Chazelle B, Dobkin D (2002) Shape distributions. ACM Trans Graph (TOG) 21(4):807–832
Parkhi OM, Vedaldi A, Zisserman A (2015) Deep face recognition. In: Proceedings of the British Machine Vision Conference (BMVC)
Rubinstein M, Gutierrez D, Sorkine O, Shamir A (2010) A comparative study of image retargeting. ACM Trans Graph (Proc SIGGRAPH Asia) 29 6:160:1–160:10
Saleh B, Dontcheva M, Hertzmann A, Liu Z (2015) Learning style similarity for searching infographics. In: Proceedings of the 41st graphics interface conference. GI ’15. Canadian Information Processing Society, pp 59–64
Schroff F, Kalenichenko D, Philbin J (2015) FaceNet: A unified embedding for face recognition and clustering. In: IEEE Conference on Computer Vision and Pattern Recognition, vol 07-12-June, pp 815–823
Setlur V, Mackinlay JD (2014) Automatic generation of semantic icon encodings for visualizations. Inproceedings of the SIGCHI Conference on Human Factors in Computing Systems, CHI’14, pp 541–550
Setlur V, Albrecht-Buehler C, Gooch AA, Rossoff S, Gooch B (2005) Semanticons Visual metaphors as file icons. Comput Graph Forum (Proc Eurographics) 24(3):647–656
Shugrina M, Lu J, Diverdi S (2017) Playful palette: an interactive parametric color mixer for artists. ACM Trans Graph (TOG) 36(4):61
Sikora T (2001) The mpeg-7 visual standard for content description-an overview. IEEE Trans Circ Syst Video Technol 11(6):696–702
Simo-Serra E, Ishikawa H (2016) Fashion Style in 128 Floats: Joint Ranking and Classification using Weak Data for Feature Extraction. In: Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR)
Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. CoRR arXiv:1409.1556
Srivastava N, Hinton G, Krizhevsky A, Sutskever I, Salakhutdinov R (2014) Dropout a simple way to prevent neural networks from overfitting. J Mach Learn Res 15(1):1929–1958
Torresani L (2014) Weakly supervised learning. Springer, Boston, pp 883–885
Upchurch P, Snavely N, Bala K (2016) From A to Z: supervised transfer of style and content using deep neural network generators. CoRR arXiv:1603.02003
van der Maaten L, Hinton GE (2008) Visualizing high-dimensional data using t-sne. J Mach Learn Res 9:2579–2605
Voulodimos A, Doulamis N, Doulamis A, Protopapadakis E Deep learning for computer vision: A brief review, pp 1–13
Wu P, Hoi SC, Xia H, Zhao P, Wang D, Miao C (2013) Online multimodal deep similarity learning with application to image retrieval. In: Proceedings of the 21st ACM International Conference on Multimedia, pp 153–162
Xia H, Hoi SCH, Jin R, Zhao P (2014) Online multiple kernel similarity learning for visual search. IEEE Trans Pattern Anal Mach Intell 36(3):536–549
Yin W, Schütze H, Xiang B, Zhou B (2015) ABCNN: Attention-based convolutional neural network for modeling sentence pairs. CoRR arXiv:1512.05193
Zeiler MD, Fergus R (2014) Visualizing and understanding convolutional networks. In European conference on computer vision. Springer, pp 818–833
Zhang D, Lu G (2002) Shape-based image retrieval using generic fourier descriptor. Signal Process Image Commun 17(10):825–848
Acknowledgments
We want to thank the anonymous reviewers and Adrian Jarabo for their insightful comments on the manuscript. This project has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (CHAMELEON project, grant agreement No 682080).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Lagunas, M., Garces, E. & Gutierrez, D. Learning icons appearance similarity. Multimed Tools Appl 78, 10733–10751 (2019). https://doi.org/10.1007/s11042-018-6628-7
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-018-6628-7