Medida de similitud basada en saliencia

Domínguez, Sergio

doi:10.1016/j.riai.2012.09.003

Información del artículo

Resumen

Texto completo

Bibliografía

Descargar PDF

Estadísticas

Resumen

La proliferación en todos los ámbitos de la producción multimedia está dando lugar a la aparición de nuevos paradigmas de recuperación de información visual. Dentro de éstos, uno de los más significativos es el de los sistemas de recuperación de información visual, VIRS (Visual Information Retrieval Systems), en los que una de las tareas más representativas es la ordenación de una población de imágenes según su similitud con un ejemplo dado. En este trabajo se presenta una propuesta original para la evaluación de la similitud entre dos imágenes, basándose en la extensión del concepto de saliencia desde el espacio de imágenes al de características para establecer la relevancia de cada componente de dicho vector. Para ello se introducen metodologías para la cuantificación de la saliencia de valores individuales de características, para la combinación de estas cuantificaciones en procesos de comparación entre dos imágenes, y para, finalmente, establecer la mencionada ponderación de cada característica en atención a esta combinación. Se presentan igualmente los resultados de evaluar esta propuesta en una tarea de recuperación de imágenes por contenido en comparación con los obtenidos con la distancia euclídea. Esta comparación se realiza mediante la evaluación de ambos resultados por voluntarios.

Palabras clave:

Bases de datos de imágenes

recuperación basada en contenido

medidas de similitud

modelos perceptuales

análisis de imágenes

Abstract

The ubiquitous growth of multimedia production is causing the creation of new visual information retrieval paradigms. One of the most relevant among them is that represented by Visual Information Retrieval Systems (VIRS), where a common task is ordering a set images according to their similarity to a given one. In this work a new proposal for evaluating similarity between two images is introduced; both images are represented by respective feature vectors, and the perceptual cue used to generate the similarity measure is saliency, a concept thoroughly known in Psychology. New methodologies for quantifying saliency of feature values, for combining them during a comparison process and, eventually, to weight that feature attending to the result of the combination, are introduced as well. The results for the evaluation of this similarity measure in an image based content retrieval task are presented, as well as their comparison with those obtained using euclidean distance in the same task. Both are validated by volunteers who labelled the retrieved sets.

Keywords:

Image databases

content based retrieval

similarity measures

perceptual models

image analysis

Texto completo

Referencias no citadas

Ashby and Perrin, 1988, Chen and Xie, 2011, Eidenberger, 2006, Fairhall et al., 6849, Fisher, 2011, Itti and Baldi, 2009, Itti and Koch, 2001, Itti et al., 1998, Kim et al., 2000, Kim and Kim, 1998, Koch and Ullman, 1985, Larkey and Markman, 2005, Rao and Ballard, 1999, Santini and Jain, 1999, Shepard, 1962a, Shepard, 1962b, Teh and Chin, 1988, Treue, 2003, Tsotsos et al., 1995, Tversky, 1977, Tversky and Gati, 1982 and Urdan, 2005.

Referencias

[Ashby and Perrin, 1988]

F. Ashby, N. Perrin.

Toward a unified theory of similarity and recognition.

Psychological Review, 95 (1988), pp. 124-150

[Chen and Xie, 2011]

G. Chen, W. Xie.

Wavelet-based moment invariants for pattern recognition.

Optical Engineering, 50 (2011),

[Eidenberger, 2006]

H. Eidenberger.

Evaluation and analysis of similarity measures for con- tent based visual information retrieval.

ACM Multimedia Systems Journal, 12 (2006), pp. 71-87

[Fairhall et al., 6849]

Fairhall, A.L., Lewen, G.D., Bialek, W., de Ryiter van Steveninck, R.R., August 2001. Efficiency and ambiguity in an adaptive neural code. Nature 412 (6849), 787-792.

[Fisher, 2011]

Fisher, R., 2011. URL: homepages.inf.ed.ac.uk/rbf/CVonline/Imagedbase.htm.

[Itti and Baldi, 2009]

L. Itti, P. Baldi.

Byesian surprise attracts human attention.

Vision Re- search, 49 (2009), pp. 1295-1306

[Itti and Koch, 2001]

Itti, L., Koch, C., March 2001. Computational modelling of visual attention. Nature Reviews Neuroscience 2, 194-203.

[Itti et al., 1998]

Itti, L., Koch, C., Niebur, E., November 1998. A model of saliency-based visual attention for rapid scene analysis. IEEE Transactions on Pattern Analysis and Machine Intelligence 20 (11), 1254-1259.

[Kim et al., 2000]

H.-K. Kim, J.-D. Kim, D.-G. Sim, D.-I. Oh.

A modified zernike moment shape descriptor invariant to translation, rotation and scale for similarity-based image retrieval.

IEEE International Conference on Multi- media, 1 (2000), pp. 307-310

[Kim and Kim, 1998]

Y. Kim, W. Kim.

Content-based trademark retrieval using a visually salient feature.

Image and Vision Computing, 16 (1998), pp. 931-939

[Koch and Ullman, 1985]

C. Koch, S. Ullman.

Shifts in selective visual attention: Towards the underlying neural circuitry.

Human Neurobiology, 4 (1985), pp. 219-227

Medline

[Larkey and Markman, 2005]

L.B. Larkey, A.B. Markman.

Processes of similarity judgement.

Cog- nitive Science, 29 (2005), pp. 1061-1076

[Rao and Ballard, 1999]

Rao, R.P. N., Ballard, D.H., January 1999. Predictive coding in the visual cortex: A functional interpretation of some extra-classical receptive-field effects. Nature Neuroscience 2 (1), 79-87.

[Santini and Jain, 1999]

Santini, S., Jain, R., September 1999. Similarity measures. IEEE Transactions on Pattern Analysis and Machine Intelligence 21 (9), 871-883.

[Shepard, 1962a]

Shepard, R.N., June 1962a. The analysis of proximities: Multidimensional sca- ling with an unknown distance function. i. Psycometrika 27 (2), 125-140.

[Shepard, 1962b]

Shepard, R.N., September 1962b. The analysis of proximities: Multidimen-sional scaling with an unknown distance function.ii. Psycometrika 27 (3), 219-246.

[Teh and Chin, 1988]

Teh, C.-H., Chin, R.T., !’July 1988. On image analysis by the methods of moments. IEEE Transactions on Pattern Analysis and Machine Intelligen- ce 10 (4), 496-512.

[Treue, 2003]

S. Treue.

Visual attention: the where, what, how and why of saliency.

Current Opinion in Neurobiology, 13 (2003), pp. 428-432

Medline

[Tsotsos et al., 1995]

J.K. Tsotsos, S.M. Culhane, W.Y.K. Wai, Y. Lai, N. Davis, F. Nuflo.

Modeling visual attention via selective tuning.

Artificial Intelligence, 78 (1995), pp. 507-545

[Tversky, 1977]

Tversky, A., July 1977. Features of similarity. Psychological Review 84 (4), 327-352.

[Tversky and Gati, 1982]