Del píxel a las resonancias visuales: la imagen con voz propia

Rosado Rodrigo, Pilar; Figueras Ferrer, Eva; Reverter Comes, Ferran

Del píxel a las resonancias visuales: la imagen con voz propia

The objective of our research is to develop a series of computer vision programs to search for analogies in large datasets¿in this case, collections of images of abstract paintings¿ based solely on their visual content without textual annotation. We have programmed an algorithm based on a specific m...

Descripción completa

Detalles Bibliográficos
Autores:	Rosado Rodrigo, Pilar, Figueras Ferrer, Eva, Reverter Comes, Ferran
Tipo de recurso:	artículo
Estado:	Versión publicada
Fecha de publicación:	2016
País:	España
Institución:	Varias* (Consorci de Biblioteques Universitáries de Catalunya, Centre de Serveis Científics i Acadèmics de Catalunya)
Repositorio:	Recercat. Dipósit de la Recerca de Catalunya
OAI Identifier:	oai:recercat.cat:2445/100830
Acceso en línea:	https://hdl.handle.net/2445/100830
Access Level:	acceso abierto
Palabra clave:	Visió per ordinador Processament digital d'imatges Probabilitats Art abstracte Computer vision Digital image processing Probabilities Abstract art

Descripción
Sumario:	The objective of our research is to develop a series of computer vision programs to search for analogies in large datasets¿in this case, collections of images of abstract paintings¿ based solely on their visual content without textual annotation. We have programmed an algorithm based on a specific model of image description used in computer vision. This approach involves placing a regular grid over the image and selecting a pixel region around each node. Dense features computed over this regular grid with overlapping patches are used to represent the images. Analysing the distances between the whole set of image descriptors we are able to group them according to their similarity and each resulting group will determines what we call 'visual words'. This model is called Bag-of-Words representation Given the frequency with which each visual word occurs in each image, we apply the method pLSA (Probabilistic Latent Semantic Analysis), a statistical model that classifies fully automatically, without any textual annotation, images according to their formal patterns. In

Del píxel a las resonancias visuales: la imagen con voz propia

Similares en LA Referencia