Voronoi diagrams for page segmentation

Page segmentation is a key task in document processing, enabling effective extraction of structured information from diverse document types. This paper presents an in-depth analysis of the method proposed by Kise et al., a bottom-up approach using area Voronoi diagrams to identify spatial relationsh...

Descripción completa

Detalles Bibliográficos
Autores: Gardella, Marina, Ramírez Paulino, Ignacio
Tipo de recurso: artículo
Estado:Versión publicada
Fecha de publicación:2026
País:Uruguay
Institución:Universidad de la República
Repositorio:COLIBRI
Idioma:inglés
OAI Identifier:oai:colibri.udelar.edu.uy:20.500.12008/53742
Acceso en línea:http://www.ipol.im/pub/pre/591/
https://hdl.handle.net/20.500.12008/53742
https://doi.org/10.5201/ipol.2026.591
Access Level:acceso abierto
Palabra clave:Document layout analysis
Voronoi diagram
Descripción
Sumario:Page segmentation is a key task in document processing, enabling effective extraction of structured information from diverse document types. This paper presents an in-depth analysis of the method proposed by Kise et al., a bottom-up approach using area Voronoi diagrams to identify spatial relationships between document parts. Our work provides a detailed description of the method, emphasizing clarity, reproducibility, and transparency, particularly regarding aspects not fully specified in the original paper. We highlight the impact of the parameter settings and preprocessing steps on the method’s performance. Through extensive testing, we demonstrate that the method can handle a wide range of layouts but exhibits notable sensitivity to specific document characteristics, especially in handling complex elements like handwritten text, lists, drop-caps, and tables.