PHND: Pashtu handwritten numerals database and deep learning benchmark

In this paper we introduce a real Pashtu handwritten numerals dataset (PHND) having 50,000 scanned images and make publicly available for research and scientific use. Although more than fifty million people in the world use this language for written and oral communication, no significant efforts are...

Descripción completa

Detalles Bibliográficos
Autores: Khan, K. (Khalil)|||/items/037a32b4-31fc-4b17-8761-b1fcbe2f1ba6, Roh, B. (Byeong-hee)|||/items/5cd8bec8-24db-47ab-bb82-d7a07218d5db, Ali, J. (Jehad)|||/items/6b2afca8-1cfa-4c41-883c-4b960d2a5195, Ullah-Khan, R. (Rehan)|||/items/95061f14-b720-42e9-99ef-d8fc5f15ec76, Uddin, I. (Irfan)|||/items/24d89ccf-d434-4d92-a203-3f22cd141e8b, Hassan, S. (Saqlain)|||/items/ab209186-88af-4239-9691-b4e8df1cedf7, Riaz, R. (Rabia)|||/items/6d1bbe8b-786c-4dcf-8982-dbaf1c54f199, Ahmad, N. (Nasir)|||/items/314d81a3-92de-4963-9775-7f17842bf528
Tipo de recurso: artículo
Fecha de publicación:2020
País:España
Institución:Universidad de Navarra
Repositorio:Dadun. Depósito Académico Digital de la Universidad de Navarra
Idioma:inglés
OAI Identifier:oai:dadun.unav.edu:10171/67659
Acceso en línea:https://hdl.handle.net/10171/67659
Access Level:acceso abierto
Palabra clave:Pashtu handwritten numerals dataset (PHND)
Handwriting
Pashtu optical character recognition (POCR)
Algorithm
Deep learning
Recognition
Descripción
Sumario:In this paper we introduce a real Pashtu handwritten numerals dataset (PHND) having 50,000 scanned images and make publicly available for research and scientific use. Although more than fifty million people in the world use this language for written and oral communication, no significant efforts are devoted to the Pashtu Optical Character Recognition (POCR). We present a new approach for Pahstu handwritten numerals recognition (PHNR) based on deep neural networks. We train Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs) on high-frequency numerals for feature extraction and classification. We evaluated the performance of the proposed algorithm on the newly introduced Pashtu handwritten numerals database PHND and Bangla language number database CMATERDB 3.1.1. We obtained best recognition rate of 98.00% and 98.64% on PHND and CMATERDB 3.1.1. respectively.