Language:

ARDIS: a Swedish historical handwritten digit dataset

Neural computing & applications, 2020-11, Vol.32 (21), p.16505-16518 [Peer Reviewed Journal]

The Author(s) 2019 ;ISSN: 0941-0643 ;ISSN: 1433-3058 ;EISSN: 1433-3058 ;DOI: 10.1007/s00521-019-04163-3

Full text available

Citations Cited by

Actions
1. Add to My Research
2. Remove from My Research
3. E-mail
4. Print
5. Permalink
6. Citation
7. EasyBib
8. EndNote
9. RefWorks
10. Delicious
11. Export RIS
12. Export BibTeX

Title:
ARDIS: a Swedish historical handwritten digit dataset
Author: Kusetogullari, Huseyin ; Yavariabdi, Amir ; Cheddad, Abbas ; Grahn, Håkan ; Hall, Johan
Subjects: ARDIS dataset ; Artificial Intelligence ; Benchmark ; Computational Biology/Bioinformatics ; Computational Science and Engineering ; Computer Science ; Data Mining and Knowledge Discovery ; Handwritten digit recognition ; IAPR-MedPRAI ; Image Processing and Computer Vision ; Machine learning methods ; Probability and Statistics in Computer Science
Is Part Of: Neural computing & applications, 2020-11, Vol.32 (21), p.16505-16518
Description: This paper introduces a new image-based handwritten historical digit dataset named Arkiv Digital Sweden (ARDIS). The images in ARDIS dataset are extracted from 15,000 Swedish church records which were written by different priests with various handwriting styles in the nineteenth and twentieth centuries. The constructed dataset consists of three single-digit datasets and one-digit string dataset. The digit string dataset includes 10,000 samples in red–green–blue color space, whereas the other datasets contain 7600 single-digit images in different color spaces. An extensive analysis of machine learning methods on several digit datasets is carried out. Additionally, correlation between ARDIS and existing digit datasets Modified National Institute of Standards and Technology (MNIST) and US Postal Service (USPS) is investigated. Experimental results show that machine learning algorithms, including deep learning methods, provide low recognition accuracy as they face difficulties when trained on existing datasets and tested on ARDIS dataset. Accordingly, convolutional neural network trained on MNIST and USPS and tested on ARDIS provide the highest accuracies 58.80 % and 35.44 % , respectively. Consequently, the results reveal that machine learning methods trained on existing datasets can have difficulties to recognize digits effectively on our dataset which proves that ARDIS dataset has unique characteristics. This dataset is publicly available for the research community to further advance handwritten digit recognition algorithms.
Publisher: London: Springer London
Language: English
Identifier: ISSN: 0941-0643
ISSN: 1433-3058
EISSN: 1433-3058
DOI: 10.1007/s00521-019-04163-3
Source: Springer Open Access Journals
SWEPUB Freely available online
ProQuest Central

Back to results list


INSPIRE LIBRARY - TON DUC THANG UNIVERSITY	(84-028) 37 755 057	Feedback
19 Nguyen Huu Tho St. Dist.7, HCM	thuvien@tdtu.edu.vn	Feedback

ARDIS: a Swedish historical handwritten digit dataset

The Author(s) 2019 ;ISSN: 0941-0643 ;ISSN: 1433-3058 ;EISSN: 1433-3058 ;DOI: 10.1007/s00521-019-04163-3

Searching Remote Databases, Please Wait