Introducing a database for Farsi document image understanding and segmentation

Faraji, Amin; Saeed, Masoud; Nezamabadi-pour, Hossein

Document Type : Research Paper

Authors

¹ Master Student of Computer Engineering at Shahid Bahonar University of Kerman

² Dept. of Engineering, Shahid Bahonar University of Kerman

Abstract

Document image segmentation has recently attracted considerable attention from researchers. Unfortunately, no publicly available benchmark dataset has been reported for Farsi document image understanding and segmentation. This article presents a benchmark dataset of 5598 images for Farsi document image segmentation. The images are taken from newspapers, textbooks, and academic articles. Objects in the images are labeled in six categories so that they can be used easily in subsequent applications: paragraph (text), figure, table, logo, mathematical equation, and header. To assess the effectiveness of the proposed dataset, three well-known existing deep-learning methods are applied to it and the results are presented.

Keywords

20.1001.1.23831197.1402.10.2.3.3

Journal of Machine Vision and Image Processing

Volume 10, Issue 2
July 2023
Pages 31-46
