Persian Printed Document Analysis and Page Segmentation
Published in: Journal of Computer and Robotics, Vol. 1, Issue 1
Publication year: 1386 (Solar Hijri calendar)
Document type: Journal article
Language: English
Length: 16 pages (PDF)
National scientific document ID:
JR_JCR-1-1_006
Indexing date: 23 Dey 1396
Abstract:
This paper presents a hybrid method for Persian page segmentation that combines low-resolution and high-resolution analysis. In the low-resolution stage, a pyramidal image structure is constructed for multiscale analysis, and the document image is segmented into a set of regions. In the high-resolution stage, connected component analysis segments each region into homogeneous regions and identifies them as text, images, or tables/drawings. The proposed method was tested on Persian documents. The results of these tests show that the proposed method yields more accurate and faster results.
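The two stages named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the 2x2 OR reduction used to build the pyramid, the choice of 8-connectivity, and the names `downsample`, `build_pyramid`, and `connected_components` are all assumptions made for illustration. The classification of regions into text, images, and tables/drawings is not attempted here.

```python
from collections import deque


def downsample(img):
    """Halve a binary image (assumed scheme): an output pixel is 1 if any
    pixel in the corresponding 2x2 block of the input is 1."""
    h, w = len(img), len(img[0])
    return [
        [
            int(any(img[y][x]
                    for y in range(r, min(r + 2, h))
                    for x in range(c, min(c + 2, w))))
            for c in range(0, w, 2)
        ]
        for r in range(0, h, 2)
    ]


def build_pyramid(img, levels):
    """Return a list of `levels` images, from full resolution to coarsest."""
    pyramid = [img]
    for _ in range(levels - 1):
        pyramid.append(downsample(pyramid[-1]))
    return pyramid


def connected_components(img):
    """Return bounding boxes (top, left, bottom, right) of the 8-connected
    foreground components, in raster-scan order of their first pixel."""
    h, w = len(img), len(img[0])
    seen = [[False] * w for _ in range(h)]
    boxes = []
    for r in range(h):
        for c in range(w):
            if not img[r][c] or seen[r][c]:
                continue
            seen[r][c] = True
            queue = deque([(r, c)])
            top, left, bottom, right = r, c, r, c
            while queue:  # breadth-first flood fill of one component
                y, x = queue.popleft()
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if (0 <= ny < h and 0 <= nx < w
                                and img[ny][nx] and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            boxes.append((top, left, bottom, right))
    return boxes


# Toy binary "page": one block at the top left, two thin strokes at the bottom right.
page = [
    [1, 1, 0, 0, 0, 0, 0, 0],
    [1, 1, 0, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 1, 0, 1],
    [0, 0, 0, 0, 0, 1, 0, 1],
]
pyramid = build_pyramid(page, levels=2)
coarse_regions = connected_components(pyramid[1])  # low-resolution pass
fine_components = connected_components(page)       # high-resolution pass
```

On this toy page the coarse level yields two regions, because the two nearby strokes merge once the resolution is halved, while the full-resolution pass separates them into three components. This is the kind of coarse-to-fine behaviour a hybrid scheme can exploit, under the assumptions stated above.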
Keywords:
Page segmentation, pyramidal image structure, connected components, horizontal and vertical merging
Authors
Ali Broumandnia
Department of Computer & IT, Islamic Azad University-South Tehran Branch, Tehran, Iran
Jamshid Shanbehzadeh
Department of Computer, Tarbiat Moalem University, Iran