Document image binarization by using texture-edge descriptor
Paper title: Document image binarization by using texture-edge descriptor
National paper ID: CSICC14_056
Published in the 14th Annual International Conference of the Computer Society of Iran, 1388 (Solar Hijri calendar)
Authors:
N Armanfard - Dept of Electrical Engineering, Tarbiat Modarres University
M Valizadeh - Dept of Electrical Engineering, Tarbiat Modarres University
M Komeili - Dept of Electrical Engineering, Tarbiat Modarres University
E Kabir
Abstract:
In this paper we propose a new approach to text region extraction in camera-captured document images, based on the Texture-Edge Descriptor (TED). TED is an 8-bit binary number whose bits are structural. These structural bits, together with the particular characteristics of text regions in document images, make TED an appropriate descriptor for text region extraction. Applying the well-known water flow method to the text regions extracted by TED yields fast, good-quality document image binarization. Experimental results demonstrate the effectiveness of our method for text region extraction and document image binarization.
Keywords: binarization, document image, texture-edge descriptor, water flow
Paper page and full-text download: https://civilica.com/doc/73022/