CIVILICA We Respect the Science
(ناشر تخصصی کنفرانسهای کشور / شماره مجوز انتشارات از وزارت فرهنگ و ارشاد اسلامی: ۸۹۷۱)

Predictive Analysis for Optimal Text Visibility: A Comprehensive Study on Frame-of-Interest Prediction in Book Digitization Videos

عنوان مقاله: Predictive Analysis for Optimal Text Visibility: A Comprehensive Study on Frame-of-Interest Prediction in Book Digitization Videos
شناسه ملی مقاله: JR_IJE-37-11_011
منتشر شده در در سال 1403
مشخصات نویسندگان مقاله:

G. Buddhawar - Sardar Vallabhbhai National Institute of Technology, Surat, India
D. Dave - Pimpri Chinchwad College of Engineering, Pune, India
K. N. Jariwala - Sardar Vallabhbhai National Institute of Technology, Surat, India
C. Chattopadhyay - School Computing and Data Sciences, FLAME University, Pune, India

خلاصه مقاله:
This research paper addresses an important challenge in book digitization, i.e., accurately predicting frames where text visibility is optimal. Existing models often suffer from high computational complexity, resulting in inefficiencies in automation and accuracy. In contrast, our proposed models offer a solution with lower complexity and higher accuracy. Leveraging a diverse dataset of book flipping videos, we introduce three novel models: the Regular CNN LeNet-۵ Model, the Custom LSTM Model, and the ۳D CNN Model. Evaluation reveals that our ۳D CNN Model achieves an accuracy score of ۹۹.۰۱%, with ۳۷۷,۹۲۱ parameters. These models demonstrate a significant increase in efficiency in terms of accuracy metric  with significantly less number of parametrers. Thereby the proposed approach enhances the process of identifying frames of interest. Our findings highlight the transformative potential of these models in streamlining book digitization workflows and improving accessibility to digitized textual content. This study contributes valuable insights at the intersection of computer vision, machine learning, and digitization efforts, offering a promising avenue for enhancing the usability of digitized textual resources.

کلمات کلیدی:
Book Flipping Videos, Frame of Interest, Book Digitization, predictive analysis

صفحه اختصاصی مقاله و دریافت فایل کامل: https://civilica.com/doc/2027356/