Predictive Analysis for Optimal Text Visibility: A Comprehensive Study on Frame-of-Interest Prediction in Book Digitization Videos
Publish Year: 1403
نوع سند: مقاله ژورنالی
زبان: English
View: 6
This Paper With 12 Page And PDF Format Ready To Download
- Certificate
- من نویسنده این مقاله هستم
استخراج به نرم افزارهای پژوهشی:
شناسه ملی سند علمی:
JR_IJE-37-11_011
تاریخ نمایه سازی: 23 تیر 1403
Abstract:
This research paper addresses an important challenge in book digitization, i.e., accurately predicting frames where text visibility is optimal. Existing models often suffer from high computational complexity, resulting in inefficiencies in automation and accuracy. In contrast, our proposed models offer a solution with lower complexity and higher accuracy. Leveraging a diverse dataset of book flipping videos, we introduce three novel models: the Regular CNN LeNet-۵ Model, the Custom LSTM Model, and the ۳D CNN Model. Evaluation reveals that our ۳D CNN Model achieves an accuracy score of ۹۹.۰۱%, with ۳۷۷,۹۲۱ parameters. These models demonstrate a significant increase in efficiency in terms of accuracy metric with significantly less number of parametrers. Thereby the proposed approach enhances the process of identifying frames of interest. Our findings highlight the transformative potential of these models in streamlining book digitization workflows and improving accessibility to digitized textual content. This study contributes valuable insights at the intersection of computer vision, machine learning, and digitization efforts, offering a promising avenue for enhancing the usability of digitized textual resources.
Keywords:
Authors
G. Buddhawar
Sardar Vallabhbhai National Institute of Technology, Surat, India
D. Dave
Pimpri Chinchwad College of Engineering, Pune, India
K. N. Jariwala
Sardar Vallabhbhai National Institute of Technology, Surat, India
C. Chattopadhyay
School Computing and Data Sciences, FLAME University, Pune, India
مراجع و منابع این Paper:
لیست زیر مراجع و منابع استفاده شده در این Paper را نمایش می دهد. این مراجع به صورت کاملا ماشینی و بر اساس هوش مصنوعی استخراج شده اند و لذا ممکن است دارای اشکالاتی باشند که به مرور زمان دقت استخراج این محتوا افزایش می یابد. مراجعی که مقالات مربوط به آنها در سیویلیکا نمایه شده و پیدا شده اند، به خود Paper لینک شده اند :