Predictive Analysis for Optimal Text Visibility: A Comprehensive Study on Frame-of-Interest Prediction in Book Digitization Videos

Publish Year: 1403
نوع سند: مقاله ژورنالی
زبان: English
View: 6

This Paper With 12 Page And PDF Format Ready To Download

  • Certificate
  • من نویسنده این مقاله هستم

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این Paper:

شناسه ملی سند علمی:

JR_IJE-37-11_011

تاریخ نمایه سازی: 23 تیر 1403

Abstract:

This research paper addresses an important challenge in book digitization, i.e., accurately predicting frames where text visibility is optimal. Existing models often suffer from high computational complexity, resulting in inefficiencies in automation and accuracy. In contrast, our proposed models offer a solution with lower complexity and higher accuracy. Leveraging a diverse dataset of book flipping videos, we introduce three novel models: the Regular CNN LeNet-۵ Model, the Custom LSTM Model, and the ۳D CNN Model. Evaluation reveals that our ۳D CNN Model achieves an accuracy score of ۹۹.۰۱%, with ۳۷۷,۹۲۱ parameters. These models demonstrate a significant increase in efficiency in terms of accuracy metric  with significantly less number of parametrers. Thereby the proposed approach enhances the process of identifying frames of interest. Our findings highlight the transformative potential of these models in streamlining book digitization workflows and improving accessibility to digitized textual content. This study contributes valuable insights at the intersection of computer vision, machine learning, and digitization efforts, offering a promising avenue for enhancing the usability of digitized textual resources.

Authors

G. Buddhawar

Sardar Vallabhbhai National Institute of Technology, Surat, India

D. Dave

Pimpri Chinchwad College of Engineering, Pune, India

K. N. Jariwala

Sardar Vallabhbhai National Institute of Technology, Surat, India

C. Chattopadhyay

School Computing and Data Sciences, FLAME University, Pune, India

مراجع و منابع این Paper:

لیست زیر مراجع و منابع استفاده شده در این Paper را نمایش می دهد. این مراجع به صورت کاملا ماشینی و بر اساس هوش مصنوعی استخراج شده اند و لذا ممکن است دارای اشکالاتی باشند که به مرور زمان دقت استخراج این محتوا افزایش می یابد. مراجعی که مقالات مربوط به آنها در سیویلیکا نمایه شده و پیدا شده اند، به خود Paper لینک شده اند :
  • Savadi Hosseini M, Ghaderi F. A hybrid deep learning architecture ...
  • Firouzian I, Firouzian N, Hashemi S, Kozegar E. Pain facial ...
  • Panagou S, Neumann WP, Fruggiero F. A scoping review of ...
  • Caesarendra W, Pandiyan V, Umar MM, Pamungkas DS, Sulowicz M, ...
  • Saeed W, Omlin C. Explainable AI (XAI): A systematic meta-survey ...
  • Nagaraj A, Reimers I. Digitization and the market for physical ...
  • Bond M, Bedenlier S, Marín VI, Händel M. Emergency remote ...
  • Solayman S, Aumi SA, Mery CS, Mubassir M, Khan R. ...
  • Uchiyama T, Sogi N, Niinuma K, Fukui K, editors. Visually ...
  • Øvrelid E, Bygstad B, Ludvigsen S, Dæhlen M. Dual digitalization: ...
  • Das A, Rad P. Opportunities and challenges in explainable artificial ...
  • Stall S, Cervone G, Coward C, Cutcher-Gershenfeld J, Donaldson TJ, ...
  • Kalyanathaya KP. A literature review and research agenda on explainable ...
  • Brini I, Mehri M, Ingold R, Essoukri Ben Amara N, ...
  • Naseria J, Hasanpour H, Sorkhib AG. Accelerating Legislation Processes through ...
  • Yilmaz F, Tsamados M, Osborn D. Digitizing Ottoman daily weather ...
  • Kaneko H, Ishibashi R, Meng L. Deteriorated characters restoration for ...
  • Kim G, Kim BC. Classification of functional types of lines ...
  • Bryan-Kinns N, Ford C, Chamberlain A, Benford SD, Kennedy H, ...
  • Haque AB, Islam AN, Mikalef P. Explainable Artificial Intelligence (XAI) ...
  • Eberle O, Büttner J, El-Hajj H, Montavon G, Müller K-R, ...
  • Shedthi B S, Shetty V, Chadaga R, Bhat R, Bangera ...
  • Im C, Kim Y, Mandl T. Deep learning for historical ...
  • Hassanpour H, AlyanNezhadi M, Mohammadi M. A signal processing method ...
  • Al-Najjar HA, Pradhan B, Beydoun G, Sarkar R, Park H-J, ...
  • Correia S, Luck S. Digitizing historical balance sheet data: A ...
  • Spoorthy G, Sanjeevi S. Multi-criteria–recommendations using autoencoder and deep neural ...
  • Hao Y, Wang S, Cao P, Gao X, Xu T, ...
  • Patil J. Smart vision of iot technology and digitization in ...
  • نمایش کامل مراجع