A Survey on Visual Speech Recognition Classification Algorithms -Implementation Possibilities for Mobile Platforms

Publish Year: 1393
نوع سند: مقاله کنفرانسی
زبان: English
View: 544

This Paper With 7 Page And PDF Format Ready To Download

  • Certificate
  • من نویسنده این مقاله هستم

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این Paper:

شناسه ملی سند علمی:

CITCONF02_442

تاریخ نمایه سازی: 19 اردیبهشت 1395

Abstract:

Today modern methods of communication with mobile phones are highly regarded. One way to communicate with mobile devices is using visual information from the user's lips. People who suffer from speech disability or individuals associated with the breathing problem who have lacked long-term ability to speak are unable to talk with their phone. Camera phone can provide the ability to track user’s lip motion using lip reading algorithms to recognize the words and sentences. However, one of the challenges when implementing these algorithms for mobile phone is the limited resources such as memory and CPU. In this paper, after examining the constraints and challenges in the implementation of algorithms in mobile phones, a review on lip reading classification algorithms has been done and its suitability is discussed for implementation on a mobile phone.Among classification algorithm, Support Vector Machine and Hidden Markov Model are more suitable of others.

Keywords:

Authors

Fatemeh Sadat Lesani

PhD Candidate, University of Qom, Qom, Iran

Faranak Fotouhi Ghazvini

Department of Computer Engineering and Information Technology, University of Qom

Rouhollah Dianat

Department of Computer Engineering and Information Technology, University of Qom

مراجع و منابع این Paper:

لیست زیر مراجع و منابع استفاده شده در این Paper را نمایش می دهد. این مراجع به صورت کاملا ماشینی و بر اساس هوش مصنوعی استخراج شده اند و لذا ممکن است دارای اشکالاتی باشند که به مرور زمان دقت استخراج این محتوا افزایش می یابد. مراجعی که مقالات مربوط به آنها در سیویلیکا نمایه شده و پیدا شده اند، به خود Paper لینک شده اند :
  • National Conference on Applied Research in Computer Science and Information ...
  • Yu, D., (2008), The Application of Manifold based Visual Speech ...
  • Silveira, L.G.D., J. Facon, and DL. Borges, (2003), Visual speech ...
  • Hong, X.P., et al. (2006), A PCA Based Visual DCT ...
  • Potaminanos, G. H.P. Graf, and E. Cosatto. (1998), An image ...
  • K.Yu, X. J, and H. B. (2001), Sentence lip-reading using ...
  • Chan, M.T. (2001), HMM-Based Audio-Visuat Speech Recognition Integrating Geometric- and ...
  • Potamianos, G., et al. (2003), Recent Advances in the automatic ...
  • Werda, S., W. Mahdi, and A.B. Hamadou. (2007), Lip Localization ...
  • Pandzic, I.S. and R. Forchheimer (2002), MPEG-4 Facial Animation - ...
  • Wang, Z.M., L.H. Cai, and H.Z. Ai (2002), A dynamic ...
  • Foo, S.W. and Y. Lia. (2004), Recognition of visual speech ...
  • Dong, L. S.W. Foo, and Y. Lian. (2005), A two-channel ...
  • Gordan, M., C. Kotropoulos, and I. Pitas. (202), Application of ...
  • Freund, Y. and R. Schapire. (1999), A short introduction to ...
  • Rabiner, L.R. (1989), A tutorial On Hidden Markov Models and ...
  • Foo, S.W. and L. Dong. (203), A boosted multi-HMM classifier ...
  • Yau, W., K., Kant, and A.S. Poosapadi .(207), Visual recognition ...
  • نمایش کامل مراجع