A Survey on Visual Speech Recognition Classification Algorithms -Implementation Possibilities for Mobile Platforms

Fatemeh Sadat Lesani; Faranak Fotouhi Ghazvini; Rouhollah Dianat

A Survey on Visual Speech Recognition Classification Algorithms -Implementation Possibilities for Mobile Platforms

Publish place: The Second National Conference on Applied Research in Computer Science and Information Technology

Publish Year: 1393

نوع سند: مقاله کنفرانسی

زبان: English

This Paper With 7 Page And PDF Format Ready To Download

دریافت فایل کامل Paper

Certificate
من نویسنده این مقاله هستم

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این Paper:

https://civilica.com/doc/455312

شناسه ملی سند علمی:

CITCONF02_442

تاریخ نمایه سازی: 19 اردیبهشت 1395

Abstract:

Today modern methods of communication with mobile phones are highly regarded. One way to communicate with mobile devices is using visual information from the user's lips. People who suffer from speech disability or individuals associated with the breathing problem who have lacked long-term ability to speak are unable to talk with their phone. Camera phone can provide the ability to track user’s lip motion using lip reading algorithms to recognize the words and sentences. However, one of the challenges when implementing these algorithms for mobile phone is the limited resources such as memory and CPU. In this paper, after examining the constraints and challenges in the implementation of algorithms in mobile phones, a review on lip reading classification algorithms has been done and its suitability is discussed for implementation on a mobile phone.Among classification algorithm, Support Vector Machine and Hidden Markov Model are more suitable of others.

Keywords:

Visual Speech Recognition , Classification , Mobile Phones , Lip Reading , Human Computer Interaction (HCI)

Authors

Fatemeh Sadat Lesani

PhD Candidate, University of Qom, Qom, Iran

Faranak Fotouhi Ghazvini

Department of Computer Engineering and Information Technology, University of Qom

Rouhollah Dianat

Department of Computer Engineering and Information Technology, University of Qom

مراجع و منابع این Paper:

لیست زیر مراجع و منابع استفاده شده در این Paper را نمایش می دهد. این مراجع به صورت کاملا ماشینی و بر اساس هوش مصنوعی استخراج شده اند و لذا ممکن است دارای اشکالاتی باشند که به مرور زمان دقت استخراج این محتوا افزایش می یابد. مراجعی که مقالات مربوط به آنها در سیویلیکا نمایه شده و پیدا شده اند، به خود Paper لینک شده اند :

National Conference on Applied Research in Computer Science and Information ...
Yu, D., (2008), The Application of Manifold based Visual Speech ...
Silveira, L.G.D., J. Facon, and DL. Borges, (2003), Visual speech ...
Hong, X.P., et al. (2006), A PCA Based Visual DCT ...
Potaminanos, G. H.P. Graf, and E. Cosatto. (1998), An image ...
K.Yu, X. J, and H. B. (2001), Sentence lip-reading using ...
Chan, M.T. (2001), HMM-Based Audio-Visuat Speech Recognition Integrating Geometric- and ...
Potamianos, G., et al. (2003), Recent Advances in the automatic ...
Werda, S., W. Mahdi, and A.B. Hamadou. (2007), Lip Localization ...
Pandzic, I.S. and R. Forchheimer (2002), MPEG-4 Facial Animation - ...
Wang, Z.M., L.H. Cai, and H.Z. Ai (2002), A dynamic ...
Foo, S.W. and Y. Lia. (2004), Recognition of visual speech ...
Dong, L. S.W. Foo, and Y. Lian. (2005), A two-channel ...
Gordan, M., C. Kotropoulos, and I. Pitas. (202), Application of ...
Freund, Y. and R. Schapire. (1999), A short introduction to ...
Rabiner, L.R. (1989), A tutorial On Hidden Markov Models and ...
Foo, S.W. and L. Dong. (203), A boosted multi-HMM classifier ...
Yau, W., K., Kant, and A.S. Poosapadi .(207), Visual recognition ...

نمایش کامل مراجع