
Video Prediction Using Multi-Scale Deep Neural Networks

Publish Year: 1401
Type: Journal paper
Language: English
View: 238

This paper is 10 pages long and is available for download in PDF format.

This paper is categorized under the following subject areas:

Document National Code:

JR_JADM-10-3_011

Index date: 1 October 2022

Video Prediction Using Multi-Scale Deep Neural Networks abstract

In video prediction, the next frame of a video is predicted from a sequence of input frames. Although numerous studies tackle frame prediction, adequate performance has not yet been achieved, and the problem therefore remains open. In this article, multiscale processing for video prediction is studied and a new network architecture for multiscale processing is presented. The architecture belongs to the broad family of autoencoders and is comprised of an encoder and a decoder. A pretrained VGG serves as the encoder, processing a pyramid of input frames at multiple scales simultaneously. The decoder is based on 3D convolutional neurons. The presented architecture is evaluated on three datasets of varying difficulty, and the proposed approach is compared to two conventional autoencoders. It is observed that combining the pretrained network with multiscale processing yields a performant approach.
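The abstract's pipeline — a multi-scale pyramid of input frames, a shared encoder per scale, and a 3D-convolutional decoder that mixes scales to predict the next frame — can be sketched roughly as follows. This is a minimal illustration, not the authors' code: all names and layer sizes are hypothetical, and a tiny convolutional stack stands in where the paper uses a pretrained VGG.

```python
# Hypothetical sketch of the multi-scale autoencoder described in the
# abstract. Layer sizes are illustrative; the paper uses a pretrained VGG
# encoder, replaced here by a small conv stack for self-containment.
import torch
import torch.nn as nn
import torch.nn.functional as F

class MultiScaleEncoder(nn.Module):
    """Encodes every level of an input-frame pyramid with shared weights."""
    def __init__(self, channels=3, feat=16):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv2d(channels, feat, 3, padding=1), nn.ReLU(),
            nn.Conv2d(feat, feat, 3, padding=1), nn.ReLU(),
        )

    def forward(self, pyramid):
        # pyramid: list of (B, C, H_s, W_s) tensors at decreasing scales.
        # Upsample every feature map back to the base resolution so the
        # scales can be stacked along a new "scale" axis.
        base = pyramid[0].shape[-2:]
        feats = [F.interpolate(self.conv(x), size=base, mode="bilinear",
                               align_corners=False) for x in pyramid]
        return torch.stack(feats, dim=2)        # (B, feat, S, H, W)

class Decoder3D(nn.Module):
    """3D-convolutional decoder predicting the next frame."""
    def __init__(self, feat=16, out_channels=3):
        super().__init__()
        self.conv3d = nn.Conv3d(feat, feat, kernel_size=3, padding=1)
        self.head = nn.Conv2d(feat, out_channels, kernel_size=1)

    def forward(self, volume):
        h = F.relu(self.conv3d(volume))          # mix across the scale axis
        h = h.mean(dim=2)                        # collapse scales
        return self.head(h)                      # (B, 3, H, W) next frame

def make_pyramid(frame, scales=3):
    """Downsample one frame into a multi-scale pyramid."""
    return [frame if s == 0 else
            F.interpolate(frame, scale_factor=0.5 ** s, mode="bilinear",
                          align_corners=False)
            for s in range(scales)]

frame = torch.randn(1, 3, 64, 64)                # one input frame
enc, dec = MultiScaleEncoder(), Decoder3D()
pred = dec(enc(make_pyramid(frame)))
print(pred.shape)                                # same size as the input frame
```

In practice the encoder would consume several past frames (the paper feeds a sequence), and training would minimize a reconstruction loss between `pred` and the true next frame; this sketch only shows how pyramid levels are fused through the 3D convolution.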

Video Prediction Using Multi-Scale Deep Neural Networks Keywords:

Video Prediction Using Multi-Scale Deep Neural Networks authors

N. Shayanfar

Computer engineering department, Yazd University, Yazd, Iran.

V. Derhami

Computer engineering department, Yazd University, Yazd, Iran.

M. Rezaeian

Computer engineering department, Yazd University, Yazd, Iran.

References and sources of this paper:

The list below shows the references and sources used in this paper. These references were extracted fully automatically using artificial intelligence and may therefore contain errors; extraction accuracy improves over time. References whose corresponding papers are indexed in CivilICA are linked to those papers:
C. Zhang and J. Kim, “Modeling Long- and Short-Term Temporal ...
W. Liu, W. Luo, D. Lian, and S. Gao, “Future ...
X. Shi, Z. Chen, H. Wang, D.-Y. Yeung, W. Wong, ...
M. Mathieu, C. Couprie, and Y. LeCun, “Deep multi-scale video ...
W. Lotter, G. Kreiman, and D. Cox, “Unsupervised Learning of ...
W. Lotter, G. Kreiman, and D. Cox, “Deep predictive coding ...
S. Oprea et al., “A Review on Deep Learning Techniques ...
X. Jin et al., “Video Scene Parsing with Predictive Feature ...
J. Walker, K. Marino, A. Gupta, and M. Hebert, “The ...
M. Jamaseb Khollari, V. Derhami, and M. Yazdian Dehkordi, “Variational ...
C. Vondrick, H. Pirsiavash, and A. Torralba, “Generating Videos with ...
J. van Amersfoort, A. Kannan, M. Ranzato, A. Szlam, D. ...
L. A. Lim and H. Yalim Keles, “Foreground segmentation using ...
N. Srivastava, E. Mansimov, and R. Salakhutdinov, “Unsupervised Learning of ...
N. Mayer et al., “A Large Dataset to Train Convolutional ...
M. Menze and A. Geiger, “Object scene flow for autonomous ...
O. Ronneberger, P. Fischer, and T. Brox, “U-Net: Convolutional Networks ...
M. Sabokrou, M. Fathy, Z. Moayed, and R. Klette, “Fast ...
K. Simonyan and A. Zisserman, “Very deep convolutional networks for ...
C. Szegedy et al., “Going Deeper with Convolutions,” in Proceedings ...
Y. Wang, Z. Gao, M. Long, J. Wang, and P. ...
Y. Wang, M. Long, J. Wang, Z. Gao, and P. ...
J. Zhang, Y. Wang, M. Long, W. Jianmin, and P. ...
R. Mahjourian, M. Wicke, and A. Angelova, “Geometry-Based Next Frame ...
Z. Wang, A. C. Bovik, H. R. Sheikh, and E. ...
V. Patraucean, A. Handa, and R. Cipolla, “Spatio-temporal video autoencoder ...
T. Wang et al., “MSU-Net: Multiscale Statistical U-Net for Real-Time ...