Analyzing the Inference Process in Deep Convolutional Neural Networks using Principal Eigenfeatures, Saturation and Logistic Regression Probes

Mats Leon Richter; Leila Malihi; Anne-Kathrin Patricia Windler; Ulf Krumnack

Published in: Journal of Applied Research in Electrical Engineering, Vol. 2, Issue 1

Publication year: 1402

Document type: Journal article

Language: English

Length: 10 pages, available for download as PDF

Permanent link to this paper:

https://civilica.com/doc/1862766

National scientific document ID:

JR_JAREE-2-1_001

Indexing date: 3 Dey 1402

Abstract:

The predictive performance of a neural network depends on the one hand on the difficulty of a problem, defined by the number of classes and complexity of the visual domain, and on the other hand on the capacity of the model, determined by the number of parameters and its structure. By applying layer saturation and logistic regression probes, we confirm that these factors influence the inference process in an antagonistic manner. This analysis allows the detection of over- and under-parameterization of convolutional neural networks. We show that the observed effects are independent of previously reported pathological patterns, like the “tail pattern”. In addition, we study the emergence of saturation patterns during training, showing that saturation patterns emerge early in the optimization process. This allows for quick detection of problems and potentially decreased cycle time during experiments. We also demonstrate that the emergence of tail patterns is independent of the capacity of the networks. Finally, we show that information processing within a tail of unproductive layers is different, depending on the topology of the neural network architecture.
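
The abstract does not spell out how layer saturation is computed. As a rough illustration only, the sketch below assumes one common formulation: the share of a layer's feature dimensions (principal eigendirections of the activation covariance) needed to explain a fixed fraction of the variance. The function name `layer_saturation`, the 0.99 threshold, and the use of NumPy are assumptions made for this example and are not taken from the paper.

```python
import numpy as np


def layer_saturation(activations: np.ndarray, delta: float = 0.99) -> float:
    """Share of feature dimensions needed to explain `delta` of the variance.

    activations: array of shape (n_samples, n_features), one row per input.
    Both the function and the default threshold are illustrative assumptions.
    """
    n_features = activations.shape[1]
    # Covariance matrix of the features (columns are the variables).
    cov = np.atleast_2d(np.cov(activations, rowvar=False))
    # Eigenvalues of the symmetric covariance matrix, largest first.
    eigvals = np.linalg.eigvalsh(cov)[::-1]
    # Guard against tiny negative values caused by floating-point error.
    eigvals = np.clip(eigvals, 0.0, None)
    total = eigvals.sum()
    if total == 0.0:
        # Constant activations carry no variance at all.
        return 0.0
    explained = np.cumsum(eigvals) / total
    # Smallest k such that the leading k eigendirections explain >= delta.
    k = int(np.searchsorted(explained, delta)) + 1
    return min(k, n_features) / n_features
```

Under this reading, a value near 1 means the layer uses almost all of its available dimensions, while a value near 0 means most of the variance sits in a few directions. A logistic regression probe would complement this by fitting a linear classifier on the same activations and reporting its accuracy per layer.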

Keywords:

convolutional neural networks, logistic regression probes, saturation, eigenfeatures, tail pattern

Authors

Mats Leon Richter

Institute of Cognitive Science, University of Osnabrück, Osnabrück, Germany

Leila Malihi

Institute of Cognitive Science, University of Osnabrück, Osnabrück, Germany

Anne-Kathrin Patricia Windler

Institute of Cognitive Science, University of Osnabrück, Osnabrück, Germany

Ulf Krumnack

Institute of Cognitive Science, University of Osnabrück, Osnabrück, Germany
