Multi-Task Learning Using Uncertainty for Realtime Multi-Person Pose Estimation

Publish Year: 1403
نوع سند: مقاله ژورنالی
زبان: English
View: 28

This Paper With 16 Page And PDF Format Ready To Download

  • Certificate
  • من نویسنده این مقاله هستم

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این Paper:

شناسه ملی سند علمی:

JR_JECEI-12-1_010

تاریخ نمایه سازی: 5 دی 1402

Abstract:

kground and Obejctives: Multi-task learning is a widespread mechanism to improve the learning of multiple objectives with a shared representation in one deep neural network. In multi-task learning, it is critical to determine how to combine the tasks loss functions. The straightforward way is to optimize the weighted linear sum of multiple objectives with equal weights. Despite some studies that have attempted to solve the realtime multi-person pose estimation problem from a ۲D image, major challenges still remain unresolved. Methods: The prevailing solutions are two-stream, learning two tasks simultaneously. They intrinsically use a multi-task learning approach for predicting the confidence maps of body parts and the part affinity fields to associate the parts to each other. They optimize the average of the two tasks loss functions, while the two tasks have different levels of difficulty and uncertainty. In this work, we overcome this problem by applying a multi-task objective that captures task-based uncertainties without any additional parameters. Since the estimated poses can be more certain, the proposed method is called “CertainPose”. Results: Experiments are carried out on the COCO keypoints data sets. The results show that capturing the task-dependent uncertainty makes the training procedure faster and causes some improvements in human pose estimation. Conclusion: The highlight advantage of our method is improving the realtime multi-person pose estimation without increasing computational complexity.

Keywords:

Realtime Multi-Person Pose Estimation , Multi-Task Learning , Loss Function , Task-Dependent Uncertainty

Authors

Z. Ghasemi-Naraghi

Artificial Intelligence and Robotics Department, Faculty of Computer Engineering and Information Technology, Amirkabir University of Technology, Tehran , Iran.

A. Nickabadi

Artificial Intelligence and Robotics Department, Faculty of Computer Engineering and Information Technology, Amirkabir University of Technology, Tehran , Iran.

R. Safabakhsh

Artificial Intelligence and Robotics Department, Faculty of Computer Engineering and Information Technology, Amirkabir University of Technology, Tehran , Iran.

مراجع و منابع این Paper:

لیست زیر مراجع و منابع استفاده شده در این Paper را نمایش می دهد. این مراجع به صورت کاملا ماشینی و بر اساس هوش مصنوعی استخراج شده اند و لذا ممکن است دارای اشکالاتی باشند که به مرور زمان دقت استخراج این محتوا افزایش می یابد. مراجعی که مقالات مربوط به آنها در سیویلیکا نمایه شده و پیدا شده اند، به خود Paper لینک شده اند :
  • Y. Yang, D. Ramanan, “Articulated pose estimation with flexible mixtures-of-parts,” ...
  • P. F. Felzenszwalb, D. P. Huttenlocher, “Pictorial structures for object ...
  • X. Chen, A. L. Yuille, “Articulated pose estimation by a ...
  • K. He, X. Zhang, S. Ren, J. Sun, “Deep residual ...
  • S. Ren, K. He, R. Girshick, J. Sun, “Faster r-cnn: ...
  • G. Papandreou, T. Zhu, N. Kanazawa, A. Toshev, J. Tompson, ...
  • K. He, G. Gkioxari, P. Dollár, R. Girshick, “Mask r-cnn,” ...
  • S. E. Wei, V. Ramakrishna, T. Kanade, Y. Sheikh, ” ...
  • H. S. Fang, S. Xie, Y. W. Tai, C. Lu, ...
  • L. Ladicky, P. H. Torr, A. Zisserman, “Human pose estimation ...
  • U. Iqbal, J. Gall, “Multi-person pose estimation with local joint-to-person ...
  • L. Pishchulin, E. Insafutdinov, S. Tang, B. Andres, M. Andriluka, ...
  • E. Insafutdinov, L. Pishchulin, B. Andres, M. Andriluka, B. Schiele, ...
  • Z. Cao, T. Simon, S. E. Wei, Y. Sheikh, “Realtime ...
  • X. Zhu, Y. Jiang, Z. Luo, “Multi-person pose estimation for ...
  • D. Osokin, “Real-time ۲d multi-person pose estimation on cpu: Lightweight ...
  • Z. Cao, G. Hidalgo, T. Simon, S. E. Wei, Y. ...
  • OpenPose library. https://github.com/CMU-Perceptual-Computing-Lab/openpose ...
  • H. Liu, D. Luo, S. Du, T. Ikenaga, “Resolution irrelevant ...
  • G. H. Martınez, “OpenPose: Whole-Body Pose Estimation,” April, ۲۰۱۹ ...
  • T. Gong, T. Lee, C. Stephenson, V. Renduchintala, S. Padhy, ...
  • Q. Dang, J. Yin, B. Wang, W. Zheng, “Deep learning ...
  • C. Zheng, W. Wu, T. Yang, S. Zhu, C. Chen, ...
  • T. L. Munea, Y. Z. Jembre, H. T. Weldegebriel, L. ...
  • W. Gong, X. Zhang, J. Gonzàlez, A. Sobral, T. Bouwmans, ...
  • G. Rogez, C. Schmid, “Mocap-guided data augmentation for ۳d pose ...
  • H. Jiang, “Finding human poses in videos using concurrent matching ...
  • H. Sidenbladh, F. De la Torre, M. J. Black, “A ...
  • J. J. Tompson, A. Jain, Y. LeCun, C. Bregler, “Joint ...
  • V. Ramakrishna, D. Munoz, M. Hebert, J. A. Bagnell, Y. ...
  • MSCOCO Dataset, https://cocodataset.org/home ...
  • T. Simon, H. Joo, I. Matthews, Y. Sheikh, “Hand keypoint ...
  • S. Kreiss, L. Bertoni, A. Alahi, “Pifpaf: Composite fields for ...
  • N. Nakano, T. Sakura, K. Ueda, L. Omura, A. Kimura, ...
  • N. D. Reddy, L. Guigues, L. Pishchulin, J. Eledath, S. ...
  • N. D. Reddy, M. Vo, S. G. Narasimhan, “Occlusion-net: ۲d/۳d ...
  • Y. Cheng, B. Wang, B. Yang, R. T. Tan, “Monocular ...
  • H. Tu, C. Wang, W. Zeng, “Voxelpose: Towards multi-camera ۳d ...
  • G. Zhang, J. Liu, H. Li, Y. Q. Chen, L. ...
  • M. Schwarz, H. Schulz, S. Behnke, “Rgb-d object recognition and ...
  • A. Krull, E. Brachmann, F. Michel, M. Y. Yang, S. ...
  • Y. Gal, :Uncertainty in deep learning,” University of Cambridge ۱(۳), ...
  • A. Kendall, Y. Gal, “What uncertainties do we need in ...
  • F. K. Gustafsson, M. Danelljan, T. B. Schon, “Evaluating scalable ...
  • I. Misra, A. Shrivastava, A. Gupta, M. Hebert, “Cross-stitch networks ...
  • Z. Chen, V. Badrinarayanan, C. Y. Lee, A. Rabinovich, “Gradnorm: ...
  • A. Kendall, Y. Gal, R. Cipolla, “Multi-task learning using uncertainty ...
  • T. Y. Lin, M. Maire, S. Belongie, J. Hays, P. ...
  • M. Ruggero Ronchi, P. Perona, “Benchmarking and error diagnosis in ...
  • A. Newell, Z. Huang, J. Deng, “Associative embedding: End-to-end learning ...
  • M. Kocabas, S. Karagoz, E. Akbas, “Multiposenet: Fast multi-person pose ...
  • نمایش کامل مراجع