Semantic Segmentation of Aerial Imagery: A Novel Approach Leveraging Hierarchical Multi-scale Features and Channel-based Attention for Drone Applications

Publish Year: 1403
نوع سند: مقاله ژورنالی
زبان: English
View: 32

This Paper With 14 Page And PDF Format Ready To Download

  • Certificate
  • من نویسنده این مقاله هستم

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این Paper:

شناسه ملی سند علمی:

JR_IJE-37-5_018

تاریخ نمایه سازی: 24 اسفند 1402

Abstract:

Drone semantic segmentation is a challenging task in computer vision, mainly due to inherent complexities associated with aerial imagery. This paper presents a comprehensive methodology for drone semantic segmentation and evaluates its performance using the ICG dataset. The proposed method leverages hierarchical multi-scale feature extraction and efficient channel-based attention Atrous Spatial Pyramid Pooling (ASPP) to address the unique challenges encountered in this domain. In this study, the performance of the proposed method is compared to several state-of-the-art models. The findings of this research highlight the effectiveness of the proposed method in tackling the challenges of drone semantic segmentation. The outcomes demonstrate its superiority over the state-of-the-art models, showcasing its potential for accurate and efficient segmentation of aerial imagery. The results contribute to the advancement of drone-based applications, such as surveillance, object tracking, and environmental monitoring, where precise semantic segmentation is crucial. The obtained experimental results demonstrate that the proposed method outperforms these existing approaches regarding Dice, mIOU, and accuracy metrics. Specifically, the proposed method achieves an impressive performance with Dice, mIOU, and accuracy scores of ۸۶.۵۱%, ۷۶.۲۳%, and ۹۱.۷۴%, respectively.

Keywords:

Semantic drone segmentation , Hierarchical Multi-Scale Feature Extraction , Efficient Channel-based Attention , Atrous Spatial Pyramid Pooling

Authors

E. Sahragard

Department of Electrical and Computer Engineering, University of Birjand, Birjand, Iran

H. Farsi

Department of Electrical and Computer Engineering, University of Birjand, Birjand, Iran

S. Mohamadzadeh

Department of Electrical and Computer Engineering, University of Birjand, Birjand, Iran

مراجع و منابع این Paper:

لیست زیر مراجع و منابع استفاده شده در این Paper را نمایش می دهد. این مراجع به صورت کاملا ماشینی و بر اساس هوش مصنوعی استخراج شده اند و لذا ممکن است دارای اشکالاتی باشند که به مرور زمان دقت استخراج این محتوا افزایش می یابد. مراجعی که مقالات مربوط به آنها در سیویلیکا نمایه شده و پیدا شده اند، به خود Paper لینک شده اند :
  • Bhatnagar S, Gill L, Ghosh B. Drone image segmentation using ...
  • Asgari Taghanaki S, Abhishek K, Cohen JP, Cohen-Adad J, Hamarneh ...
  • Chakravarthy AS, Sinha S, Narang P, Mandal M, Chamola V, ...
  • Habibi M, Hassanpour H. Splicing Image Forgery Detection and Localization ...
  • Zagoruyko S, Komodakis N. Wide residual networks. arXiv preprint arXiv:۱۶۰۵۰۷۱۴۶. ...
  • Kestur R, Farooq S, Abdal R, Mehraj E, Narasipura O, ...
  • Giang TL, Dang KB, Le QT, Nguyen VG, Tong SS, ...
  • Zhao H, Shi J, Qi X, Wang X, Jia J, ...
  • Shaw P, Uszkoreit J, Vaswani A. Self-attention with relative position ...
  • Wang Q, Wu B, Zhu P, Li P, Zuo W, ...
  • Prakash S, Shah P, Agrawal A. Exploiting CNNs for Semantic ...
  • Ronneberger O, Fischer P, Brox T, editors. U-net: Convolutional networks ...
  • Guo Z, Xu J, Liu A, editors. Remote sensing image ...
  • نمایش کامل مراجع