Application of imputation methods for missing values of PM۱۰ and O۳ data: Interpolation, moving average and K-nearest neighbor methods
Publish Year: 1400
نوع سند: مقاله ژورنالی
زبان: English
View: 235
This Paper With 12 Page And PDF Format Ready To Download
- Certificate
- من نویسنده این مقاله هستم
استخراج به نرم افزارهای پژوهشی:
شناسه ملی سند علمی:
JR_EHEM-8-3_007
تاریخ نمایه سازی: 6 مهر 1400
Abstract:
Background: PIn air quality studies, it is very often to have missing data due to reasons such as machine failure or human error. The approach used in dealing with such missing data can affect the results of the analysis. The main aim of this study was to review the types of missing mechanism, imputation methods, application of some of them in imputation of missing of PM۱۰ and O۳ in Tabriz, and compare their efficiency.
Methods: Methods of mean, EM algorithm, regression, classification and regression tree, predictive mean matching (PMM), interpolation, moving average, and K-nearest neighbor (KNN) were used. PMM was investigated by considering the spatial and temporal dependencies in the model. Missing data were randomly simulated with ۱۰, ۲۰, and ۳۰% missing values. The efficiency of methods was compared using coefficient of determination (R۲), mean absolute error (MAE) and root mean square error (RMSE).
Results: Based on the results for all indicators, interpolation, moving average, and KNN had the best performance, respectively. PMM did not perform well with and without spatio-temporal information.
Conclusion: Given that the nature of pollution data always depends on next and previous information, methods that their computational nature is based on before and after information indicated better performance than others, so in the case of pollutant data, it is recommended to use these methods.
Keywords:
Authors
Parisa Saeipourdizaj
Department of Statistics and Epidemiology, Faculty of Health, Tabriz University of Medical Sciences, Tabriz, Iran
Parvin Sarbakhsh
Corresponding author: Health and Environment Research Center, Tabriz University of Medical Sciences, Department of Statistics and Epidemiology, Faculty of Health, Tabriz University of Medical Sciences, Tabriz, Iran
Akbar Gholampour
Health and Environment Research Center, Tabriz University of Medical Sciences, Department of Environmental Health Engineering, School of Public Health, Tabriz University of Medical Sciences, Tabriz, Iran
مراجع و منابع این Paper:
لیست زیر مراجع و منابع استفاده شده در این Paper را نمایش می دهد. این مراجع به صورت کاملا ماشینی و بر اساس هوش مصنوعی استخراج شده اند و لذا ممکن است دارای اشکالاتی باشند که به مرور زمان دقت استخراج این محتوا افزایش می یابد. مراجعی که مقالات مربوط به آنها در سیویلیکا نمایه شده و پیدا شده اند، به خود Paper لینک شده اند :