Evaluation of Selected Machine Learning Algorithms to Predict Longevity in Iranian Holstein Dairy Cattle

Publish Year: 1404
نوع سند: مقاله ژورنالی
زبان: English
View: 132

This Paper With 14 Page And PDF Format Ready To Download

  • Certificate
  • من نویسنده این مقاله هستم

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این Paper:

شناسه ملی سند علمی:

JR_IJCCE-44-10_012

تاریخ نمایه سازی: 20 مهر 1404

Abstract:

The increasing need for sustainable dairy production has emphasized the importance of improving longevity in dairy cattle, which directly impacts farm profitability and herd efficiency. Traditional statistical models often fall short in capturing the complex, nonlinear factors influencing this trait, highlighting the potential of Machine Learning (ML) approaches. This study followed a retrospective observational design within a predictive regression modeling framework. This study evaluated the predictive performance of five models:  Gradient Boosting Machine (GBM), Random Forest (RF), Decision Tree (DT), Support Vector Machine (SVM), and a conventional Linear Model (LM) to estimate the longevity of Iranian Holstein dairy cattle. The dataset comprised ۴۲۲,۷۵۱ records from ۱۲,۸۰۷ sires across ۵,۴۱۷ herds, including production, reproductive, and geographic data over ten years. Longevity was defined as the interval from first calving to removal from the herd, and models were trained and tested using ۱۰-fold cross-validation under a regression framework. Among the tested models, SVM achieved the highest predictive performance with a coefficient of determination (R²) of ۰.۹۸۹ and the lowest Root Mean Squared Error (RMSE) of ۲.۵۱. RF and GBM also performed well (R² = ۰.۹۸۳ and ۰.۹۷۹, respectively). The most influential predictors across models were the number of calvings, average calving interval, and first calving year, highlighting the importance of reproductive history. While ML models showed superior predictive power, they also presented challenges such as higher computational demands and potential overfitting if not properly validated. The findings suggest that advanced ML models, particularly SVM and RF, offer valuable tools for improving decision-making and culling strategies in dairy herd management. Future research should validate these models on independent datasets and explore their integration into practical selection and monitoring systems.

Keywords:

Machine Learning (ML) , Dairy Cow Longevity , Iranian Holstein , Support Vector Machine (SVM) , Random forest (RF) , Gradient Boosting Machine (GBM) , predictive modeling

Authors

Ali Rezazadehvishkaei

Department of Animal Science, Ab. C., Islamic Azad University, Abhar, I.R. IRAN

Alireza Hasanibafarani

Department of Animal Science, Agricultural Research, Education and Extension Organization (AREEO), Agricultural Institute of Education and Extension (IATE), Tehran, I.R. IRAN

Kian Pahlevanafshari

Department of Animal Science, Va.P. C., Islamic Azad University, Varamin, I.R. IRAN

Aboozari Mehran

Department of Animal Science, Ab. C., Islamic Azad University, Abhar, I.R. IRAN