Risk Classification of Imbalanced Data for Car Insurance Companies: Machine Learning Approaches

This paper presents a mechanism for insurance companies to assess the most effective features to classify the risk of their customers for third party liability (TPL) car insurance. Basically, the process of underwriting is carried out based on the expert experiences and the industry suffers from lack of a systematic method to categorize their policyholders with respect to the risk level. We analyzed 13,388 observations of an insurance claim dataset from body injury reports provided by an Iranian insurance company. The main challenge is the imbalanced dataset. Here we employ logistic regression and random forest with different resampling of the original data in order to increase the performance of models. Results indicate that the random forest with the hybrid resampling methods is the best classifier and furthermore, victim age, premium, car age and insured age are the most important factors for claims prediction.

Risk Classification of Imbalanced Data for Car Insurance Companies: Machine Learning Approaches Keywords:

Machine Learning , supervised Learning , Imbalanced Data , Claim Risk , Classification

Risk Classification of Imbalanced Data for Car Insurance Companies: Machine Learning Approaches authors

Farzan Khamesian

Insurance Research Center, Tehran, Iran

Maryam Esna-Ashari

Insurance Research Center, Tehran, Iran

Eric Dei Ofosu-Hene

Department of Accounting and Finance, Faculty of Business and Law, De Montfort University, Leicester, UK

Farbod Khanizadeh

Insurance Research Center, Tehran, Iran

Certificate
I'm the author of the paper

این Paper در بخشهای موضوعی زیر دسته بندی شده است:

هوش مصنوعی > یادگیری ماشین

Export:

Link to this Paper:

https://civilica.com/doc/1628665

Document National Code:

JR_IJMAC-12-3_001

Index date: 11 April 2023

How to cite:

If you want to refer to this Paper in your research work, you can simply use the following phrase in the references section:

Khamesian, Farzan and Esna-Ashari, Maryam and Dei Ofosu-Hene, Eric and Khanizadeh, Farbod,1401,Risk Classification of Imbalanced Data for Car Insurance Companies: Machine Learning Approaches,https://civilica.com/doc/1628665

در داخل متن نیز هر جا که به عبارت و یا دستاوردی از این Paper اشاره شود پس از ذکر مطلب، در داخل پارانتز، مشخصات زیر نوشته می شود.
برای بار اول: (1401, Khamesian, Farzan؛ Maryam Esna-Ashari and Eric Dei Ofosu-Hene and Farbod Khanizadeh)
برای بار دوم به بعد: (1401, Khamesian؛ Esna-Ashari and Dei Ofosu-Hene and Khanizadeh)
برای آشنایی کامل با نحوه مرجع نویسی لطفا بخش راهنمای سیویلیکا (مرجع دهی) را ملاحظه نمایید.

Scientometrics

The specifications of the publisher center of this Paper are as follows:

Ranking of Insurance Research Center

Type of center: پژوهشگاه دولتی

Paper count: 271

In the scientometrics section of CIVILICA, you can see the scientific ranking of the Iranian academic and research centers based on the statistics of indexed articles.