Predicting Credit Risk in Peer-to-Peer Lending: A Machine Learning Approach with Few Features

Yu Chieh Cheng, Hui Ting Chang, Chia Yu Lin, Heng Yu Chang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

Peer-to-peer (P2P) lending provides borrowers with relatively low borrowing interest rates and gives lenders a channel for investment on an online platform. Since most P2P lending does not require any guarantees, the overdue payment of borrowers results in a massive loss of lending platforms and lenders. Many risk prediction models are proposed to predict credit risk. However, these works build models with more than 50 features, which causes a lot of computation time. Besides, in most P2P lending datasets, the number of non-default data far exceeds the number of default data. These researches ignore the data imbalance issue, leading to inaccurate predictions. Therefore, this study proposes a credit risk prediction system (CRPS) for P2P lending to solve data imbalance issues and only require few features to build the models. We implement a data preprocessing module, a feature selection module, a data synthesis module, and five risk prediction models in CRPS. In experiments, we evaluate CRPS based on the de-identified personal loan dataset of the LendingClub platform. The accuracy of the CRPS can achieve 99%, the recall reaches 0.95, and the F1-Score is 0.97. CRPS can accurately predict credit risk with less than 10 features and tackle data imbalance issues.

Original languageEnglish
Title of host publicationProceedings - 2021 International Conference on Technologies and Applications of Artificial Intelligence, TAAI 2021
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages295-300
Number of pages6
ISBN (Electronic)9781665408257
DOIs
StatePublished - 2021
Event26th International Conference on Technologies and Applications of Artificial Intelligence, TAAI 2021 - Taichung, Taiwan
Duration: 18 11 202120 11 2021

Publication series

NameProceedings - 2021 International Conference on Technologies and Applications of Artificial Intelligence, TAAI 2021

Conference

Conference26th International Conference on Technologies and Applications of Artificial Intelligence, TAAI 2021
Country/TerritoryTaiwan
CityTaichung
Period18/11/2120/11/21

Bibliographical note

Publisher Copyright:
© 2021 IEEE.

Keywords

  • Peer-to-peer lending
  • RFECV
  • XGBoost
  • borderline-SMOTE
  • credit risk prediction
  • data synthesis

Fingerprint

Dive into the research topics of 'Predicting Credit Risk in Peer-to-Peer Lending: A Machine Learning Approach with Few Features'. Together they form a unique fingerprint.

Cite this