• norsk
    • English
  • English 
    • norsk
    • English
  • Login
View Item 
  •   Home
  • Faculty of Social Sciences
  • Department of Economics
  • Master theses
  • View Item
  •   Home
  • Faculty of Social Sciences
  • Department of Economics
  • Master theses
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

Machine learning vs logistic regression in credit scoring: A trade-off between accuracy and interpretability?

Hovdenakk, Arne Hesjedal
Master thesis
Thumbnail
View/Open
master thesis (974.1Kb)
URI
https://hdl.handle.net/11250/2762661
Date
2021-06-15
Metadata
Show full item record
Collections
  • Master theses [58]
Abstract
In this thesis, I compare logistic regression to the machine learning models k-nearest neighbor, decision trees, random forest, and gradient booster by creating different credit models. By using data from an anonymous Norwegian bank for consumer loan borrowers, I compare the models when continuous variables are split into intervals by using weight of evidence, and when they are kept in their raw form. By using Area under Receiver Operating Characteristic (AUROC) and Brier score as performance measures, I find that logistic regression and gradient booster are the most accurate models for this dataset, and logistic regression is recommended because of its interpretability.
Publisher
The University of Bergen
Copyright
Copyright the Author. All rights reserved

Contact Us | Send Feedback

Privacy policy
DSpace software copyright © 2002-2019  DuraSpace

Service from  Unit
 

 

Browse

ArchiveCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsDocument TypesJournalsThis CollectionBy Issue DateAuthorsTitlesSubjectsDocument TypesJournals

My Account

Login

Statistics

View Usage Statistics

Contact Us | Send Feedback

Privacy policy
DSpace software copyright © 2002-2019  DuraSpace

Service from  Unit