Scoring Models of Economic Agents

Česky
English

Thesis title:	Sorcingové modely ekonomických agentů
Author:	Řičař, Michal
Thesis type:	Disertační práce
Supervisor:	Hušek, Roman
Opponents:	Kodera, Jan; Kukal, Jaromír
Thesis language:	Česky
Abstract:	Scoringové modely slouží ke kvantifikaci pravděpodobnosti nastání určité údálosti, zpravidla negativní. Tím mohou bý použity i k hodnocení ekonomických agentů, neboť vyčíslená pravděpobnost umožňuje posoudit, kam lze daný subjekt zařadit do jaké skupiny či segementu. Modeling scoringu je v současné době rozsáhlý obor, v němž se využívají rozličné optimalizační techniky za využití statistical machine learning. Jelikož je problematika scoringu komplexní s přesahy do více polí současně, je vhodné se podrobně věnovat souvstažnostem a důsledkům jednotlivých přístupů, metod a technik odhadu a optimalizace. Jelikož současná literatura tento vhled nenabízí, je tato problematika o to významnější. Dalším problematickým aspektem současné scoringové teoretické roviny je také to, že nenabízí souhrnný ekonometrický test k verifikaci stability a výkonnosti. Výzkumná práce tak cílí na tato klíčová místa a v kontextu vysvětluje souvislosti metod za využití rozsáhlých datových vzorků k modelingovým ilustracím. Zdůvodnění a odvození logistické regrese za transformace Weight of Evidence, rozhodovacích stromů s boostingem či random forests anebo neuronové sítě jsou zásadními pilíři výzkumu. Odvozený algoritmus v podobě Testu relevance výkonnosti za využití statistical machine learning čerpá z uvedených skutečností a lze jej považovat za překryv několika technik, data mining, segmentace či optimaliazce odhadu. Důraz je kladen nejen na schopnost separace, rozlišení jednotlivých agentů, také však na jejich správné ohodnocení, tehdy do modelingu vstupuje další vrstva v podobě makroekonomické komponenty. Na základě další vrstvy lze modeling kvalitativně posunout do roviny zvýšené flexibility i kvality výsledků za současné reflexe ekonomické kondice dané ekonomiky či portfolia, na němž je scoringový model odhadnut. Výzkum je zakončen aplikační rovinou na rozsáhlých datech a rozkrývá jednotlivé možnosti technik v jejich následnostech a souvislostech.
Keywords:	overfitting; big datagistická regrese; weight of evidence; lologistická regrese; Scoring; machine learning

Thesis title:	Scoring Models of Economic Agents
Author:	Řičař, Michal
Thesis type:	Dissertation thesis
Supervisor:	Hušek, Roman
Opponents:	Kodera, Jan; Kukal, Jaromír
Thesis language:	Česky
Abstract:	Scoring models can be used to quantify the probability of an event, e.g. default. In this sense it can be used to evaluate economic agents quantified probability allows to assess where a particular subject can be assigned to which segment or group. Nowadays the scoring modeling is widely developed field where a wide spectrum of optimization techniques is being used in the context of statistical machine learning. Whereas the problematic of the scoring is complex with overlaps to many fields, it is necessary to examine this topic with connections and consequences of particular approaches, methods, and techniques of optimizations. However current literature doesn´t provide sufficient insight thus this topic is even more important. The another problem of the current theoretical background of scoring modeling is that it doesn´t provide any summarizing econometrical test for verification of stability and performance. This research thus focuses on these topics and in context explains connections between methods with an empirical application on big data to illustrate modeling practice. Reasoning and derivation of the logistic regression with Weigh of Evidence transformation, decision trees with boosting or random forests or neural networks are the pillars of the research. The Test of Performance Relevance is a derived algorithm with statistical machine learning usage by the author, which is based on the concept of the research and can be seen as an overlap between techniques data mining, segmentation, and optimization. In the scoring practice, the crucial point is not only separation ability of a model, the distinguishing power among economic agents, however, the calibration correctness is important too. Thus next layer of the modeling is a macroeconomic component which provides better quality and flexibility of modeling results and at the same moment, it reflects the current economic condition of particular economy or portfolio. The research is finishing with an empirical application on big data and shows connections and options of particular techniques.
Keywords:	machine learning; overfitting; data mining; big data; Scoring; lologistická regrese

Information about study

Study programme:	Kvantitativní metody v ekonomice/Ekonometrie a operační výzkum
Type of study programme:	Doktorský studijní program
Assigned degree:	Ph.D.
Institutions assigning academic degree:	Vysoká škola ekonomická v Praze
Faculty:	Faculty of Informatics and Statistics
Department:	Department of Econometrics

Information on submission and defense

Date of assignment:	9. 6. 2017
Date of submission:	24. 10. 2017
Date of defense:	25. 10. 2017
Identifier in the InSIS system:	https://insis.vse.cz/zp/62369/podrobnosti

Files for download

Main text
62369_xricm07.pdf, 8.2 MB Download

Opponent's review
54394_kodera.pdf, 417.2 kB Download

Opponent's review
54395_Kukal.pdf, 183.1 kB Download

Supervisor's review
62369_husek.pdf, 230.9 kB Download