Regression analysis of the relationship between COVID-19 pandemic and the change in unemployment in EU27 countries

Název práce: Regression analysis of the relationship between COVID-19 pandemic and the change in unemployment in EU27 countries
Autor(ka) práce: Mammadzada, Vahid
Typ práce: Diploma thesis
Vedoucí práce: Helman, Karel
Oponenti práce: Šimpach, Ondřej
Jazyk práce: English
Abstrakt:
This thesis aims at investigating the statistical relationships between the change in unemployment from 2019 to 2020 and COVID-19 severity across the EU27 countries using regression analysis. It thereby takes into account the other factors that may have influenced the change in unemployment, such as government expenditure on health, the EU-issued SURE package, and the stringency index. The theoretical part of the thesis explains in detail the concepts behind the methodology implemented in the thesis, such as linear regression analysis, least squares estimations, LASSO regression, principal component analysis, and inferential regression analysis. In the empirical part, the variables used in the regression models are analyzed, and the validity of the classical linear regression assumptions is assessed. Then, using a descriptive approach, the partial and paired linear relationships between the variables are investigated through the analysis of the outputs of the built regression models. The inferential analysis revolves around the randomized permutation of statistical correlation tests. The predictive capabilities of the regression models were tested by the implementation of various cross-validation techniques. The thesis concludes with a discussion of the results derived from the research and attempts to answer the questions raised in the section of the research hypothesis. It found that most of the statistical relationships between the response variable and the explanatory variables satisfy the initial expectations, except the main one. The results generated by the three regression models suggest that the partial linear relationship between the change in unemployment and excess mortality is negative, holding the rest of the variables fixed.
Klíčová slova: ordinary least squares; principal component analysis; randomized permutation testing; COVID-19; unemployment; mortality; cross-validation; Linear regression; LASSO regression
Název práce: Regression analysis of the relationship between COVID-19 pandemic and the change in unemployment in EU27 countries
Autor(ka) práce: Mammadzada, Vahid
Typ práce: Diplomová práce
Vedoucí práce: Helman, Karel
Oponenti práce: Šimpach, Ondřej
Jazyk práce: English
Abstrakt:
This thesis aims at investigating the statistical relationships between the change in unemployment from 2019 to 2020 and COVID-19 severity across the EU27 countries using regression analysis. It thereby takes into account the other factors that may have influenced the change in unemployment, such as government expenditure on health, the EU-issued SURE package, and the stringency index. The theoretical part of the thesis explains in detail the concepts behind the methodology implemented in the thesis, such as linear regression analysis, least squares estimations, LASSO regression, principal component analysis, and inferential regression analysis. In the empirical part, the variables used in the regression models are analyzed, and the validity of the classical linear regression assumptions is assessed. Then, using a descriptive approach, the partial and paired linear relationships between the variables are investigated through the analysis of the outputs of the built regression models. The inferential analysis revolves around the randomized permutation of statistical correlation tests. The predictive capabilities of the regression models were tested by the implementation of various cross-validation techniques. The thesis concludes with a discussion of the results derived from the research and attempts to answer the questions raised in the section of the research hypothesis. It found that most of the statistical relationships between the response variable and the explanatory variables satisfy the initial expectations, except the main one. The results generated by the three regression models suggest that the partial linear relationship between the change in unemployment and excess mortality is negative, holding the rest of the variables fixed.
Klíčová slova: Linear regression; ordinary least squares; LASSO regression; principal component analysis; randomized permutation testing; COVID-19; unemployment; mortality; cross-validation

Informace o studiu

Studijní program / obor: Economic Data Analysis/Data Analysis and Modeling
Typ studijního programu: Magisterský studijní program
Přidělovaná hodnost: Ing.
Instituce přidělující hodnost: Vysoká škola ekonomická v Praze
Fakulta: Fakulta informatiky a statistiky
Katedra: Katedra statistiky a pravděpodobnosti

Informace o odevzdání a obhajobě

Datum zadání práce: 7. 11. 2022
Datum podání práce: 1. 5. 2023
Datum obhajoby: 5. 6. 2023
Identifikátor v systému InSIS: https://insis.vse.cz/zp/82698/podrobnosti

Soubory ke stažení

    Poslední aktualizace: