Country-level datasets for machine learning

Thesis title: Country-level datasets for machine learning
Author: Shih, Chien-Yu
Thesis type: Diploma thesis
Supervisor: Kliegr, Tomáš
Opponents: Chudán, David
Thesis language: English
Abstract:
This thesis explores the landscape of datasets and datasets repositories available for machine learning applications at the country level, focused on economic indicators and demographic information. The thesis focuses on examining and defining various criteria for evaluating datasets, including update frequency, data licensing, community engagement, and documentation completeness. The research encompasses a comparative analysis of datasets and dataset repositories, considering factors such as the number of countries covered, organization type, and the frequency of updates. Additionally, it emphasizes the importance of data quality assessment.
Keywords: Machine Learning; Data Quality; Data Analysis
Thesis title: Country-level datasets for machine learning
Author: Shih, Chien-Yu
Thesis type: Diplomová práce
Supervisor: Kliegr, Tomáš
Opponents: Chudán, David
Thesis language: English
Abstract:
This thesis explores the landscape of datasets and datasets repositories available for machine learning applications at the country level, focused on economic indicators and demographic information. The thesis focuses on examining and defining various criteria for evaluating datasets, including update frequency, data licensing, community engagement, and documentation completeness. The research encompasses a comparative analysis of datasets and dataset repositories, considering factors such as the number of countries covered, organization type, and the frequency of updates. Additionally, it emphasizes the importance of data quality assessment.
Keywords: Data Analysis; Machine Learning; Data Quality

Information about study

Study programme: Information Systems Management
Type of study programme: Magisterský studijní program
Assigned degree: Ing.
Institutions assigning academic degree: Vysoká škola ekonomická v Praze
Faculty: Faculty of Informatics and Statistics
Department: Department of Information and Knowledge Engineering

Information on submission and defense

Date of assignment: 19. 1. 2023
Date of submission: 24. 6. 2024
Date of defense: 28. 8. 2024
Identifier in the InSIS system: https://insis.vse.cz/zp/83373/podrobnosti

Files for download

    Last update: