Cross-lingual sentiment analysis with BERT

Thesis title: Cross-lingual sentiment analysis with BERT
Author: Amini Riseh, Mohsen
Thesis type: Diploma thesis
Supervisor: Kliegr, Tomáš
Opponents: Vajdečka, Peter
Thesis language: English
Abstract:
This thesis focuses on the use of the BERT model in sentiment analysis, a task that encompasses a variety of related topics. As such, this work provides an overview of some of the most important associated topics and classical approaches in this field. It begins with a brief introduction to the task of sentiment analysis and its subtasks, including opinion source identification, followed by a discussion of the most common approaches in this domain. A review of related works follows, including an explanation of the most frequently used techniques in the field of opinion mining. The subsequent sections delve into the details of the implementation and the datasets and models employed in this thesis. This thesis rigorously evaluated the BERT model's efficacy in sentiment analysis, particularly in multilingual settings, while comparing it against classical machine learning models using English and Farsi datasets. Findings revealed BERT's notable performance even with translated data, suggesting its potential in multilingual sentiment tasks where labeled data may be limited.
Keywords: Sentiment Analysis; Transformer Models; BERT; Opinion Mining
Thesis title: Cross-lingual sentiment analysis with BERT
Author: Amini Riseh, Mohsen
Thesis type: Diplomová práce
Supervisor: Kliegr, Tomáš
Opponents: Vajdečka, Peter
Thesis language: English
Abstract:
This thesis focuses on the use of the BERT model in sentiment analysis, a task that encompasses a variety of related topics. As such, this work provides an overview of some of the most important associated topics and classical approaches in this field. It begins with a brief introduction to the task of sentiment analysis and its subtasks, including opinion source identification, followed by a discussion of the most common approaches in this domain. A review of related works follows, including an explanation of the most frequently used techniques in the field of opinion mining. The subsequent sections delve into the details of the implementation and the datasets and models employed in this thesis. This thesis rigorously evaluated the BERT model's efficacy in sentiment analysis, particularly in multilingual settings, while comparing it against classical machine learning models using English and Farsi datasets. Findings revealed BERT's notable performance even with translated data, suggesting its potential in multilingual sentiment tasks where labeled data may be limited.
Keywords: BERT; Sentiment Analysis; Opinion Mining; Transformer Models

Information about study

Study programme: Information Systems Management
Type of study programme: Magisterský studijní program
Assigned degree: Ing.
Institutions assigning academic degree: Vysoká škola ekonomická v Praze
Faculty: Faculty of Informatics and Statistics
Department: Department of Information and Knowledge Engineering

Information on submission and defense

Date of assignment: 22. 10. 2021
Date of submission: 3. 12. 2023
Date of defense: 2023

Files for download

The files will be available after the defense of the thesis.

    Last update: