This project focuses on sentiment analysis of video game reviews, using natural language processing to classify reviews as positive or negative. Despite a highly imbalanced dataset (80% positive reviews), the project reached 92% accuracy through data augmentation and parameter-efficient model optimization, up from an 82% classifier-only BERT baseline.
- Source: Steam Games Reviews
- Nature: Highly imbalanced dataset (80% positive reviews)
- Used NLPaug for contextual data augmentation.
- Generated synthetic minority-class (negative) samples with RoBERTa to improve class balance.
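A minimal sketch of the augmentation step, assuming NLPaug's contextual word-substitution augmenter with a `roberta-base` checkpoint (the exact checkpoint, `aug_p`, and the example review are illustrative assumptions, not taken from the project code):

```python
import nlpaug.augmenter.word as naw

# Contextual word-substitution augmenter backed by RoBERTa: nlpaug masks
# words and lets the language model propose in-context replacements,
# yielding paraphrase-like variants of minority-class reviews.
aug = naw.ContextualWordEmbsAug(
    model_path="roberta-base",  # assumed checkpoint
    action="substitute",        # replace words rather than insert new ones
    aug_p=0.15,                 # assumed fraction of words to perturb
)

negative_review = "The controls are clunky and the story is forgettable."
variants = aug.augment(negative_review, n=3)  # 3 synthetic variants
for variant in variants:
    print(variant)
```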
- LSTM (4 layers)
  - Accuracy: 86%
- Bi-LSTM (3 layers)
  - Accuracy: 85% (see the sketch after this list)
- BERT Sequence Classifier
  - Trained the classifier layers only: 82% accuracy
- BERT Sequence Classifier with LoRA
  - Trained the classifier layers plus LoRA (Low-Rank Adaptation) adapter layers under 4-bit quantization: 92% accuracy
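For reference, a minimal PyTorch sketch of the Bi-LSTM classifier above; the layer count matches the list, while the vocabulary size, embedding size, hidden size, and dropout are illustrative assumptions:

```python
import torch
import torch.nn as nn

class BiLSTMClassifier(nn.Module):
    """3-layer bidirectional LSTM for binary sentiment classification."""

    def __init__(self, vocab_size=30000, embed_dim=128, hidden_dim=256):
        super().__init__()
        self.embedding = nn.Embedding(vocab_size, embed_dim, padding_idx=0)
        self.lstm = nn.LSTM(
            embed_dim, hidden_dim,
            num_layers=3, bidirectional=True,
            batch_first=True, dropout=0.3,
        )
        # Bidirectional: forward and backward final states are concatenated.
        self.classifier = nn.Linear(2 * hidden_dim, 2)

    def forward(self, token_ids):
        embedded = self.embedding(token_ids)   # (batch, seq_len, embed_dim)
        _, (hidden, _) = self.lstm(embedded)   # hidden: (layers * 2, batch, hidden_dim)
        # Take the last layer's forward and backward hidden states.
        final = torch.cat([hidden[-2], hidden[-1]], dim=1)
        return self.classifier(final)          # (batch, 2) logits
```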
- Data Augmentation: Improved class balance with contextual augmentation using RoBERTa.
- Progressive Model Development:
- Transitioned from basic LSTM models to transformer-based architectures.
- Implemented LoRA for parameter-efficient fine-tuning (a configuration sketch follows this list).
- Optimized performance and resource utilization using 4-bit quantization.
- Achieved a 10-point accuracy gain (82% to 92%) over the classifier-only BERT baseline with the LoRA-adapted approach.
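A hedged sketch of the LoRA plus 4-bit setup using Hugging Face `peft` and `bitsandbytes` (both must be installed separately; the rank, alpha, dropout, and target modules below are assumptions, not the project's actual hyperparameters):

```python
import torch
from transformers import AutoModelForSequenceClassification, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

# Load BERT with its base weights quantized to 4 bits (NF4) to cut memory use.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)
model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-uncased",
    num_labels=2,
    quantization_config=bnb_config,
    device_map="auto",
)
model = prepare_model_for_kbit_training(model)

# LoRA: freeze the quantized base weights and train only small low-rank
# adapter matrices injected into the attention projections.
lora_config = LoraConfig(
    task_type="SEQ_CLS",
    r=8,                                # assumed adapter rank
    lora_alpha=16,                      # assumed scaling factor
    lora_dropout=0.1,
    target_modules=["query", "value"],  # BERT attention projection modules
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # only adapters + classifier head train
```

With this setup, only a small fraction of the model's parameters receive gradient updates, which is what makes fine-tuning feasible under the 4-bit memory footprint.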
- Python 3.8+
- PyTorch 1.12+
- Hugging Face Transformers
- NLPaug
- RoBERTa Pretrained Model
Install dependencies using:

```bash
pip install torch transformers nlpaug
```
| Model | Accuracy |
|---|---|
| LSTM (4 layers) | 86% |
| Bi-LSTM (3 layers) | 85% |
| BERT Sequence Classifier | 82% |
| BERT + LoRA + 4-bit Quantization | 92% |
- Explore other transformer architectures like DeBERTa or DistilBERT.
- Fine-tune models on a broader set of game reviews from other platforms.
- Implement more robust augmentation strategies.
- Hugging Face for the Transformers library.
- Steam community for the dataset.
- NLPaug library for augmentation techniques.