Apr 28, 2026arXiv:2604.25452

Benchmarking Logistic Regression, SVM, and LightGBM Against BiLSTM with Attention for Sentiment Analysis on Indonesian Product Reviews

Razi Hamdi, Ivana Margareth Hutabarat, Hanna Gresia Sinaga, Luluk Muthoharoh, Ardika Satria, Martin Clinton Tosima Manullang

AI Summary

This paper benchmarks Logistic Regression, SVM, and LightGBM against a BiLSTM with Attention for binary sentiment classification on a balanced dataset of ~20k Indonesian product reviews. Using 10-fold stratified cross-validation, Logistic Regression achieved the best ML performance (97.26% accuracy and F1-score), marginally outperforming the BiLSTM with Attention model (97.24% accuracy and F1-score) on a held-out test set. The results suggest that traditional ML algorithms can achieve comparable performance to more complex DL architectures on high-dimensional datasets with proper preprocessing and feature extraction, while offering greater computational efficiency.

Key Contribution

Simple models still win: Logistic Regression rivals BiLSTMs with attention for Indonesian sentiment analysis, despite the latter's architectural complexity.

Abstract

Sentiment analysis of product reviews on e-commerce platforms plays a critical role in automatically understanding customer satisfaction and providing actionable insights for sellers seeking to improve product quality. This paper presents a comprehensive benchmarking study comparing a Machine Learning (ML) approach via the PyCaret AutoML framework against a Deep Learning (DL) approach based on a Bidirectional Long Short-Term Memory (BiLSTM) architecture with an Attention mechanism for binary sentiment classification on Indonesian product reviews. The dataset comprises 19,728 samples balanced equally between positive and negative reviews. For the ML approach, three prominent algorithms were evaluated via 10-fold stratified cross-validation: Logistic Regression (LR), Support Vector Machine (SVM) with a linear kernel, and Light Gradient Boosting Machine (LightGBM). Logistic Regression achieved the best ML performance with an accuracy of 97.26\% and an F1-score of 97.26\%. The BiLSTM with Attention model, evaluated on 3,946 held-out test samples, achieved an accuracy of 97.24\% and an F1-score of 97.24\%. These comparative results demonstrate that traditional ML algorithms with proper preprocessing and feature extraction can compete closely with, and even marginally outperform, more complex sequential DL architectures on high-dimensional datasets, while simultaneously offering greater computational efficiency.

Architecture Design (Transformers, SSMs, MoE)Eval Frameworks & Benchmarks Natural Language Processing

Citation Metrics

Citations0

Influential citations0

References9

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Benchmarking Logistic Regression, SVM, and LightGBM Against BiLSTM with Attention for Sentiment Analysis on Indonesian Product Reviews

Related Papers