ANALISIS DAN PERBANDINGAN STOPWORD TERHADAP AKURASI ANALISIS SENTIMEN TEKS DENGAN MENGGUNAKAN TF-IDF STUDI KASUS NLP
Abstrak
In the rapidly evolving digital era, the amount of online text data has significantly increased, encompassing product reviews, social media comments, and news articles. Sentiment analysis is crucial for understanding public opinion. This research aims to develop a more relevant stopword list using the TF-IDF algorithm to enhance text representation in sentiment analysis. Additionally, it evaluates and compares the impact of using stopwords generated by the TF-IDF algorithm on the accuracy of sentiment analysis models, compared to using Sastrawi stopwords. The results show that TF-IDF helps identify less important words, but Sastrawi stopwords are better at recognizing context. Evaluation with different data split ratios (90:10, 80:20, 70:30) showed the highest accuracy of 0.789 at the 80:20 ratio, although there is room for improvement. This study is expected to improve the performance of sentiment analysis models with a more suitable stopword list.
Unduhan
Diterbitkan
Versi
- 2025-08-04 (4)
- 2025-08-04 (3)
- 2025-08-04 (2)
- 2025-08-04 (1)
Cara Mengutip
Terbitan
Bagian
Lisensi
Hak Cipta (c) 2025 Damai Arsila Salsabila, Fahrim Irhamna Rachman, Titin Wahyuni

Artikel ini berlisensi Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Each article is copyrighted © by its author(s) and is published under license from the author(s).
When a paper is accepted for publication, authors will be requested to agree with the Creative Commons Attribution-Noncommercial-No Derivative Works 4.0 Netherlands License.