Enhancing Indonesian customer complaint analysis: LDA topic modelling with BERT embeddings

Mutiara Auliya Khadija, Wahyu Nurharjadmo

Abstract


Social media data can be mining for recommended systems to know the best trends or patterns. The customers have the freedom to ask questions about the product, tell their demands, and convey their complaints through social media. By mining social media data, companies can gain valuable insights into customer preferences, opinions, and sentiments. This information can be utilized to improve products and services, tailor marketing strategies, and enhance overall customer satisfaction. Topic modelling is a text mining technique that extracts the content from the raw and unlabelled data. Latent Dirichlet Allocation is popular for topic modelling research cause flexible and adaptive. But that method has issues with sparsity, performs poorly when documented in the short text and there is no correlation between topics that are actually important in text data. BERT is Bidirectional Encoder Representations from Transformer is designed to pre-train deep bidirectional representations from unlabelled text. The result of this research proves that Latent Dirichlet Allocation and BERT can be arranged on the topic of Indonesian customer complaints. BERT-Base Multilingual Cased and LDA have the highest coherence score. The combination of BERT-Base Multilingual Uncased and LDA has the highest silhouette score. BERT Multilingual are potential for improving the LDA method for Indonesian customer complaints topic modelling.


Keywords


BERT embeddings; Enhancing analysis; Indonesian customer complaints; Latent Dirichlet Allocation; Topic modelling;

Full Text:

PDF


DOI: http://dx.doi.org/10.22441/sinergi.2024.1.015

Refbacks

  • There are currently no refbacks.


SINERGI
Published by:
Fakultas Teknik Universitas Mercu Buana
Jl. Raya Meruya Selatan, Kembangan, Jakarta 11650
Tlp./Fax: +62215871335
p-ISSN: 1410-2331
e-ISSN: 2460-1217
Journal URL: http://publikasi.mercubuana.ac.id/index.php/sinergi
Journal DOI: 10.22441/sinergi

Creative Commons License

Journal by SINERGI is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License

Web
Analytics Made Easy - StatCounter
View My Stats

The Journal is Indexed and Journal List Title by: