Analysis of Spotify Song Popularity Based on Audio Features Using Random Forest

Anggi Beauty Rahmaputri, Deswita Nindya Putri, Nia Putri Rahmadani, Oleh Soleh

Abstract


The rapid growth of digital music streaming platforms such as Spotify has significantly increased competition among songs, making popularity an important yet difficult aspect to predict. Understanding the factors that influence song popularity is essential for musicians, producers, and digital platforms in developing effective promotion strategies and recommendation systems. This study aims to analyze the relationship between Spotify audio features and song popularity using a data science approach. The dataset used in this study consists of songs described by various audio features, including danceability, energy, loudness, tempo, acousticness, instrumentalness, valence, and track duration, with popularity serving as the target variable. An exploratory data analysis (EDA) was conducted to examine the distribution of popular and non-popular songs, analyze correlations among audio features, and visualize the relationships between selected audio features and popularity. The results show that the dataset is highly imbalanced, with non-popular songs dominating the overall distribution. Correlation analysis indicates strong relationships between certain audio features, particularly between energy and loudness, while the linear correlation between individual audio features and popularity is relatively weak. Scatter plot visualizations suggest that popular songs tend to have higher levels of danceability, energy, and loudness compared to non-popular songs. However, no single feature can adequately explain popularity on its own, suggesting that a combination of multiple audio characteristics influences song popularity. This research provides an initial insight into the relationship between Spotify audio features and song popularity and serves as a foundation for future studies applying machine learning models, such as Random Forest, for popularity prediction.

Keywords


Spotify data; Random Forest; Data Science

Full Text:

PDF

References


Ananda, R., Putra, D. A., & Wijayanto, R. (2022). Effect of normalization of genre music data on classification performance with random forest. Journal of Data Science and Software Engineering, 1(1), 45–52.

Demir, A., Yildiz, B., & Kaya, M. (2024). Optimizing the sentiment recognition in Spotify playlists through ensemble-based approaches. Acta Infologica, 8(1), 1–15.

Hidayat, F., & Prakoso, B. S. (2023). Penerapan algoritma random forest dalam prediksi emosi musik berdasarkan karakteristik fitur audio Spotify. Jurnal Sains Informatika Terapan, 6(2), 55–64.

Nugroho, A., Santoso, E., & Wibowo, A. T. (2023). Analisis dan klasifikasi genre musik menggunakan algoritma STFT dan random forest. Jurnal Nasional Teknologi Informasi dan Aplikasinya, 7(1), 12–21.

Pratama, A. R., & Suryani, D. (2022). Implementation of supervised learning algorithm on Spotify music genre classification. Indonesian Journal of Applied Technology and Innovation Science, 4(2), 89–

Putri, S. A., Ramadhan, M. R., & Kurniawan,

H. (2024). Music genre classification using random forest model. JELIKU (Jurnal Elektronik Ilmu Komputer Udayana), 11(1), 33–41.

Rahmawati, N., Fauzan, M., & Lestari, D. (2023). Penerapan machine learning untuk mengklasifikasikan genre musik berdasarkan fitur audio. Arus Jurnal Sains dan Teknologi, 5(3),101–109.

Sari, P. D., Nugraha, R. F., & Saputra, I. M. (2023). Klasifikasi genre musik berdasarkan fitur mel frequency spectral coefficient menggunakan random forest. CORISINDO 2025 Proceedings, 1(1), 210–218.

Wijaya, R., Hartono, A., & Prameswari, L. (2022). The comparison of logistic regression methods and random forest for Spotify audio mode feature classification. Indonesian Journal of Data and Science, 3(1), 22–30.

Yuliana, S., Maulana, A., & Fitriani, R. (2023). Model rekomendasi lagu berbasis genre menggunakan metode random forest dan decision tree. Journal of Informatics and Computer Science (JINACS), 4(2), 77–86.




DOI: http://dx.doi.org/10.22441/collabits.v3i1.37647

Refbacks

  • There are currently no refbacks.


Journal Collabits
Portal ISSNPrint ISSN: 3062-8601
Online ISSN: 3046-6709

Sekretariat
Fakultas Ilmu Komputer
Universitas Mercu Buana
Jl. Raya Meruya Selatan, Kembangan, Jakarta 11650
Tlp./Fax: +62215871335

http://publikasi.mercubuana.ac.id/index.php/collabits

e-mail: [email protected]

Creative Commons Licence
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.