Data Visualization with R and ggplot

The dataset I worked with contains about 40K songs on Spotify, each described by 22 variables such as genre, length, and the indexes that the Spotify API uses to assess songs, such as danceability and liveness.

Link to Dataset

Plot 1: Pop music has a relatively narrow distribution of song lengths, with most songs hovering around 3-4 minutes. In contrast, Psytrance had the largest variance, with an extremely long outlier.
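
A comparison like this could be drawn roughly as follows. This is a minimal sketch on made-up data, not the original code: the column names `genre` and `duration_ms`, the toy values, and the choice of a box plot are all assumptions.

```r
library(ggplot2)

# Toy stand-in for the Spotify data; column names and values are assumptions.
set.seed(1)
songs <- data.frame(
  genre = rep(c("Pop", "Psytrance"), each = 50),
  duration_ms = c(rnorm(50, mean = 210000, sd = 20000),
                  rnorm(50, mean = 420000, sd = 120000))
)

# Convert milliseconds to minutes so the axis is easier to read.
songs$duration_min <- songs$duration_ms / 60000

# One box per genre; points beyond the whiskers are drawn as outliers.
p1 <- ggplot(songs, aes(x = genre, y = duration_min)) +
  geom_boxplot() +
  labs(x = "Genre", y = "Song length (minutes)")
```

Printing `p1` draws the plot. A box plot makes both the spread per genre and any extreme outliers visible at a glance.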


Plot 2: I chose to visualize the relative count of the genres in the dataset using a treemap instead of a more conventional option such as a pie chart. Pie charts are generally not the best way to communicate data, and the treemap's mapping of area to count was very intuitive and clear.
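
One way to build such a treemap in ggplot2 is with the treemapify package. That package is an assumption here, since the text does not say which one was used, and the genre counts below are made up for illustration (including the "Techno" label).

```r
library(ggplot2)
library(treemapify)  # assumed package; the source does not name one

# Toy data: one row per song, with a hypothetical `genre` column.
songs <- data.frame(
  genre = c(rep("Pop", 5), rep("Psytrance", 3), rep("Techno", 2))
)

# Count songs per genre; table() yields columns `genre` and `Freq`.
genre_counts <- as.data.frame(table(genre = songs$genre))

# Each tile's area is proportional to the number of songs in that genre.
p2 <- ggplot(genre_counts, aes(area = Freq, fill = genre, label = genre)) +
  geom_treemap() +
  geom_treemap_text(colour = "white", place = "centre") +
  theme(legend.position = "none")
```

The legend is dropped because the tiles are labeled directly, which keeps the area-to-count reading uncluttered.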


I chose the Courier font to match the Spotify Receipt-style theme.
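
A font choice like this can be applied once through a theme object and reused across plots. The sketch below is an assumption about how that might look, not the original theme; `receipt_theme` is a hypothetical name, and whether "Courier" resolves depends on the graphics device and the fonts installed on the system.

```r
library(ggplot2)

# Hypothetical reusable theme: monospaced text on a plain background.
receipt_theme <- theme_minimal(base_family = "Courier") +
  theme(
    panel.grid.minor = element_blank(),     # drop minor gridlines
    plot.title = element_text(face = "bold")
  )

# Toy example: add the theme to any ggplot object.
p <- ggplot(mtcars, aes(x = wt, y = mpg)) +
  geom_point() +
  labs(title = "RECEIPT") +
  receipt_theme
```

If the device cannot find "Courier", the generic family "mono" is a portable fallback.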

Plot 3: These graphs all have tempo on the x-axis, revealing correlations between tempo and loudness, danceability, speechiness, and energy.
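
A series of panels sharing one x-axis can be sketched with `facet_wrap()`. The code below uses random toy data, so it will not show the correlations described above, and the column names are assumed to match the variable names mentioned in the text.

```r
library(ggplot2)

# Toy data with assumed column names; values are random, not real songs.
set.seed(42)
n <- 100
songs <- data.frame(
  tempo = runif(n, 60, 200),
  loudness = runif(n, -30, 0),
  danceability = runif(n),
  speechiness = runif(n),
  energy = runif(n)
)

measures <- c("loudness", "danceability", "speechiness", "energy")

# Reshape to long format: one row per (song, measure) pair,
# so that each measure can get its own panel.
long <- data.frame(
  tempo = rep(songs$tempo, times = length(measures)),
  measure = rep(measures, each = n),
  value = unlist(songs[measures], use.names = FALSE)
)

# One scatter panel per measure, all with tempo on the x-axis.
p3 <- ggplot(long, aes(x = tempo, y = value)) +
  geom_point(alpha = 0.4) +
  facet_wrap(~ measure, scales = "free_y") +
  labs(x = "Tempo (BPM)", y = NULL)
```

Free y-scales are used because loudness is on a different scale from the other three measures.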