The Spotify data frame below is read from the file spotify_all.csv, which contains playlist information for Spotify users (Source: Spotify Million Playlist Dataset Challenge).
import pandas as pd
from google.colab import data_table
data_table.enable_dataframe_formatter()
spotify = pd.read_csv('https://bcdanl.github.io/data/spotify_all.csv')
spotify
Warning: total number of rows (198005) exceeds max_rows (20000). Falling back to pandas display.
| | pid | playlist_name | pos | artist_name | track_name | duration_ms | album_name |
|---|---|---|---|---|---|---|---|
| 0 | 0 | Throwbacks | 0 | Missy Elliott | Lose Control (feat. Ciara & Fat Man Scoop) | 226863 | The Cookbook |
| 1 | 0 | Throwbacks | 1 | Britney Spears | Toxic | 198800 | In The Zone |
| 2 | 0 | Throwbacks | 2 | Beyoncé | Crazy In Love | 235933 | Dangerously In Love (Alben für die Ewigkeit) |
| 3 | 0 | Throwbacks | 3 | Justin Timberlake | Rock Your Body | 267266 | Justified |
| 4 | 0 | Throwbacks | 4 | Shaggy | It Wasn't Me | 227600 | Hot Shot |
| ... | ... | ... | ... | ... | ... | ... | ... |
| 198000 | 999998 | ✝️ | 6 | Chris Tomlin | Waterfall | 209573 | Love Ran Red |
| 198001 | 999998 | ✝️ | 7 | Chris Tomlin | The Roar | 220106 | Love Ran Red |
| 198002 | 999998 | ✝️ | 8 | Crowder | Lift Your Head Weary Sinner (Chains) | 224666 | Neon Steeple |
| 198003 | 999998 | ✝️ | 9 | Chris Tomlin | We Fall Down | 280960 | How Great Is Our God: The Essential Collection |
| 198004 | 999998 | ✝️ | 10 | Caleb and Kelsey | 10,000 Reasons / What a Beautiful Name | 178189 | 10,000 Reasons / What a Beautiful Name |
198005 rows × 7 columns
Variable Description
pid: playlist ID; a unique identifier for each playlist
playlist_name: name of the playlist
pos: position of the track within the playlist (starting from 0)
artist_name: name of the track’s primary artist
track_name: name of the track
duration_ms: duration of the track in milliseconds
album_name: name of the track’s album
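To make the variable descriptions concrete, here is a minimal sketch that works on a handful of rows copied from the preview above rather than on the full file. The names `sample`, `duration_min`, and `per_playlist` are illustrative assumptions and do not appear in the original notebook.

```python
import pandas as pd

# A few rows copied from the preview above; the full data has 198,005 rows.
# `sample` is a hypothetical stand-in for the `spotify` data frame.
sample = pd.DataFrame({
    "pid": [0, 0, 0, 999998, 999998],
    "playlist_name": ["Throwbacks", "Throwbacks", "Throwbacks", "✝️", "✝️"],
    "pos": [0, 1, 2, 6, 7],
    "artist_name": ["Missy Elliott", "Britney Spears", "Beyoncé",
                    "Chris Tomlin", "Chris Tomlin"],
    "track_name": ["Lose Control (feat. Ciara & Fat Man Scoop)", "Toxic",
                   "Crazy In Love", "Waterfall", "The Roar"],
    "duration_ms": [226863, 198800, 235933, 209573, 220106],
    "album_name": ["The Cookbook", "In The Zone",
                   "Dangerously In Love (Alben für die Ewigkeit)",
                   "Love Ran Red", "Love Ran Red"],
})

# duration_ms is in milliseconds; 60,000 ms make one minute.
sample["duration_min"] = sample["duration_ms"] / 60_000

# pid identifies a playlist, so grouping by it gives one row per playlist:
# how many of its tracks are in the sample and the smallest pos among them.
per_playlist = (
    sample.groupby("pid")
    .agg(n_tracks=("track_name", "size"), first_pos=("pos", "min"))
    .reset_index()
)

print(sample[["track_name", "duration_ms", "duration_min"]])
print(per_playlist)
```

The same two steps should carry over to the full `spotify` data frame, where the smallest `pos` in each playlist would be expected to be 0 if every playlist is complete.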
Favorite Artist
My favorite artist in the Spotify data frame is Willie Nelson.
All of Willie Nelson’s songs in the Spotify data frame and their details are displayed below:
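One way to select an artist's tracks is a boolean filter on `artist_name`. The sketch below is an assumption about how such a filter could look, not necessarily the notebook's own code. The helper `tracks_by_artist` and the `sample` frame are hypothetical, and because the preview rows shown earlier contain no Willie Nelson tracks, the demonstration uses Chris Tomlin instead.

```python
import pandas as pd


def tracks_by_artist(df: pd.DataFrame, artist: str) -> pd.DataFrame:
    """Return the rows of df whose artist_name matches artist exactly."""
    return df[df["artist_name"] == artist]


# The last five rows of the preview shown earlier (hypothetical stand-in
# for the full `spotify` data frame).
sample = pd.DataFrame({
    "pid": [999998] * 5,
    "playlist_name": ["✝️"] * 5,
    "pos": [6, 7, 8, 9, 10],
    "artist_name": ["Chris Tomlin", "Chris Tomlin", "Crowder",
                    "Chris Tomlin", "Caleb and Kelsey"],
    "track_name": ["Waterfall", "The Roar",
                   "Lift Your Head Weary Sinner (Chains)", "We Fall Down",
                   "10,000 Reasons / What a Beautiful Name"],
    "duration_ms": [209573, 220106, 224666, 280960, 178189],
    "album_name": ["Love Ran Red", "Love Ran Red", "Neon Steeple",
                   "How Great Is Our God: The Essential Collection",
                   "10,000 Reasons / What a Beautiful Name"],
})

chris = tracks_by_artist(sample, "Chris Tomlin")
print(chris)

# On the full data frame, the analogous call would be:
# willie = tracks_by_artist(spotify, "Willie Nelson")
```

The comparison is an exact, case-sensitive match, so a track credited under a different spelling of the artist's name would not be returned.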