Data describing every song played by Bruce Springsteen, both solo and
with numerous bands, from 1973 to the present day. Can be joined with
setlists using song_key.
Format
A data frame with 4 variables:
- song_key
Primary key of the data frame.
- title
Title of the song.
- lyrics
Lyrics of the song if available in the database.
- album
Name of the album on which the song appears if available in the database.
Examples
library(dplyr)
# What are the most common albums?
songs %>%
  filter(!is.na(album)) %>%
  count(album, sort = TRUE)
#> # A tibble: 97 × 2
#>    album                                      n
#>    <chr>                                  <int>
#>  1 Tracks                                    58
#>  2 The Promise                               21
#>  3 The River                                 20
#>  4 The Rising                                15
#>  5 Human Touch                               14
#>  6 We Shall Overcome: The Seeger Sessions    14
#>  7 Working On A Dream                        14
#>  8 Wrecking Ball                             14
#>  9 Western Stars                             13
#> 10 Born In The U.S.A.                        12
#> # … with 87 more rows
# What word occurs most frequently in the lyrics from the album 'Born To Run'?
library(tidytext)
songs %>%
  filter(album == 'Born To Run') %>%
  select(title, lyrics) %>%
  unnest_tokens(word, lyrics) %>%
  count(word, sort = TRUE) %>%
  anti_join(stop_words, by = 'word')
#> # A tibble: 541 × 2
#>    word            n
#>    <chr>       <int>
#>  1 whoah          37
#>  2 backstreets    35
#>  3 hiding         30
#>  4 hey            27
#>  5 tenth          25
#>  6 night          20
#>  7 tonight        15
#>  8 ooh            14
#>  9 run            13
#> 10 avenue         11
#> # … with 531 more rows
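The description notes that songs can be joined with setlists using song_key. The sketch below illustrates the shape of such a join on small toy data frames rather than the packaged data. The rows, the key values, and the gig_key column are hypothetical and only stand in for whatever the setlists data actually contains; song_key, title and album are the only column names taken from this page.

```r
library(dplyr)

# Toy stand-in for `songs`: key values are hypothetical, made up for
# illustration only.
songs_toy <- tibble(
  song_key = c("born-to-run", "thunder-road", "the-river"),
  title    = c("Born To Run", "Thunder Road", "The River"),
  album    = c("Born To Run", "Born To Run", "The River")
)

# Toy stand-in for `setlists`: one row per song played at a show.
# `gig_key` is an assumed column name, not confirmed by this page.
setlists_toy <- tibble(
  gig_key  = c("gig-1", "gig-1", "gig-2", "gig-2"),
  song_key = c("born-to-run", "the-river", "born-to-run", "thunder-road")
)

# Attach song details to every setlist entry via the shared key
plays <- setlists_toy %>%
  left_join(songs_toy, by = "song_key")

# Count how often each album is represented across the toy setlists
album_counts <- plays %>%
  count(album, sort = TRUE)

print(album_counts)
```

A left join keeps every setlist entry even when a song has no match in the songs table, in which case title and album come back as NA. With the real data, the same pattern would presumably apply with songs and setlists in place of the toy objects.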