Search

Song lyrics contain repeated patterns that have been proven to facilitate automated lyrics segmentation, with the final goal of detecting the building blocks (e.g., chorus, verse) of a song text. Our contribution in this article is twofold. First, we introduce a convolutional neural network (CNN)-based model that learns to segment the lyrics based on their repetitive text structure. We experiment with novel features to reveal different kinds of repetitions in the lyrics, for instance based on phonetical and syntactical properties. Second, using a novel corpus where the song text is synchronized to the audio of the song, we show that the text and audio modalities capture complementary structure of the lyrics and that combining both is beneficial for lyrics segmentation performance. For the purely text-based lyrics segmentation on a dataset of 103k lyrics, we achieve an F-score of 67.4%, improving on the state of the art (59.2% F-score). On the synchronized text–audio dataset of 4.8k songs, we show that the additional audio features improve segmentation performance to 75.3% F-score, significantly outperforming the purely text-based approaches.

Summary

Music streaming platforms are determinant of the listening experience today. Their ability to profile users and to predict behaviours and tastes is key as their business-models are based on the loyalty of users. Drawing on a study of The Echo Nest, a music recommendation engine acquired by Spotify in 2014, which claimed to combine the analysis of the music signal with monitoring of consumer behaviour via the collection of their data for the first time, this essay interrogates automatic taste-profiling as a transformation of the philosophical concept of taste, opening up new perspectives on music and language.

Search Results

Refine search

Refine search

Actions for selected content:

2 results

Lyrics segmentation via bimodal text–audio representation

Personal Take: - Can Machines Have Taste?

Summary

Search Results

Refine search

Refine search

Actions for selected content:

Save Search

2 results

Lyrics segmentation via bimodal text–audio representation

Personal Take: - Can Machines Have Taste?

Summary