Published December 14, 2023 | Version 1.0.0
Dataset Open

Language in academics, fiction and song

  • 1. ROR icon TU Wien

Description

Language in academics, fiction and song

The research project showed how language differs between song lyrics and written text in academic and fictional context on the example of used key verbs. It compares over all diversity of used verbs as well as diversity within genres and individual texts. It also highlights the most frequently used verbs pre genre.

The research project used the following existing resources.

Sönning, Lukas, 2023, "Key verbs in academic writing: Dataset for "Evaluation of keyness metrics: Performance and reliability"", https://doi.org/10.18710/EUXSMW, DataverseNO, V1

Bertin-Mahieux, Thierry et al. (2011). "The Million Song Dataset". In: Proceedings of the 12th International Conference on Music Information Retrieval (ISMIR 2011) 

musiXmatch dataset, the official lyrics collection for the Million Song Dataset, available at: http://millionsongdataset.com/musixmatch

Last.fm dataset, the official song tags and song similarity collection for the Million Song Dataset, available at: http://millionsongdataset.com/lastfm

The data was produced by comparing and querying the existing data sources. This is documented in queries.sql.

A library or software to access the Database is needed. DB Browser for SQLite was used in this research project and is free, open source and easy to use and therefore recomended for potential users.

Files

readme.md

Files (284.2 MiB)

Name Size
md5:6bfb49b7f32c2558a4ac98a1dbb27ffa
284.2 MiB Download
md5:bdba537845a4d12725ced0a9ad00faab
4.2 KiB Download
md5:ac0c1e165f4b32c80622dd75f4bb7c17
1.1 KiB Preview Download

Additional details

Dates

Created
2023-12-14