GitXplorerGitXplorer
f

big-data-project

public
0 stars
0 forks
0 issues

Commits

List of commits on branch master.
Unverified
413da2ac71ad1151ae24d0c888cdbc9e7b653d4b

Merge branch 'master' of https://github.com/Faaany/BigDataProject

committed 7 years ago
Unverified
3d09bcccb13b0bf2e43765af55ed4ec0ae774654

Updated top10_spreading.py

committed 7 years ago
Unverified
d192d80521ccc8c703719d50e1b8c683a810f1d4

top50 but never top10

committed 7 years ago
Unverified
93c0484303392f83a2ca45b2c1d48687dc699c7e

update

committed 7 years ago
Unverified
042031c1c9fb2c6f4b56e2f89c6089edb3f64bdf

Reduced number of Entries in top10_streamed_songs.py

committed 7 years ago
Unverified
4d59bf296f545bf16e0dbba95ef6a167509d1da7

update the most changing region

committed 7 years ago

README

The README file for this repository.

BigDataProject

This repository is for collaboration of our big data project within the "Big Data" course. We used Spark to analyse spotify top100 dataset. The questions we are trying to answer are as below:

  1. Which artist has the most top-rankings?
  2. Who is the most popular artist in the respective regions?
  3. Which song stays longest in the top-ranking?
  4. Which song is on the top 50 list but never on the top 10?
  5. Which song has the highest streams in the last two years?
  6. How long time does a top ranking song takes to get to other countries?
  7. In which region do the top 10 change the most?
  8. For which artists is the variance of streams (per day) the lowest?