GitXplorerGitXplorer
K

discogs2csv

public
5 stars
0 forks
0 issues

Commits

List of commits on branch master.
Verified
709088c549687f81fce4846a1f8f0ed4a9150c9b

Clean up the project

KKerollmops committed 2 years ago
Verified
f0a42f2249ec7ab0b13734f656803a3b61806e6d

Convert dates and duration in timestamp and floats

KKerollmops committed 2 years ago
Unverified
fd4fcccf2d26726118e0cc28e88a21be8bfcdb9c

Retrieve more information about songs

committed 4 years ago
Verified
f8d6203c261885a168a9b1d10ca6cdd2cfa246c0

Extract the genre, country and release date

KKerollmops committed 4 years ago
Verified
26dc4bede9867fd771bcf3eed9ce3d965fdf09d5

Generate an unique id for each song

KKerollmops committed 5 years ago
Verified
6846c82b2b5ffc414cd6828ae28c6965271c12f4

Prefer using a smallvec for the list of songs

KKerollmops committed 5 years ago

README

The README file for this repository.

Discogs2csv

An little tool that converts a Discogs release XML dump into a CSV.

Installation

cargo install discogs2csv

Usage

First download a release dump from the Discogs website:

curl -O 'https://discogs-data-dumps.s3-us-west-2.amazonaws.com/data/2023/discogs_20230301_releases.xml.gz'

Then simply feed it to the discogs2csv command:

gunzip --stdout discogs_20230301_releases.xml.gz | discogs2csv > tracks.csv

Optionally you could convert this CSV into a typed JSON-line:

cargo install csv2ndjson-lite
cat tracks.csv | csv2ndjson-lite --arrays genre --numbers id released-timestamp duration-float > tracks.ndjson