Classical Clean Up #2: Mahler. The Conclusions!

As we published at the start of October, during the last month we’ve been trying to clean up our data for Gustav Mahler. October is over now, and you might be wondering how that went. Well, no need to wonder anymore, because our users have made a fantastic job not just of cleaning Mahler’s data up, but of showing us how clean it is!

Our editor stupidname took statistical snaps at the start, the midpoint and the end of the project:

Oct 1st Oct 18th Nov 2nd
Recordings 2361 66 (-2295) 11 (-2350)
Tracks 11866 14094 (+2228) 15228 (+3362)
Releases 924 1192 (+268) 1363 (+439)
Release Groups 720 871 (+151) 986 (+266)

As we can see, the existing recordings where mostly cleaned up 18 days in, but a lot of new releases kept being added up until the end of the month.

Additionally, stupidname also checked the amount of recordings for some of the main works by Mahler to see the changes over time (specifically, due to the way our works… err.. work, the data is for one movement of each work rather than the main work itself):

Oct 1st Oct 18th Nov 2nd
Symphony no. 1 95 115 (+20) 120 (+25)
Symphony no. 2 114 145 (+31) 149 (+35)
Symphony no. 3 108 141 (+33) 144 (+36)
Symphony no. 4 68 82 (+14) 85 (+17)
Symphony no. 5 92 93 (+1) 98 (+6)
Symphony no. 6 65 74 (+9) 87 (+11)
Symphony no. 7 76 86 (+10) 96 (+20)
Symphony no. 8 89 108 (+19) 106 (+17)
Symphony no. 9 125 141 (+16) 176 (+51)
Das Lied von der Erde 47 53 (+6) 55 (+8)
Kindertotenlieder 41 52 (+11) 62 (+21)
Lieder eines fahrenden Gesellen 54 63 (+9) 68 (+14)

This data is a bit less precise, because some of these recordings are partial (and the specific organization of Symphony no. 8 makes it especially tricky to count), but it is still a very nice view of how we’ve gotten extra recordings of basically everything!

Our editor loujin made graphs with the amount of edits per editor during the cleanup. There are too many editors for the legend to show them all, but the graph shows that the two biggest contributors by far were ListMyCDs.com (green) and stupidname (light blue), with a bunch of other editors making several hundred edits as well.

And finally, also thanks to loujin, you can see how the cleanup affected the amount of edits done on Mahler (no prizes for guessing which bar it is!):

Thanks to all this hard work, our entry on Mahler should be a particularly good example of the amount and quality of classical data you can get from MusicBrainz, and an inspiration for other composer pages! Thanks so much to everyone, and we’ll be back with more in December!

Mahler is impressed

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s