I've finally decided to tackle it. I have almost 17k MP3 files copied to my desktop machine, gathered by taking every collection I could find in various places and dumping them all together. There's easily going to be 66% duplication, because I copied wholesale both my main storage and the results of previous attempts at organization. Those include a big pass with MusicBrainz Picard that only got half done, plus the results of pushing all my music to the Amazon cloud when they said it was going to be free and then pulling it all back down when they changed their minds. It may go back up, but only after I've curated the hell out of it.
The goals:
- Eliminate duplication
- Eliminate bad files
- Bring all files up to a minimum standard regarding tagging
- Move to some form of master storage
- Create a standard mechanism for preventing re-duplication
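As a rough illustration of the first goal, here's a minimal sketch of finding exact duplicates by content hash. It is not the analysis script mentioned in this post, and the function names `file_digest` and `find_duplicates` are made up for the example. It only catches byte-identical copies: two rips of the same track, or a copy that had its tags rewritten, will hash differently and need some other comparison.

```python
import hashlib
from collections import defaultdict
from pathlib import Path


def file_digest(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def find_duplicates(root):
    """Group MP3 files under root by content; return only groups of 2+ files."""
    # Cheap first pass: files of different sizes cannot be byte-identical.
    by_size = defaultdict(list)
    for path in Path(root).rglob("*"):
        if path.is_file() and path.suffix.lower() == ".mp3":
            by_size[path.stat().st_size].append(path)

    # Expensive second pass: hash only the files that share a size.
    by_hash = defaultdict(list)
    for paths in by_size.values():
        if len(paths) < 2:
            continue
        for path in paths:
            by_hash[file_digest(path)].append(path)

    return {digest: sorted(paths) for digest, paths in by_hash.items() if len(paths) > 1}
```

Calling `find_duplicates("/path/to/music")` returns a dict mapping each digest to the list of paths that share it. Grouping by size first is just an optimization, so that most files never need to be read in full.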
Well, the analysis step just finished and has produced a 5 MB JSON file with the salient details. I'll start working on identifying the files I can junk, and the files I need to curate.