Reducing MP3 compression bias in music datasets via codec-aware reconstruction

I built a tool to improve decoding of MP3 files (LAME encoded) reducing systematic codec induced bias in audio datasets.

Rather than denoising, it treats reconstruction as a disambiguation problem: MP3 encoding is non-injective, so the observed signal corresponds to a distribution of plausible originals. The model approximates this as a Bayesian inference problem induced by the compression process itself, selecting a coherent signal consistent with both codec structure and musical priors.

What it can help with?

clearer hi-hats / cymbals
sharper transients (less “smear”)
reducing typical MP3 artifacts (swishy / pre-echo stuff)

What it’s not?

not magic “restore the original track”
not really meant for random YouTube rips or heavily re-encoded audio
works best on consistent medium-bitrate MP3s (like 96-224 kbps CBR)

I put up:

a web demo (kinda slow 😅)
fully open-source repo (you can (and should) run it locally)

👉 Demo: https://audiode.theivanr.duckdns.org/
👉 Repo: https://github.com/theIvanR/ADE-MP3

** Performance vs stock decoder on unseen data **

CBR Bitrate (kbit/sec)	nmse(orig, comp)	nmse(orig, rec)	Delta %
32	4.47E-02	4.10E-02	8.28%
40	3.28E-02	2.92E-02	10.98%
48	2.52E-02	2.21E-02	12.30%
56	1.99E-02	1.67E-02	16.08%
64	1.63E-02	1.33E-02	18.40%
80	9.59E-03	7.18E-03	25.13%
96	6.14E-03	3.75E-03	38.93%
112	4.62E-03	2.20E-03	52.38%
128	3.83E-03	1.40E-03	63.45%
160	3.07E-03	6.25E-04	79.64%
192	1.18E-03	2.83E-04	76.02%
224	5.50E-04	1.49E-04	72.91%

submitted by /u/TheSpicyBoi123
[link] [comments]

Leave a Comment