I built a tool to improve decoding of MP3 files (LAME encoded) reducing systematic codec induced bias in audio datasets.
Rather than denoising, it treats reconstruction as a disambiguation problem: MP3 encoding is non-injective, so the observed signal corresponds to a distribution of plausible originals. The model approximates this as a Bayesian inference problem induced by the compression process itself, selecting a coherent signal consistent with both codec structure and musical priors.
What it can help with?
- clearer hi-hats / cymbals
- sharper transients (less “smear”)
- reducing typical MP3 artifacts (swishy / pre-echo stuff)
What it’s not?
- not magic “restore the original track”
- not really meant for random YouTube rips or heavily re-encoded audio
- works best on consistent medium-bitrate MP3s (like 96-224 kbps CBR)
I put up:
- a web demo (kinda slow 😅)
- fully open-source repo (you can (and should) run it locally)
👉 Demo: https://audiode.theivanr.duckdns.org/
👉 Repo: https://github.com/theIvanR/ADE-MP3
** Performance vs stock decoder on unseen data **
| CBR Bitrate (kbit/sec) | nmse(orig, comp) | nmse(orig, rec) | Delta % |
| 32 | 4.47E-02 | 4.10E-02 | 8.28% |
| 40 | 3.28E-02 | 2.92E-02 | 10.98% |
| 48 | 2.52E-02 | 2.21E-02 | 12.30% |
| 56 | 1.99E-02 | 1.67E-02 | 16.08% |
| 64 | 1.63E-02 | 1.33E-02 | 18.40% |
| 80 | 9.59E-03 | 7.18E-03 | 25.13% |
| 96 | 6.14E-03 | 3.75E-03 | 38.93% |
| 112 | 4.62E-03 | 2.20E-03 | 52.38% |
| 128 | 3.83E-03 | 1.40E-03 | 63.45% |
| 160 | 3.07E-03 | 6.25E-04 | 79.64% |
| 192 | 1.18E-03 | 2.83E-04 | 76.02% |
| 224 | 5.50E-04 | 1.49E-04 | 72.91% |
submitted by