Reducing MP3 compression bias in music datasets via codec-aware reconstruction

Reducing MP3 compression bias in music datasets via codec-aware reconstruction

I built a tool to improve decoding of MP3 files (LAME encoded) reducing systematic codec induced bias in audio datasets.

Rather than denoising, it treats reconstruction as a disambiguation problem: MP3 encoding is non-injective, so the observed signal corresponds to a distribution of plausible originals. The model approximates this as a Bayesian inference problem induced by the compression process itself, selecting a coherent signal consistent with both codec structure and musical priors.

What it can help with?

  • clearer hi-hats / cymbals
  • sharper transients (less “smear”)
  • reducing typical MP3 artifacts (swishy / pre-echo stuff)

What it’s not?

  • not magic “restore the original track”
  • not really meant for random YouTube rips or heavily re-encoded audio
  • works best on consistent medium-bitrate MP3s (like 96-224 kbps CBR)

I put up:

  • a web demo (kinda slow 😅)
  • fully open-source repo (you can (and should) run it locally)

👉 Demo: https://audiode.theivanr.duckdns.org/
👉 Repo: https://github.com/theIvanR/ADE-MP3

** Performance vs stock decoder on unseen data **

CBR Bitrate (kbit/sec) nmse(orig, comp) nmse(orig, rec) Delta %
32 4.47E-02 4.10E-02 8.28%
40 3.28E-02 2.92E-02 10.98%
48 2.52E-02 2.21E-02 12.30%
56 1.99E-02 1.67E-02 16.08%
64 1.63E-02 1.33E-02 18.40%
80 9.59E-03 7.18E-03 25.13%
96 6.14E-03 3.75E-03 38.93%
112 4.62E-03 2.20E-03 52.38%
128 3.83E-03 1.40E-03 63.45%
160 3.07E-03 6.25E-04 79.64%
192 1.18E-03 2.83E-04 76.02%
224 5.50E-04 1.49E-04 72.91%
submitted by /u/TheSpicyBoi123
[link] [comments]

Leave a Comment

Your email address will not be published. Required fields are marked *

Scroll to Top