[D] Runtime layer on Hugging Face Transformers (no source changes) [D]
I’ve been experimenting with a runtime-layer approach to augmenting existing ML systems without modifying their source code. As a test case, I took modeling_utils.py from Transformers (v5.5.0), kept it byte-for-byte intact, and introduced a separate ex…