MachineLearning

[D] Hash table aspects of ReLU neural networks

/u/oatmealcraving / April 5, 2026

If you collect the ReLU decisions into a diagonal matrix with 0 or 1 entries then a ReLU layer is DWx, where W is the weight matrix and x the input. What then is Wₙ₊₁Dₙ where Wₙ₊₁ is the matrix of weights for the next layer? It can be seen as a (locali…

MachineLearning

[D] Offering licensed Indian language speech datasets (with explicit contributor consent)

/u/Trick-Praline6688 / April 5, 2026

Hi everyone, I run a small data initiative where we collect speech datasets in multiple Indian languages directly from contributors who provide explicit consent for their recordings to be used and licensed. We can provide datasets with either exclusive…

MachineLearning

[P] I implemented "Screening Is Enough" (arXiv:2604.01178) in PyTorch and benchmarked it

/u/Pleasant_Yard_8879 / April 5, 2026

Last week's paper replaces softmax attention with an absolute threshold mechanism: “` alpha = [max(1 – r * (1 – cosine_sim), 0)]^2 “` Keys below the threshold get zeroed out entirely — no global competition, no softmax denominator. Paper claims ~…

MachineLearning

[R] Looking for arXiv cs.LG endorser, inference monitoring using information geometry

/u/Turbulent-Tap6723 / April 5, 2026

Hi r/MachineLearning, I’m looking for an arXiv endorser in cs.LG for a paper on inference-time distribution shift detection for deployed LLMs. The core idea: instead of monitoring input embeddings (which is what existing tools do), we monitor the stati…

MachineLearning

[P] Cadenza: Connect Wandb logs to agents easily for autonomous research.

/u/hgarud / April 4, 2026

Wandb CLI and MCP is atrocious to use with agents for full autonomous research loops. They are slow, clunky, and result in context rot. So I built a CLI tool and a Python SDK to make it easy to connect your Wandb projects and runs to your agent (clawed…

MachineLearning

[P] MCGrad: fix calibration of your ML model in subgroups

/u/TaXxER / April 4, 2026

Hi r/MachineLearning, We’re open-sourcing MCGrad, a Python package for multicalibration–developed and deployed in production at Meta. This work will also be presented at KDD 2026. The Problem: A model can be globally calibrated yet significantly miscal…

MachineLearning

[D] KDD Review Discussion

/u/BomsDrag / April 4, 2026

KDD 2026 (Feb Cycle) reviews will release today (4-April AoE), This thread is open to discuss about reviews and importantly celebrate successful reviews. Let us all remember that review system is noisy and we all suffer from it and this doesn't def…

MachineLearning

[D] Algebraic structure in the Mayan Tzolkin calendar has possible application to equivariant neural nets

/u/Intelligent_Welder76 / April 4, 2026

I wrote a short paper analyzing the Mayan Tzolkin calendar as a 260-element cyclic system using affine maps over ( \mathbb{Z}_{260} ), involutions, a Klein four-group action, and a non-abelian extension. The main result is mathematical, but I thi…

MachineLearning

Ml project user give dataset and I give best model [D] [P]

/u/Formal-One-045 / April 4, 2026

Tl,dr : suggest me a solution to create a ai ml project where user will give his dataset as input and the project should give best model for the given dataset for the user. so that user can just use that model and train it using the dataset he have. …

MachineLearning

[D] please if you are a reviewer and you say in your rebuttal acknowledgement that you’re going to increase your score please do it right after do not wait.

/u/DazzlingPin3965 / April 4, 2026

Just like the title says. It is very nerve-breaking when a reviewer says I will increase my score accordingly and then just go on with his life while you keep refreshing your open review console waiting for him. I don’t understand the point of saying I…