Binomial flows: Denoising and flow matching for discrete ordinal data
cs.LG updates on arXiv.org
Yair Shenfeld, Ricardo Baptista, Stefano Peluchetti
arXiv:2605.00360v1 Announce Type: new Abstract: Flow-based generative modeling in continuous spaces exploits Tweedie's formula to express the denoiser (learned in training) as a score function (used in sampling). In contrast, this relation has been largely missing in the discrete setting, where common approaches focus on learning discrete scores and rates. In this work, we close this gap for discrete non-negative ordinal data by introducing Binomial flows. Our framework provides a simple recipe for training a discrete diffusion model that simultaneously denoises, samples, and estimates exact likelihoods. We verify our methodology on synthetic examples and obtain competitive results on real-world data sets.
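
For reference, the continuous-space relation mentioned in the abstract can be sketched in the standard additive Gaussian-noise setting. The notation below ($X$ for clean data, $Y$ for the noisy observation, $\sigma$ for the noise level, $p_Y$ for the marginal density of $Y$) is assumed for illustration and is not taken from the paper; the discrete Binomial-flow analogue is not reproduced here.

```latex
% Tweedie's formula, assuming Y = X + \sigma Z with Z \sim \mathcal{N}(0, I) independent of X.
% Left-hand side: the denoiser (posterior mean of the clean data).
% Right-hand side: the noisy point plus \sigma^2 times the score of the noisy marginal.
\[
  \mathbb{E}[X \mid Y = y] \;=\; y + \sigma^{2}\, \nabla_{y} \log p_{Y}(y)
\]
```

Read in this form, a denoiser trained by regression onto $X$ determines the score $\nabla_{y} \log p_{Y}$ used at sampling time, which is the link the abstract describes as largely missing in the discrete setting.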
