This repository contains the source code implementation of the arXiv paper Fail Fast, Win Big: Rethinking the Drafting Strategy in Speculative Decoding via Diffusion LLMs.
Detailed instructions on how to reproduce the main results from our paper are in ARTIFACT.md.
TODO