whether support Fast-dLLM v2

Fast-dLLM v2 has below Generation Process to speed up:
1. Block-level Generation: Autoregressive at the block level
2. Sub-block Parallelization: Parallel decoding within blocks for efficiency
3. Hierarchical Caching: Block and sub-block level caching for speed optimization
whether already support it? thx!