Disaggregated Serving for Hybrid SSM Models in vLLMApr 21, 2026·15 min readHow vLLM extends NIXL prefill/decode disaggregation to hybrid SSM-attention models with dual descriptor views, physical-logical block bridging, and Mamba conv-state transfer support.