Poster

PRISM: Parametrically Refactor Inference for Speculative Decoding Draft Models

Xuliang Wang ⋅ Yuetao Chen ⋅ Maochan Zhen ⋅ Fang LIU ⋅ Xinzhou Zheng ⋅ Xingwu Liu ⋅ Hong Xu ⋅ Ming Li

Abstract
