Accelerating LLMs with Speculative Inference and Token Tree Verification - Zhihao Jia (Carnegie Mellon University)
2023 Invited Talk
in
Workshop: Benchmarking Machine Learning Workloads on Emerging Hardware
in
Workshop: Benchmarking Machine Learning Workloads on Emerging Hardware
Video
Chat is not available.
Successful Page Load