Top
New
Ask
Show
Accelerating LLM Inference with Parallel Draft Models (PARD)
1 point by
dhruvdh
2 months ago |
0 comments