Accelerating LLM Inference with Parallel Draft Models (PARD)

1 point by dhruvdh 2 months ago | 0 comments