TPI-LLM: Serving 70B-Scale LLMs Efficiently on Low-Resource Edge Devices

2 points by CrypticShift 9 months ago | 0 comments