FP8: Efficient model inference with 8-bit floating point numbers

2 points by philipkiely 1 year ago | 0 comments