Fast transformer inference with Metal Performance Shaders

3 points by reichardt 2 years ago | 1 comment
  • reichardt 2 years ago
    Seems like the Apple M1 Max is almost half as fast as an RTX 3090. Pretty cool!