WINA: Weight informed Neuron activation for accelerating LLM inference

2 points by Ratelman 1 month ago | 0 comments