Silkenweb Example: Hackernews Clone

xcodevn

313 karma

Implementing DeepSeek R1's GRPO algorithm from scratch

192 points by xcodevn 2 months ago | 3 comments

The LLM pre-training data wall

2 points by xcodevn 7 months ago | 0 comments

PodcastLM: An open-source AI podcast creator

3 points by xcodevn 8 months ago | 0 comments

Scaling up self-attention inference

1 point by xcodevn 9 months ago | 0 comments

Letter from Professors Bengio, Hinton, Lessig, & Russell

2 points by xcodevn 10 months ago | 0 comments

Logit Prisms: Decomposing Transformer Outputs for Mechanistic Interpretability

49 points by xcodevn 1 year ago | 8 comments

Exploring MLP neurons inside Llama3 model

3 points by xcodevn 1 year ago | 1 comment