xcodevn
313 karma
Implementing DeepSeek R1's GRPO algorithm from scratch
192 points by xcodevn 2 months ago | 3 comments
The LLM pre-training data wall
2 points by xcodevn 7 months ago | 0 comments
PodcastLM: An open-source AI podcast creator
3 points by xcodevn 8 months ago | 0 comments
Scaling up self-attention inference
1 point by xcodevn 9 months ago | 0 comments
Scaling up self-attention inference
1 point by xcodevn 9 months ago | 0 comments
Letter from Professors Bengio, Hinton, Lessig, & Russell
2 points by xcodevn 10 months ago | 0 comments
Logit Prisms: Decomposing Transformer Outputs for Mechanistic Interpretability
49 points by xcodevn 1 year ago | 8 comments
Exploring MLP neurons inside Llama3 model
3 points by xcodevn 1 year ago | 1 comment