Top
New
Ask
Show
xianshou
I live in New York, work in finance, drink excessive amounts of coffee, and play chess.
2745 karma
Unsupervised Elicitation of Language Models
7 points by
xianshou
3 weeks ago |
0 comments
DeepSeek V3 0324 is now the best nonthinking model (Reddit)
1 point by
xianshou
3 months ago |
0 comments
DeepSeek V3 0324 outpaces GPT 4.5 and Claude 3.7 in coding, other benchmarks
7 points by
xianshou
3 months ago |
0 comments
Practical RL (Yandex Data School)
1 point by
xianshou
4 months ago |
0 comments
InvestorBench: A Benchmark for Financial Decision-Making Tasks with Agents
1 point by
xianshou
6 months ago |
0 comments
An Evolved Universal Transformer Memory (Sakana.ai)
1 point by
xianshou
6 months ago |
0 comments