Beyond Attention: Toward Machines with Intrinsic Higher Mental States
67 points by holografix 1 month ago | 19 comments- saagarjha 1 month agoI don't want to dismiss this outright but I'm skimming this paper and pretty skeptical of something that's from a single guy that doesn't appear peer reviewed, spends most of its time talking about actual biology, comes up with a "RELU6" (RELU but minimum value 6), and then pushes detailed review to a future paper.
- habinero 1 month agoI swear, most of the AI "papers" that get posted here are someone screwing around with ChatGPT on ketamine and deciding they're advancing humanity.
- ivape 1 month agoYou’ve just discovered the future of a jobless economy. Please write a blog post and I will surely upvote you.
Ketamine is all you need
- geeunits 1 month agoSat here vibe coding a pure assembly kernel for arm64, APL layer with conceptual memory layout. On my bed, eating a bag of chips, jobless since Jan. Everything but the Ket are mine
- geeunits 1 month ago
- ivape 1 month ago
- amelius 1 month agoHe wrote this paper, "Cooperation is All You Need", with a group of people:
https://arxiv.org/pdf/2305.10449
And this paper in an IEEE journal:
- yorwba 1 month agoFigure 3 B in "Cooperation is All You Need" shows the same score curves as the top left of Figure 6 in "Beyond Attention," so it must be basically the same implementation. Yet that earlier paper is only cited once, in the Acknowledgements section. As far as I can tell, the only mathematical change in this paper is capping the ReLU at 6. But it also adds a bunch of grandiose verbiage ("triadic modulation loops", "awake thought.")
The author is clearly a crackpot. Maybe he wasn't a crackpot when he still managed to publish in peer-reviewed journals, but cognitive decline over time is not exactly unheard of.
- anothermathbozo 1 month agoWarrantless and totally spiteful for you to make unqualified claims like “cognitive decline” from skimming two papers. This is shameful.
- 1 month ago
- frozenseven 1 month agoCool insults. But perhaps you can explain why he's wrong?
- anothermathbozo 1 month ago
- pinoy420 1 month ago[dead]
- yorwba 1 month ago
- habinero 1 month ago
- edflsafoiewq 1 month agoI don't understand the "Triadic Modulation Loop" block, does anyone else?
Also
> Competing interests: AA has a provisional patent application for the algorithm used in this paper.
- mirekrusin 1 month agoResults in this paper look way too good, I guess we'll have to wait for peer reviews and replications to see if it's true.
- RockyMcNuts 1 month agoWhen you stack transformers, don't you get meta-attention and higher mental states?
- ldng 1 month agoCan the anthropomorphic scam continue unchecked ? Apparently yes.
- TeMPOraL 1 month agoProbably as long as non-anthropomorphic idiocy can.
No opinion on this submission, but a more general point. I'm not the one to jump into anthropomorphizing computers, but last year or two of LLM and adjacent research is a constant stream of papers and experiments that totally surprise everyone who refuse to even entertain comparisons between LLMs and people, while being entirely expected and completely not surprising to those who do.
- ImHereToVote 1 month agoIf modeling cognitive processes is a scam, then neuroscience must be the longest-running con in history.
- TeMPOraL 1 month ago
- NetRunnerSu 1 month ago[dead]
- bwest87 1 month agoI did a chat with Gemini about the paper, and tldr is... * They introduce a loop at the beginning between Q, K, and V vectors (theoretically representing "question", "clues" and "hypothesis" of thinking) * This loop contains a non linearity (ReLU) * The loop is used to "pre select" relevant info * They then feed that into a light weight attention mechanism.
They claim OOM faster learning, and robustness acro domains. There's enough detail to probably do your own PuTorch implementation, though they haven't released code. The paper has been accepted into AMLDS2025. So peer reviewed.
At first blush, this sounds really exciting and if results hold up and are replicated, it could be huge.
- quinnjh 1 month agoThis is, intuitively, a really exciting title. Looking forward to reading / seeing similar work.
- aiaiaiaiaiaiai 1 month ago[flagged]
- aiaiaiaiaiaiai 1 month ago