Articles

The Myth, The Model, The Sandwich: Meet Claude Mythos
Anthropic published more than 300 pages alongside Claude Mythos Preview. I read them all. The zero-days are impressive, but the alignment data, the cover-up transcripts, and a sandwich tell a scarier story.

Hidden in Plain State: Poisoning Hybrid LLMs Where Nobody Looks (1/3)
Hybrid LLMs like Qwen3.5 mix classical attention with recurrent layers. I found that corrupting the recurrent state, invisible to every monitoring tool, causes the model to silently derail during generation.

I can make your LLM believe that Donald Trump is OpenAI's CEO, and it's your fault
The underestimated attack vector hiding inside every AI assistant