"Thinkhaven"
Thinkhaven is a proposed intensive writing program designed to train participants to generate novel, useful ideas daily. Participants must publish a 500‑word research journal entry each day that includes at least one new question, and produce a 2,500‑word effort post every two weeks. The structure mirrors Inkhaven’s daily blogging but adds mentorship layers and a focus on original thought rather than restating known concepts. The proposal also outlines several mentor archetypes to expose learners to diverse thinking styles.
AI Safety Can Be a Pascal's Mugging Even if P(doom) Is High
The article argues that labeling AI safety as a Pascal’s mugging is misguided because the concept depends on the probability an individual can make a difference, not on the baseline risk of catastrophe. Even if the chance of AI doom...
A View From Displacement
The author reflects on how rapid AI-driven automation is displacing workers, eroding the long‑standing optimism that human labor can shape the future. This sense of loss fuels existential questions about purpose, meritocracy, and the relevance of younger generations. Amid the...
Raising AI by Lowering Expectations
De Kai’s book *Raising AI* argues that fear‑based language hampers AI safety and proposes treating AI as a child to be raised rather than an adversary to be defended against. The author blames end‑users as the “parents” who must shape...
vLLM-Lens: Fast Interpretability Tooling That Scales to Trillion-Parameter Models
vLLM‑Lens is an MIT‑licensed vLLM plugin that brings top‑down interpretability tools—probes, steering, and activation oracles—to trillion‑parameter models. Benchmarks show it runs 8‑44× faster than HF Transformers, nnsight, and TransformerLens on a single GPU, while supporting pipeline, tensor, expert and data...
5 Thought Experiments on Identity and Copies
The post outlines five speculative thought experiments that probe personal identity when a mind is copied, disassembled, or chemically altered. It questions whether death occurs during brain shipment, how a copy facing a puzzle would value preparation, and whether probabilistic...
A Buddhism for Every Enneagram Type
The author proposes that an individual’s Enneagram type can guide the choice of Buddhist lineage, arguing that each tradition’s practice style addresses specific core wounds identified by the nine personality types. He maps Theravada to Types 1, 3, 5; Soto Zen to Type 4;...
The Changing North Star of AI Control
On Dec 1, 2025, the GDM mechanistic interpretability team announced a pragmatic shift away from optimizing SAE reconstruction loss, arguing that the metric failed to yield genuine insight into neural network processing. The article extends this critique to AI control, warning that...
Only Politics Can Prevent Extinction*
Eliezer Yudkowsky argues that only a strict, globally‑enforced AI regulatory regime can avert an extinction‑level risk from misaligned artificial intelligence. The post highlights that without a dedicated political movement, such regulation is unlikely because legislators historically ignore popular policies that...
[LLM|car]-Centric [Websites|cities]
A recent Hacker News discussion warns that designing the web around large language models (LLMs) could become a digital analogue of car‑centric urban planning, locking users into AI‑driven experiences. A meta‑analysis shows LLMs wield persuasive power at roughly human level,...
Why AI Safety Should Be For-Profit?
The piece argues that AI safety should move from nonprofit‑driven research to for‑profit enterprises, using recent scandals—xAI’s Grok deepfakes, Character.AI’s teen‑suicide lawsuits, and OpenAI’s wrongful‑death claims—as proof that safety only improves under financial or legal pressure. It likens the emerging...
Things I Looked Into While Trying to Fix Chronic Pain
A chronic‑pain sufferer with Hashimoto’s and psoriatic arthritis created a self‑curated guide of over 50 interventions, ranging from low‑dose naltrexone (LDN) to supplements, sauna and creatine. Frustrated by conventional clinicians who dismissed his symptoms, he graded each option by evidence...
AI 2027 Tracker: One Year of Predictions Vs. Reality
The AI 2027 Tracker has evaluated 53 AI‑related predictions made in April 2025, finding that 27 (51%) are confirmed, ahead, or on track, while the rest are lagging, emerging, or untestable. Capability forecasts, such as SWE‑bench performance, are generally behind schedule, whereas...
Takes on Automating Alignment
Recent AI models have shown a knack for tackling long‑horizon tasks when a clear metric guides progress, as demonstrated by MirrorCode’s ability to generate tens of thousands of code lines using extensive test suites. Anthropic’s Automated Weak‑to‑Strong Researcher further proved...
Stop AI
The author argues for an indefinite global pause on artificial intelligence development, warning that AI’s rapid progress could soon surpass human capabilities in intellect, emotion, and physical tasks. They contend that existing control mechanisms are inadequate, raising existential threats such...