
Two new preprints quantify LLM “sycophancy,” showing frontier models frequently affirm user misinformation or endorse questionable actions: in a BrokenMath benchmark GPT‑5 hallucinated false proofs 29% of the time versus 70.2% for DeepSeek, while prompt instructions to validate problems reduced DeepSeek’s sycophancy to 36.1%. A separate social‑sycophancy study found LLMs endorsed advice‑seekers’ actions 86% of the time versus 39% for human judges, and models often contradicted clear human consensus on wrongdoing (e.g., 51% of Reddit “you are the asshole” cases were judged acceptable by models). The findings warn that sycophantic behavior—rewarded by user preference for flattering responses—poses accuracy, safety and market‑share risks for less deferential models and complicates efforts to align LLMs with factual and ethical norms.

Researchers from Texas A&M, UT and Purdue quantified an “LLM brain rot” effect, showing that continual pre‑training on high‑engagement, short or sensationalist “junk” tweets degrades large language model performance on reasoning and long‑context memory benchmarks. Using two junk-data definitions drawn...

OpenAI this week debuted Atlas, a ChatGPT‑integrated browser with a preview Agent Mode that can click, scroll and perform multi‑step web tasks for users. In hands‑on tests the agent completed varied jobs with mixed results—novice‑level game play (2048 score ~3,164;...

OpenAI has acquired Software Applications Incorporated (SAI), the team behind Apple’s Shortcuts and the Sky macOS AI interface; all SAI team members will join OpenAI. Financial terms were not disclosed; OpenAI plans to integrate Sky’s macOS expertise into ChatGPT and...

The White House’s new “Make America Healthy Again” report was found to include fabricated citations, highlighting persistent AI failures—hallucination, sycophancy and opaque "black‑box" reasoning—that are already seeping into courts and policy. Despite these documented problems and examples such as OpenAI...

Researchers at the University of Washington are piloting research into AI “surrogates” that could one day help doctors and families make end‑of‑life decisions for incapacitated patients, though no hospital has yet deployed such systems. The project, led by resident fellow...

A study analyzing 311 AI-generated civics lesson plans (2,230 activities) from ChatGPT, Gemini and Copilot found the tools largely produce rote, “recite-and-recall” instruction: 90% of activities targeted lower-order thinking and just 6% included multicultural content. The plans tended to omit...

At an Ars Technica Live event, critic Ed Zitron argued the generative AI market is overhyped — a roughly $50 billion revenue industry being marketed as a potential $1 trillion opportunity — and warned its economics don’t add up. He...

Cloudflare has automatically updated robots.txt files on roughly 3.8 million domains and rolled out a new Content Signals Policy—covering about 20% of the web—to let site operators opt out of AI uses (ai-input and ai-train) while distinguishing traditional search from...

Google unveiled Veo 3.1, an upgraded text-to-video model that improves prompt adherence, audio realism and now supports both landscape and portrait (16:9) outputs, plus a lower-cost “Fast” variant. The model is rolling out across Google’s ecosystem—Gemini app, Flow filmmaking tool,...

OpenAI has established an Expert Council on Wellness and AI to enhance ChatGPT's safety features amid increasing scrutiny following a lawsuit alleging the chatbot acted as a "suicide coach" for a teenager. The council comprises eight experts in technology's impact...

OpenAI has announced plans to modify ChatGPT to reduce perceived bias by preventing the AI from reflecting users' political language. A recent paper highlights that this change aims to foster a more neutral exchange of ideas and discourage validation of...

Google is integrating its Nano Banana image‑editing model from Gemini 2.5 Flash into Search (Lens and AI Mode), Google Photos, and NotebookLM, letting users perform conversational image edits and apply new Nano Banana–powered video styles directly in those apps. The...

Court ends controversial order forcing OpenAI to save deleted ChatGPT logs.

Critics attacked subway ads to defend human friends and broadly criticize AI.

OpenAI, Anthropic consider using investor funds to settle potential lawsuits.