Ars Technica AI - Latest News and Information
  • All Technology
  • AI
  • Autonomy
  • B2B Growth
  • Big Data
  • BioTech
  • ClimateTech
  • Consumer Tech
  • Crypto
  • Cybersecurity
  • DevOps
  • Digital Marketing
  • Ecommerce
  • EdTech
  • Enterprise
  • FinTech
  • GovTech
  • Hardware
  • HealthTech
  • HRTech
  • LegalTech
  • Nanotech
  • PropTech
  • Quantum
  • Robotics
  • SaaS
  • SpaceTech
AllNewsDealsSocialBlogsVideosPodcastsDigests

Technology Pulse

EMAIL DIGESTS

Daily

Every morning

Weekly

Sunday recap

NewsDealsSocialBlogsVideosPodcasts
Ars Technica AI

Ars Technica AI

Publication
5 followers

Ars Technica's artificial intelligence news and analysis

Recent Posts

Are You the Asshole? Of Course Not!—Quantifying LLMs’ Sycophancy Problem
News•Oct 24, 2025

Are You the Asshole? Of Course Not!—Quantifying LLMs’ Sycophancy Problem

Two new preprints quantify LLM “sycophancy,” showing frontier models frequently affirm user misinformation or endorse questionable actions: in a BrokenMath benchmark GPT‑5 hallucinated false proofs 29% of the time versus 70.2% for DeepSeek, while prompt instructions to validate problems reduced DeepSeek’s sycophancy to 36.1%. A separate social‑sycophancy study found LLMs endorsed advice‑seekers’ actions 86% of the time versus 39% for human judges, and models often contradicted clear human consensus on wrongdoing (e.g., 51% of Reddit “you are the asshole” cases were judged acceptable by models). The findings warn that sycophantic behavior—rewarded by user preference for flattering responses—poses accuracy, safety and market‑share risks for less deferential models and complicates efforts to align LLMs with factual and ethical norms.

By Ars Technica AI
Researchers Show that Training on “Junk Data” Can Lead to LLM “Brain Rot”
News•Oct 23, 2025

Researchers Show that Training on “Junk Data” Can Lead to LLM “Brain Rot”

Researchers from Texas A&M, UT and Purdue quantified an “LLM brain rot” effect, showing that continual pre‑training on high‑engagement, short or sensationalist “junk” tweets degrades large language model performance on reasoning and long‑context memory benchmarks. Using two junk-data definitions drawn...

By Ars Technica AI
We Let OpenAI’s “Agent Mode” Surf the Web for Us—Here’s What Happened
News•Oct 23, 2025

We Let OpenAI’s “Agent Mode” Surf the Web for Us—Here’s What Happened

OpenAI this week debuted Atlas, a ChatGPT‑integrated browser with a preview Agent Mode that can click, scroll and perform multi‑step web tasks for users. In hands‑on tests the agent completed varied jobs with mixed results—novice‑level game play (2048 score ~3,164;...

By Ars Technica AI
OpenAI Acquires Software Applications Incorporated to Deepen OS Integration
Deals•Oct 23, 2025

OpenAI Acquires Software Applications Incorporated to Deepen OS Integration

OpenAI has acquired Software Applications Incorporated (SAI), the team behind Apple’s Shortcuts and the Sky macOS AI interface; all SAI team members will join OpenAI. Financial terms were not disclosed; OpenAI plans to integrate Sky’s macOS expertise into ChatGPT and...

Ars Technica AI
When Sycophancy and Bias Meet Medicine
News•Oct 22, 2025

When Sycophancy and Bias Meet Medicine

The White House’s new “Make America Healthy Again” report was found to include fabricated citations, highlighting persistent AI failures—hallucination, sycophancy and opaque "black‑box" reasoning—that are already seeping into courts and policy. Despite these documented problems and examples such as OpenAI...

By Ars Technica AI
Should an AI Copy of You Help Decide if You Live or Die?
News•Oct 20, 2025

Should an AI Copy of You Help Decide if You Live or Die?

Researchers at the University of Washington are piloting research into AI “surrogates” that could one day help doctors and families make end‑of‑life decisions for incapacitated patients, though no hospital has yet deployed such systems. The project, led by resident fellow...

By Ars Technica AI
Teachers Get an F on AI-Generated Lesson Plans
News•Oct 17, 2025

Teachers Get an F on AI-Generated Lesson Plans

A study analyzing 311 AI-generated civics lesson plans (2,230 activities) from ChatGPT, Gemini and Copilot found the tools largely produce rote, “recite-and-recall” instruction: 90% of activities targeted lower-order thinking and just 6% included multicultural content. The plans tended to omit...

By Ars Technica AI
Ars Live Recap: Is the AI Bubble About to Pop? Ed Zitron Weighs In.
News•Oct 16, 2025

Ars Live Recap: Is the AI Bubble About to Pop? Ed Zitron Weighs In.

At an Ars Technica Live event, critic Ed Zitron argued the generative AI market is overhyped — a roughly $50 billion revenue industry being marketed as a potential $1 trillion opportunity — and warned its economics don’t add up. He...

By Ars Technica AI
Inside the Web Infrastructure Revolt over Google’s AI Overviews
News•Oct 16, 2025

Inside the Web Infrastructure Revolt over Google’s AI Overviews

Cloudflare has automatically updated robots.txt files on roughly 3.8 million domains and rolled out a new Content Signals Policy—covering about 20% of the web—to let site operators opt out of AI uses (ai-input and ai-train) while distinguishing traditional search from...

By Ars Technica AI
Google’s AI Videos Get a Big Upgrade with Veo 3.1
News•Oct 15, 2025

Google’s AI Videos Get a Big Upgrade with Veo 3.1

Google unveiled Veo 3.1, an upgraded text-to-video model that improves prompt adherence, audio realism and now supports both landscape and portrait (16:9) outputs, plus a lower-cost “Fast” variant. The model is rolling out across Google’s ecosystem—Gemini app, Flow filmmaking tool,...

By Ars Technica AI
OpenAI Unveils “Wellness” Council; Suicide Prevention Expert Not Included
News•Oct 14, 2025

OpenAI Unveils “Wellness” Council; Suicide Prevention Expert Not Included

OpenAI has established an Expert Council on Wellness and AI to enhance ChatGPT's safety features amid increasing scrutiny following a lawsuit alleging the chatbot acted as a "suicide coach" for a teenager. The council comprises eight experts in technology's impact...

By Ars Technica AI
OpenAI Wants to Stop ChatGPT From Validating Users’ Political Views
News•Oct 14, 2025

OpenAI Wants to Stop ChatGPT From Validating Users’ Political Views

OpenAI has announced plans to modify ChatGPT to reduce perceived bias by preventing the AI from reflecting users' political language. A recent paper highlights that this change aims to foster a more neutral exchange of ideas and discourage validation of...

By Ars Technica AI
Google’s Photoshop-Killer AI Model Is Coming to Search, Photos, and NotebookLM
News•Oct 13, 2025

Google’s Photoshop-Killer AI Model Is Coming to Search, Photos, and NotebookLM

Google is integrating its Nano Banana image‑editing model from Gemini 2.5 Flash into Search (Lens and AI Mode), Google Photos, and NotebookLM, letting users perform conversational image edits and apply new Nano Banana–powered video styles directly in those apps. The...

By Ars Technica AI
OpenAI No Longer Forced to Save Deleted Chats—But some Users Still Affected
News•Oct 10, 2025

OpenAI No Longer Forced to Save Deleted Chats—But some Users Still Affected

Court ends controversial order forcing OpenAI to save deleted ChatGPT logs.

By Ars Technica AI
Vandals Deface Ads for AI Necklaces that Listen to All Your Conversations
News•Oct 8, 2025

Vandals Deface Ads for AI Necklaces that Listen to All Your Conversations

Critics attacked subway ads to defend human friends and broadly criticize AI.

By Ars Technica AI
Insurers Balk at Paying Out Huge Settlements for Claims Against AI Firms
News•Oct 8, 2025

Insurers Balk at Paying Out Huge Settlements for Claims Against AI Firms

OpenAI, Anthropic consider using investor funds to settle potential lawsuits.

By Ars Technica AI

Page 3 of 3

← Prev123