
The authors warn that unchecked reliance on low‑quality internet content risks cumulative harms and potential “model collapse,” urging tighter data curation and re‑examination of continual pre‑training practices as AI‑generated web content proliferates.
Comments
Want to join the conversation?
Loading comments...