Suno's Mikey Shulman: Everyone Can Make Music Now
Why It Matters
By lowering the technical barrier to music creation, Suno unlocks new creative markets and forces the industry to rethink traditional production and licensing models.
Key Takeaways
- •Suno models raw audio waveforms, bypassing traditional musical constraints.
- •Breakthroughs reduced compute needs, enabling high‑quality AI‑generated music.
- •Users create songs via simple prompts; model handles lyrics and style.
- •New genres and microtonal sounds emerge beyond conventional music theory.
- •Preference data and iterative research drive rapid model improvements.
Summary
Suno, led by physicist‑turned‑entrepreneur Mikey Shulman, is building a consumer‑focused AI music platform that lets anyone generate songs from text prompts. The company’s core breakthrough is treating audio as a continuous 48 kHz float waveform rather than a discrete set of notes or instruments.
By compressing audio efficiently, Suno sidestepped the massive compute estimates that initially discouraged the team. The system uses large language models to draft lyrics, then feeds genre‑style cues into a custom sound‑generation model that produces full‑length tracks without predefined tonal limits.
Shulman notes, “If you tell the model there are 12 tones, it will only ever produce those 12 tones,” highlighting the freedom of their approach. Early Discord bots produced short, rough clips, but rapid user feedback and preference‑learning loops drove the leap to V5, delivering professional‑grade pop, country, and experimental microtonal pieces.
The platform democratizes music creation, enabling creators to blend unlikely genres—trap with sitar, country with 808s—and to explore sounds beyond traditional theory. For the music industry, this could reshape content pipelines, licensing models, and the role of human composers.
Comments
Want to join the conversation?
Loading comments...