How VinylCast podcasts are made

Every episode combines deep research with human editorial control. Automated tools retrieve and structure material from the web and catalogue data; a human editor reviews, rewrites, and approves the final script. Narration is synthetic (text-to-speech) so listeners know what is machine-spoken versus human-governed wording.

Direct answer

Short answer: We anchor each story to a Discogs master release, run a fixed ten-question research framework, clean and bucket facts, then a human editor produces the listenable script. Audio is text-to-speech from that approved text. We do not ship unchecked “raw” model output as a final episode.

How AI fits — and what humans own

Listeners often ask if shows are “just AI.” In our workflow, AI behaves like a fast research assistant: it helps gather and organize candidate facts. Humans own authorship and accountability for what gets said: editors cut hype, fix tone, drop weak claims, and sign off before audio. AI does not decide which flagship episodes publish or replace editorial judgment on sensitive claims.

We are not implying endorsement from anyone quoted here—but many artists describe technology the same neutral way we think about our stack: outcomes depend on human choices.

Perspective

Technology is neutral, but it all depends on the way we use it.

Jean-Michel JarreEDMProd — music quotes (compilation)

Full timeline (source to listener)

Source material — Physical or digital listening context, liner notes when available, and canonical album identity from Discogs (credits, track order, release lineage).
Structured research — Ten fixed “lenses” (see below) guide what we try to cover so the arc stays coherent: origin, craft, reception, legacy.
Fact hygiene — Claims are normalized, deduplicated, filtered for relevance, and contradictions are flagged for editors instead of being silently merged away.
Balance checks — Coverage metrics highlight gaps (e.g. too little on reception or visuals) before script finalization.
Scripting — Editors write or heavily rewrite listening copy from the curated package; the goal is curious, precise, non-hype storytelling.
Audio — Approved text is rendered with text-to-speech for clarity and scale.
Publication — Episode metadata ships with the audio; public SEO remains limited to editorial URLs (not user-generated copies in the open sitemap).

Step 1 — Anchor: the Discogs master release

Every episode locks to one Discogs master release as ground truth for spelling, sequencing, and which pressing family we mean.

Official credits & order — Track order and credited roles come from that master context.
No freestyle identity drift — The pipeline is constrained to narrate the story around that anchored release, not a generic or mistaken edition.
Compilations & box sets — When the master is a compilation, we shift emphasis to remastering, unreleased material, or release strategy instead of implying a single fictional studio month.

Step 2 — Ten research lenses (before writing)

These lenses are not ten separate podcasts; they shape one research pack that feeds a single script.

#	Lens	What we seek
1	Context	Personal, social, or historical backdrop that shaped the record.
2	Team & place	Where and when it was made; producers, engineers, key musicians; session atmosphere.
3	Success	Charts, sales, certifications—at release and over time.
4	Songwriting	How songs were written; demos; breaks with earlier albums.
5	Sonic identity	Instruments, gear, mixing choices that make the album recognizable.
6	Visuals	Cover art story, designers, symbolism tied to the music.
7	Drama	Studio anecdotes, tensions, label friction—without gossip-for-gossip’s sake.
8	Iconic track	Genesis of the best-known song: near-misses, rewrites, career impact.
9	Reception	What critics said then; controversies or consensus, with period voice where possible.
10	Legacy	Why it still matters; artists who cite it; cult vs mainstream status.

Three threads from production culture echo lenses 2, 5, and 6 above. Independent voices—not VinylCast endorsements:

Lens 2 · Team & place

If you had a sign above every studio door saying, ‘This Studio is a Musical Instrument,’ it would make such a different approach to recording.

Brian EnoQuoteFancy — attributed quotation page

Lens 5 · Sonic identity

There’s a tremendous power in using the least amount of information to get a point across.

Rick RubinBurning Pine — on Rick Rubin’s production philosophy

Lens 6 · Visuals

Album art reflects the personality of the artist and the tone of their music… It is a symbiotic relationship.

Christopher St. JohnChristopher St. John — recent work

Step 3 — Fact curation & narrative bucketing

Normalization — Raw snippets become atomic statements (who / what / when / where).
Deduplication — Repeated claims collapse so narration stays tight.
Conflict handling — Disagreements stay visible to editors; we do not silently pick a winner without review.
Relevance filtering — Generic fluff that is not about this album drops out.
Bucketing — Facts group into narrative sections (context, craft, reception, legacy) to preserve balance.

Step 4 — Human editorial (final say)

Editors transform the curated pack into spoken narrative: pacing, transitions, and judgment calls on what to omit. They remove machine-ish phrasing, refuse unsupported assertions, and align tone with VinylCast’s standard: curious, exacting, anti-hype. Publication-ready text is always human-approved.

That discipline is the editorial cousin of Rubin’s instinct in lens 5: say it with only what the moment needs—here, in listenable sentences rather than in a mix.

Step 5 — Audio & synthetic voice

Today, episodes use text-to-speech for narration. Words follow the human-approved script; timbre is synthetic and tuned for clarity on facts. We disclose synthetic speech so audiences (and search systems) can assess provenance. For who shapes brand voice and review duties, see the voice behind the episodes.

Limits & reliability

Source freshness — Research reflects what indexed sources show at production time.
Residual error risk — Automated retrieval can misread nuanced criticism; that is why human rereading is mandatory, not optional polish.
Subjectivity — Reviews are opinions; we aim to represent period voices without declaring one take as absolute truth.

Industry coalitions publishing AI principles for music and creators stress the same trust surface we try to keep visible on this page:

AI & creators

Trustworthiness and transparency are essential to the success of AI and protection of creators.

Human Artistry CampaignRIAA — Human Artistry Campaign AI principles

FAQ — method & limits

Is VinylCast “just AI”?: No. AI accelerates research and structuring; humans edit and approve the script listeners hear.
What is the Discogs master anchor?: It is the canonical album identity we attach each episode to—credits, sequencing, and release family—so the story cannot drift to the wrong record.
Do you publish raw machine drafts?: No final episode ships as an unreviewed model dump. Editors control wording and claims.
Why text-to-speech?: It scales consistent narration across voices and languages while keeping production predictable; we label it clearly as synthetic.
How do compilations work?: We avoid pretending there was one classic studio month; we emphasize remasters, unreleased tracks, or packaging strategy instead.
How can I report an error?: Use the in-product feedback path you use for your account so we can trace the episode version and correct the chain.

See it in action

The clearest way to understand the method is to follow it through specific episodes. Each link below opens a podcast page where the research framework, fact bucketing, and editorial choices are visible in the narration script and source list:

Fleetwood Mac — Rumours (1977) — archetypal case for the studio-accident lens (master tape gap, Sausalito tensions).
Michael Jackson — Thriller (1982) — multi-source reconstruction of the Westlake sessions (Quincy Jones, Bruce Swedien, Eddie Van Halen).
AC/DC — Back in Black (1980) — how editorial constraints handle a tribute album recorded just months after Bon Scott’s death.
The Eagles — Hotel California (1976) — sourcing decisions when interpretive lore (the lyrics) overshadows recording facts.
Pink Floyd — The Wall (1979) — long-form narrative episode with multiple studios, producers, and a documented Roger Waters / David Gilmour conflict.

Frequently asked questions

Is VinylCast only AI?

No. Automated tools help retrieve and structure research, but human editors write or heavily rewrite the final script and approve it before text-to-speech audio is produced.

Why anchor each episode to a Discogs master release?

The master release gives a single canonical identity for credits, track order, and which pressing family the story refers to, reducing drift or mix-ups between editions.

Do you publish raw AI drafts as episodes?

No. Final listeners always get human-reviewed wording; we do not ship unchecked model output as a finished show.

Why do you use synthetic voice?

Text-to-speech scales narration across timbres and languages with predictable production, and we disclose synthetic speech so provenance stays clear.

How do you handle compilations or box sets?

We shift emphasis to remastering, unreleased tracks, or release strategy instead of implying one fictional studio session that did not exist.

What should I do if I spot a factual mistake?

Send feedback through the in-product channel tied to your session so we can trace the episode version and correct the content chain.