From Prompt to Playlist: Creating an AI Country Music Artist
The Shifting Landscape of AI Tooling
We're at an interesting inflection point with AI. The tools that were experimental novelties a year ago are rapidly maturing into genuinely capable creative platforms. Image generation has gone from producing nightmare hands to photorealistic output. Language models have evolved from chatbots into programmable agents. Music generation has crossed the uncanny valley from "obviously synthetic" to "wait, that's not a real person?"
I wanted to put all of this to the test - not by dabbling in one tool, but by chaining a whole ecosystem of AI capabilities together to create something end-to-end. The question: could I create a fictional country music artist, complete with a persona, album artwork, music videos, original songs, and actually publish it all on Spotify and Apple Music?
Turns out, yes. And the journey was fascinating.
The Concept: A Country Singer Who Discovers He's AI
Before diving into the tools, I needed a story. And this is where it gets a bit meta.
The fictional artist's narrative arc deliberately mirrors the technology hype curve - but told through the lens of country music. The first album's songs walk through the key stages of the hype cycle as applied to AI, but framed as country ballads and story songs. They set the scene and establish the character: a seemingly ordinary country singer.
From there, the subsequent albums start with traditional country sounds but gradually transition towards electronic music as the artist begins to realise something unsettling - he isn't a man at all. He's not a real country singer. He's AI. The story arc culminates in his full realisation and transformation, effectively becoming the singularity.
The artist is clearly labelled as AI-generated. There's no attempt to deceive anyone - the whole point is the concept itself.
Step 1: Research Agent - Finding the Sound and the Story
The first tool in the chain was an AI research agent. I used it to dig into what genres and sub-genres of music people were actually listening to at the moment, what makes country music resonate with audiences, and what kind of narrative structures work well across a multi-album arc.
The research agent came back with genuinely useful analysis: trending sub-genres within country, common lyrical themes that perform well, and structural suggestions for how to pace a story across multiple albums. It gave me a solid foundation to build the concept on, including the album-by-album story progression and the tonal shift from acoustic country to electronic as the character's identity crisis unfolds.
This was the blueprint. Everything that followed was built on these research outputs.
Step 2: Lyric Generation - A Specialised Language Model Agent
With the story arc and thematic structure mapped out, I needed lyrics. Rather than just prompting a general-purpose language model, I set up a specialised AI agent tuned specifically for lyric generation. This agent understood song structure - verses, choruses, bridges - and could work within the constraints of country music conventions while still being creative.
I fed in the story arc elements from the research phase: the themes for each album, the emotional journey of each song, and where each track sat in the broader narrative. The agent then produced individual sets of lyrics for each song across the albums.
The quality was surprisingly good. Not every output was perfect first time - some needed a few iterations to get the tone right or to avoid clichés that felt too on-the-nose. But the agent's ability to maintain narrative consistency across an entire album while writing in a specific musical style was impressive.
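The per-song brief fed to the agent can be sketched as a structured prompt assembled from the research outputs. This is an illustrative template only - the field names, wording, and example values are hypothetical, not the actual prompts used:

```python
# Sketch of a per-song brief for the lyric agent. The template and field
# names are illustrative, not the real prompts used in the project.

def build_lyric_prompt(album_theme, song_emotion, narrative_position,
                       structure="verse-chorus-verse-chorus-bridge-chorus"):
    """Assemble a lyric-generation prompt from the research-phase story arc."""
    return (
        f"Write country song lyrics with the structure: {structure}.\n"
        f"Album theme: {album_theme}\n"
        f"Emotional journey of this song: {song_emotion}\n"
        f"Place in the overall narrative: {narrative_position}\n"
        "Stay within country music conventions; avoid on-the-nose cliches."
    )

prompt = build_lyric_prompt(
    album_theme="a seemingly ordinary small-town country singer",
    song_emotion="quiet unease beneath everyday routine",
    narrative_position="album 1, track 3 - first hints that something is off",
)
```

Iterating then becomes a matter of adjusting the brief rather than rewriting the whole prompt each time.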
Step 3: Visual Identity - Image Generation for Persona and Artwork
Next up: what does this artist actually look like?
Using the research outputs around style and aesthetic, I turned to AI image generation tools to create a visual persona for the artist. This meant generating a consistent character - someone who looks like a believable country music artist - and then using that same style language to produce album cover artwork.
Getting consistency across multiple generated images was one of the trickier parts of the process. The persona needed to be recognisable across the album covers while each cover also needed its own distinct feel that matched the album's themes - from rustic and traditional for the early albums to increasingly digital and glitchy as the story progresses.
The results were striking. The progression from warm, earthy country aesthetics to cold, synthetic visuals told the story visually even before you heard a single note.
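One tactic that helps with cross-image consistency is a fixed "character sheet" description reused verbatim in every cover prompt, with only per-album style modifiers changing. A minimal sketch, with entirely hypothetical prompt wording:

```python
# Illustrative only: a fixed persona description shared across every cover
# prompt, so each image varies only in the per-album styling.

PERSONA = (
    "portrait of a country music artist, mid-30s, weathered denim jacket, "
    "acoustic guitar, same face and build in every image"
)

ALBUM_STYLES = {
    1: "warm golden-hour light, rustic barn backdrop, film grain",
    2: "dusk tones, neon signage creeping into the frame",
    3: "cold synthetic light, digital glitch artifacts, chromatic aberration",
}

def cover_prompt(album_number):
    """Combine the shared persona with the album-specific aesthetic."""
    return f"{PERSONA}, {ALBUM_STYLES[album_number]}"
```

The shared prefix anchors the character; the modifiers carry the rustic-to-glitchy progression.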
Step 4: Video Generation - Bringing the Artist to Life
With a visual identity established, I wanted to push further. Could I create a music video?
Using AI video generation tools, I produced footage of the artist playing guitar, consistent with the persona established during image generation. This was the most experimental part of the process - video generation is still the least mature of the AI creative tools - but the output was compelling enough to serve as promotional material and visual content for the release.
It's not going to fool anyone into thinking it's real concert footage, but as a creative piece that accompanies an explicitly AI-generated artist, it works.
Step 5: Music Generation - Creating the Voice and the Songs
This was the big one. AI music generation tools have come a long way, and I used them to:
- Create a distinctive voice for the artist - a consistent vocal tone and style that would carry across all the tracks
- Generate the actual songs by combining the lyrics from the language model agent with the voice and musical style
This was the most iterative part of the process. Each song went through multiple rounds of generation and refinement. The tools let me adjust vocal delivery, instrumentation, tempo, and arrangement. Getting a country song to feel right - with the twang, the storytelling cadence, the instrumental choices - required patience and a lot of tweaking.
The transition across albums from pure country to electronic was particularly interesting to engineer. It wasn't just about changing instruments - it was about gradually shifting the entire sonic palette while keeping the voice recognisable.
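One way to think about that shift is as interpolating an "electronic weight" across the album sequence - early albums almost entirely acoustic, later ones almost entirely synthetic. A toy model, with purely illustrative numbers:

```python
# A toy model of the album-by-album sonic shift: linearly blend the share
# of acoustic vs electronic elements across the sequence of albums.

def palette_blend(album_index, total_albums):
    """Return (acoustic, electronic) weights for a 0-indexed album."""
    if total_albums < 2:
        return (1.0, 0.0)
    electronic = album_index / (total_albums - 1)
    return (round(1 - electronic, 2), round(electronic, 2))
```

In practice the shift was driven by iterative prompting rather than an explicit formula, but the shape of the curve is the same: each album nudges the palette without ever resetting the voice.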
Step 6: Publishing - Getting onto Spotify and Apple Music
The final piece of the puzzle was distribution. Creating music is one thing - actually getting it onto the major streaming platforms is another.
I had to figure out the mechanics of music distribution: working with a digital distribution service, preparing the tracks in the right format, uploading the album artwork, filling in metadata, and navigating the submission process for Spotify and Apple Music.
It's not complicated once you understand the pipeline, but it's a world I'd never had any reason to engage with before. The music is now live and available for anyone to stream.
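For a sense of what "filling in metadata" involves, here is a hedged sketch of the per-track fields a distributor typically asks for. Exact field names and requirements vary by distributor, and all values below are placeholders, not the real release:

```python
# Typical per-track metadata a digital distributor collects before
# submission to Spotify / Apple Music. Fields vary by distributor;
# every value here is a placeholder.

track_metadata = {
    "title": "Example Song Title",
    "artist": "Example AI Artist",
    "album": "Example Album",
    "genre": "Country",
    "release_date": "2024-01-01",
    "explicit": False,
    "isrc": None,  # many distributors assign ISRCs for you
    "audio_format": "WAV, 16-bit or 24-bit, 44.1 kHz stereo",
}

REQUIRED = ["title", "artist", "album", "genre", "release_date"]

def validate(meta):
    """Return the fields a submission form would reject if missing."""
    return [k for k in REQUIRED if not meta.get(k)]
```

Most of the friction is simply knowing which fields are mandatory before you hit submit.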
The Numbers: Not Exactly a Money Spinner
Let's talk economics, because this is where it gets amusing.
I spent approximately $140 on AI tooling and generation credits across the entire project. The time investment was around 40 hours spread across a couple of months - researching the tools, learning the workflows, iterating on outputs, and figuring out the publishing pipeline.
Music streaming revenue works out to just under $1 per thousand streams.
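At that payout rate, the break-even arithmetic against the project's costs is straightforward:

```python
# Break-even streams at roughly $1 per 1,000 streams, against the $140
# tooling spend. Time invested is ignored here; any hourly rate at all
# makes the gap far worse.

cost = 140.00                    # tooling and generation credits, USD
payout_per_stream = 1.00 / 1000  # ~ $0.001 per stream

break_even_streams = cost / payout_per_stream
print(int(break_even_streams))   # 140000 streams just to cover tooling
```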
So no, this isn't a viable business model. You'd need an enormous volume of streams just to recoup the modest tooling costs, let alone put any value on the time invested. The economics of music streaming overwhelmingly favour artists who can build genuine fan relationships, perform live, and create the kind of authentic connection that drives repeated listening.
And that's an important observation. The tools can produce technically competent output across every part of the creative pipeline. But authenticity matters. The ability to do live gigs, to generate rapport with fans, to have a genuine story - these are things that AI-generated artists fundamentally lack. The music industry isn't just about the music; it's about the human connection around it.
What I Actually Learned
This project wasn't about making money or launching a music career. It was a research exercise - an excuse to chain together every major category of AI creative tool available right now and see how they work in practice.
Here's what stood out:
- The tools are genuinely capable. A year ago, most of this would have been impossible or laughably bad. The quality ceiling has risen dramatically.
- The real skill is orchestration. No single tool does everything. The value is in knowing how to chain research, text generation, image generation, video generation, and music generation together into a coherent pipeline.
- Iteration is everything. First outputs are rarely good enough. The difference between mediocre and decent AI output is usually three or four rounds of refinement.
- The boring bits are still boring. Figuring out music distribution, metadata formatting, and platform submission requirements was the least glamorous part - and entirely manual.
- Cost is surprisingly low. $140 to produce multiple albums with artwork, video, and distribution is remarkable, regardless of the quality debate.
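The orchestration point above can be sketched as a simple pipeline: each stage's output feeds the next, and each stage wraps a different tool. The functions below are placeholders standing in for the tools, not real APIs:

```python
# A skeletal view of the pipeline described in this post. Each function
# stands in for a different AI tool; none of these are real APIs.

def research(brief):             # step 1: research agent
    return {"themes": ["the hype cycle as country ballads"],
            "arc": "country -> electronic"}

def write_lyrics(research_out):  # step 2: lyric agent
    return [f"lyrics for: {t}" for t in research_out["themes"]]

def make_artwork(research_out):  # step 3: image generation
    return "album cover matching: " + research_out["arc"]

def make_songs(lyrics):          # step 5: music generation
    return [f"song({text})" for text in lyrics]

def run_pipeline(brief):
    r = research(brief)
    lyrics = write_lyrics(r)
    return {"artwork": make_artwork(r), "songs": make_songs(lyrics)}

result = run_pipeline("fictional AI country artist")
```

The value isn't in any one function - it's in the plumbing between them, which is exactly where the manual iteration happened.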
The AI landscape is shifting fast. The tools available today for creative work are dramatically more capable than what existed even six months ago. Whether that's exciting or unsettling probably depends on your perspective - but either way, it's worth understanding what's now possible.
And if you're curious, the music is out there. Just look for the country singer who slowly realises he's not real.