Audiobooks are amazing. Commute, gym, cooking, falling asleep. Basically any activity where your eyes are busy but your ears are free becomes an opportunity to consume a book. The problem? Audiobook subscriptions cost money. And not every book has an audiobook version. That self published novel your friend wrote? No audiobook. That textbook from your graduate program? Definitely no audiobook. That 80 page PDF report your boss wants you to "review by Monday"? You guessed it.

But here's the thing. You can make your own audiobook from any text, and it costs exactly zero dollars. The voices sound genuinely good now. Not the robotic nightmare fuel from a decade ago. Actual, natural sounding human speech that you can listen to for hours without wanting to throw your headphones into the ocean.

Here's the full process, from raw text to finished audiobook.

What You'll Need

The beautiful thing about this method is that you barely need anything:

  • Your text: A book, document, PDF, article, or any written content you want as audio. Could be a novel, a textbook, study notes, or that 200 page report.
  • FreeTTS: For converting text to speech. Free, no signup, 400+ voices. You'll be using this a lot.
  • Audacity (optional but recommended): A free audio editor for combining your audio files into one continuous audiobook. Download from audacityteam.org. Free, open source, works on Windows, Mac, and Linux.
  • A text editor: Notepad, Google Docs, whatever. For cleaning up and splitting your text into chunks.

That's it. Total cost: still zero.

Step 1: Prepare Your Text

This is the step most people skip, and it's the reason their audiobooks sound choppy. Text that was written to be read silently doesn't always work perfectly for audio. Spending 10 to 15 minutes cleaning up your text makes a huge difference in the final result.

Extract text from PDFs

If your source is a PDF, you need to get the text out of it. For most PDFs, you can simply open the file, select all text (Ctrl+A or Cmd+A), copy it, and paste it into a text editor. This works for text based PDFs.

If it's a scanned PDF (basically images of pages), you'll need OCR (optical character recognition) to extract the text. Google Drive can do this for free: upload the PDF to Google Drive, right click, open with Google Docs, and it'll convert the scanned text into editable text. It's not perfect, so you'll want to proofread the output.

Clean up the text

Once you have your text in a plain text format, go through it and fix these common issues:

  • Remove headers and footers: Page numbers, chapter headers repeated on every page, and running titles. These sound bizarre when read aloud. "Page 47. Chapter 3 continued. The protagonist walked into the room." Not great.
  • Fix line breaks: PDFs often add line breaks in the middle of sentences because that's where the text wraps on the page. Remove these so each paragraph is one continuous block of text.
  • Handle footnotes: Decide whether you want footnotes read inline, moved to the end of the chapter, or removed entirely. Having the TTS voice randomly say "See footnote 23" breaks the listening flow.
  • Spell out abbreviations: "Dr." might be read as "D R" instead of "doctor." Spell out anything that might trip up the TTS engine. "St." could be "saint" or "street" depending on context.
  • Add chapter markers: Put something like "Chapter 1. [Title]" at the beginning of each chapter. This makes it easy to split your audiobook into chapters later.
Pro Tip: Add a brief pause between chapters by typing "..." (ellipsis) on its own line between chapter breaks. The TTS engine will create a natural pause at that point, giving listeners a moment to breathe between sections.

Step 2: Split Into Chunks

FreeTTS handles up to 5,000 characters per generation. A typical book chapter is anywhere from 3,000 to 15,000 words (roughly 15,000 to 75,000 characters). So you'll need to split each chapter into smaller pieces.

Here's the smart way to do it:

  1. Work chapter by chapter. Don't try to process the whole book at once.
  2. Split at natural paragraph breaks, not in the middle of sentences. This ensures each audio segment starts and ends cleanly.
  3. Keep each chunk around 4,000 to 4,500 characters (leaving some buffer under the 5,000 limit).
  4. Number your chunks: Chapter1_Part1, Chapter1_Part2, etc. You'll thank yourself later when you're combining 50 audio files.

For a typical 300 page book, you're looking at roughly 60 to 80 chunks. Sounds like a lot, but the actual generation is fast. Each chunk takes about 5 seconds to process. The real time investment is in the text preparation.

Step 3: Choose Your Voice (And Stick With It)

This decision matters more than you might think. You're going to be listening to this voice for hours. Pick one you actually enjoy.

Go to FreeTTS and test a few voices with a sample paragraph from your book. Listen to each one for at least 30 seconds. Some voices that sound great for a single sentence get annoying after 10 minutes. Others that seem unremarkable at first actually wear the best over long periods.

For audiobooks specifically, I recommend:

  • Fiction: Pick a voice with some warmth and expression. Jenny and Aria work well for female narration, Guy and Davis for male. The slight expressiveness in these voices helps with dialogue and emotional scenes.
  • Non-fiction: A clear, slightly formal voice works best. You want authority without boredom. Sara and Christopher are solid choices for this.
  • Textbooks and technical content: Go for maximum clarity. Neutral accent, standard speed. Jenny or Guy at 0.9x speed gives you clear, unhurried delivery.
  • Children's books: A slightly higher pitched, expressive voice with normal to slightly slower speed.

Whatever you choose, use the same voice for the entire book. Switching voices between chapters is disorienting. It's like having a different narrator take over every 20 pages. Unless you're doing a dialogue heavy book with intentional voice changes, keep it consistent.

Step 4: Generate Chapter by Chapter

Now the actual conversion. Open FreeTTS and start processing your chunks:

  1. Paste Chunk 1 into the text area
  2. Make sure your voice, speed, and pitch settings are correct
  3. Click Generate Speech
  4. Listen to a few seconds to verify it sounds right
  5. Click Download MP3
  6. Save with your naming convention (Chapter01_Part01.mp3, etc.)
  7. Repeat for every chunk

Each generation takes about 3 to 8 seconds depending on text length. For a full book of 70 chunks, you're looking at roughly 30 to 45 minutes of generation time. Not bad for a complete audiobook that would otherwise cost $15 to $30 on Audible.

Pro Tip: Don't try to marathon all 70 chunks in one sitting. Do a chapter or two at a time. This gives you a chance to listen back, catch any issues, and adjust your text if something sounds off. It's much easier to fix problems chapter by chapter than to discover them after processing the entire book.

Step 5: Combine Into a Complete Audiobook

You've got a folder full of MP3 files. Now it's time to stitch them together. This is where Audacity (free audio editor) comes in.

  1. Open Audacity
  2. Go to File > Import > Audio
  3. Select all the MP3 files for Chapter 1 (in order)
  4. They'll appear as separate tracks. Select the second track, then Edit > Copy. Click at the end of the first track, then Edit > Paste. Repeat until all parts are combined into one track.
  5. Add a 2 second silence between chapters (Generate > Silence > 2 seconds)
  6. Repeat for each chapter
  7. Once the whole book is assembled, use Effect > Normalize to even out the volume
  8. Export as MP3 (File > Export > Export as MP3)

For a simpler approach, you can also just export each chapter as its own MP3 file. Many audiobook listeners prefer this because it lets them easily navigate between chapters. Just like buying a real audiobook with separate tracks per chapter.

Alternative: Use a command line tool

If you're comfortable with the command line, you can combine MP3 files even faster using FFmpeg (free):

Create a text file listing all your MP3 files in order, then run a single command to concatenate them. Takes about 2 seconds regardless of how many files you have. But this is optional. Audacity works perfectly fine for most people.

Step 6: Add Chapter Markers (Optional)

If you exported your audiobook as a single MP3 file and want chapter navigation, you can add chapter markers using free tools. MP3 chapter marker tools let you place timestamps in the file so that podcast apps and audiobook players can jump between chapters.

But honestly, the easiest approach is just exporting each chapter as a separate file. Most audiobook players (including Apple Books, Google Play Books, and VLC) handle folders of MP3 files perfectly well.

How Good Does It Actually Sound?

Let me be honest here. A free TTS audiobook is not going to sound identical to a professionally narrated audiobook recorded in a studio with a voice actor who spent 40 hours doing takes and retakes.

But here's the reality check: it sounds way better than you'd expect. The neural voices used by FreeTTS handle natural pacing, appropriate pauses, question intonation, and emphasis surprisingly well. For non-fiction, technical content, and informational books, the quality is genuinely excellent. Most listeners won't think twice about it.

For fiction with heavy dialogue and emotional scenes, TTS handles it well enough to be enjoyable, though you might notice slightly less dramatic delivery than a human narrator would give. The voice won't sob during the sad parts or whisper during the tense scenes. It reads everything with a consistent, pleasant delivery. For some listeners, that's actually preferable to over acting.

What About Copyright?

Important topic. You can legally create an audiobook for personal use from any text you own or have access to. Reading a book you purchased out loud (even if the "reader" is a computer) is perfectly legal for personal use.

What you cannot do is create a TTS audiobook of a copyrighted work and distribute or sell it. That's the same as scanning a book and uploading the PDF. The text belongs to the author and publisher.

Where this gets straightforward:

  • Your own writing: You wrote it, you own it. Convert it to audio, distribute it, sell it. All good.
  • Public domain works: Books whose copyright has expired (generally published before 1928 in the US). Project Gutenberg has thousands of these. Convert away.
  • Creative Commons content: Text published under CC licenses that allow derivative works. Check the specific license terms.
  • Personal use: Converting a textbook you purchased into audio for your own studying. Legal for personal use.
  • Work documents: Your company's reports, manuals, training materials. As long as your company is okay with it.

Real World Use Cases

Here are some practical ways people are using this method right now:

  • Students: Converting textbook chapters to audio for studying during commutes. Instead of staring at dense paragraphs, they listen and re-listen until the material sticks.
  • Authors: Self published writers creating audiobook versions of their own books without paying $2,000 to $5,000 for professional narration. Some even sell the TTS versions at a lower price point alongside the human narrated version.
  • Researchers: Turning long academic papers into audio to absorb while walking, exercising, or doing other tasks. Research papers are notoriously difficult to read on screen for extended periods.
  • Language learners: Converting foreign language texts to audio for immersion practice. Hearing the correct pronunciation while following along with the text accelerates learning significantly.
  • Visually impaired users: Creating audio versions of documents, manuals, and books that don't have official audiobook releases. This isn't a convenience for these users. It's accessibility.

Final Thoughts

Making an audiobook from text used to require either a professional voice actor or accepting genuinely terrible robotic audio. Neither option was accessible to most people. The voice actors cost thousands. The robot voices made you want to never listen to anything ever again.

That middle ground didn't really exist until neural TTS got good. And now that it's good AND free, there's honestly no reason not to convert that stack of books, papers, and documents you've been meaning to read into audio you can listen to anywhere.

Will it replace professionally narrated audiobooks? No. Will it let you listen to literally any text as a decent quality audiobook for free? Absolutely. And for most people, that's more than enough.

Start Making Your Audiobook

400+ neural voices, free MP3 downloads, no signup. Paste your first chapter and go.

Open FreeTTS