How to scan books to PDF for personal use

Destructive vs non-destructive methods, taming spine curvature, splitting two-page spreads, and OCR to make it searchable — the full book-archival workflow, plus the copyright line.

7 min read

How to scan books to PDF for personal use

By ScoutMyTool Editorial Team · Last updated: 2026-05-21

I started scanning my own books after a water leak nearly took out a shelf of out-of-print favourites — I wanted personal digital backups I could search and read on a tablet. Scanning a book, I quickly learned, is not the same as scanning a stack of loose documents: the binding fights you, the pages curve into the spine, and an open book photographs as an awkward two-page spread. There is a real craft to doing it well. This guide covers the choices that matter — destructive versus non-destructive methods, handling the spine, splitting spreads, and OCR to make the result searchable — for building a personal library of books you own. One note up front: this is about your own books for your own use, not reproducing or sharing copyrighted works.

Choosing a method

MethodThe bookSpeedQuality notes
Phone + scanning appKept intactSlow, page by pageGood with care; watch curvature & lighting
Flatbed scannerKept intactSlow; press pages flatHigh, but gutter shadow near the spine
Overhead / book scannerKept intactFaster; designed for booksHigh; handles curved pages
Guillotine + sheet-feedDestroyed (spine cut off)Fastest by farExcellent, flat pages — but the book is gone

Step by step — scan a book to a searchable PDF

  1. Decide destructive vs non-destructive. Only cut the binding for cheap, replaceable books where speed matters; otherwise keep the book intact and scan with a flatbed, book scanner, or phone.
  2. Light evenly and capture flat. Press pages as flat as you can and light them evenly to avoid a shadow in the gutter; scan one page at a time to reduce curvature distortion.
  3. Split two-page spreads. If you captured open-book spreads, split each down the middle so the PDF is one page per page.
  4. Deskew, crop, and order. Straighten and crop each page to the content, and confirm the pages are in the right order and orientation.
  5. Run OCR. Add a searchable text layer so you can search and copy from the book; spot-check the recognition on a few pages.
  6. Compress and bookmark. Compress the image-heavy result so it opens fast, add chapter bookmarks for navigation, and keep a backup of your master scans.

The honest trade-offs

Two tensions run through book scanning. The first is book-versus-time: cutting the binding gives you the fastest, flattest scan but destroys the object, so it is right only for replaceable volumes and wrong for anything you value as a physical book. The second is speed-versus-quality on intact books: rushing produces curved, shadowed, crooked pages, while a careful single-page pass with even lighting and a cleanup step produces an archive you will actually enjoy reading. And underneath both sits the copyright line — scan what you own, for yourself, and do not distribute it. Get those judgements right and the payoff is real: a searchable, backed-up personal library that survives leaks, moves, and the slow disappearance of out-of-print titles, readable on any device you own.

Related reading

FAQ

Is it legal to scan a book to PDF?
For your own personal use, of a book you own, it is generally treated very differently from copying and sharing — but copyright still applies and the rules vary by country, so this is the one part worth being careful about. The widely accepted, low-risk case is format-shifting a book you legally own for your own private use: making a personal digital backup or a searchable copy for yourself. What crosses the line is distributing the scan — uploading it, sharing it, or making copies for other people — which is copyright infringement regardless of how you made the file. This article is about scanning your own books for your own library; it is not advice to reproduce or share copyrighted works. When in doubt about a specific situation, check your local copyright law or ask the rights holder.
What is the difference between destructive and non-destructive scanning?
Destructive scanning means cutting the binding off the book — typically with a guillotine or trimmer — so the pages become a loose stack you can run through a fast automatic sheet-feed scanner. It is by far the quickest and gives beautifully flat, consistent pages, but it physically destroys the book, so it is only for cheap or replaceable volumes you are willing to sacrifice. Non-destructive scanning keeps the book intact, using a flatbed, an overhead/book scanner, or a phone to capture pages one or two at a time. It is much slower and has to contend with the curve of the pages near the spine, but the book survives. The choice is simply whether the physical book or your time is more valuable for that particular title.
How do I deal with the curved pages and shadow near the spine?
The bend near the binding (the gutter) is the classic book-scanning problem, and there are three lines of defence. First, capture: press the page as flat as you reasonably can on a flatbed, or use an overhead/book scanner designed to photograph curved pages, and light the page evenly to avoid a dark shadow falling into the gutter. Second, technique: scanning one page at a time rather than a two-page spread keeps each page flatter and reduces distortion. Third, post-processing: deskew and crop each page afterwards, and some tools can flatten residual curvature. You will rarely get a cut-and-sheet-fed level of flatness from an intact book, but careful lighting plus single-page capture plus a cleanup pass gets you a thoroughly readable result.
Should I scan two-page spreads or single pages?
Capture however is fastest for your method, but finish as single pages. Photographing an open book gives you a two-page spread in one shot, which is quicker, but a PDF that is a series of double-page spreads is awkward to read on most screens and e-readers, and the two pages share the worst of the gutter curve. The clean approach is to split each spread down the middle into two single pages during processing, so the final PDF is one page per page, the way the book reads. If your method captures single pages natively (a flatbed page at a time), you skip this step. Either way, aim for a final document of single, upright, correctly-ordered pages.
How do I make the scanned book searchable?
Run OCR over it, because a raw scan is just images of pages with no actual text — you cannot search it, select from it, or have it read aloud until a text layer is added. Optical character recognition analyses the page images and builds an invisible, selectable text layer behind them, turning your stack of scans into a document you can search by content and copy from, while keeping the original page appearance. For a personal library this is the step that makes the archive genuinely useful rather than just a pile of pictures. Check the OCR output on a few pages, since recognition is not perfect on poor scans or unusual fonts, but even imperfect OCR transforms a scanned book from un-findable to searchable.
Is it safe to process my scans with an online tool?
Prefer tools that work on your own device for a personal library. Your scans may be of books tied to your identity or interests, and many online PDF tools upload files to a third-party server to OCR, compress, or combine them. Client-side (in-browser) tools do that work locally so your files never leave your computer — ScoutMyTool’s PDF tools work this way, which suits building a private archive. Confirm a tool is client-side before uploading, or use offline software, and keep your master scans backed up somewhere you control. A personal library is exactly the kind of collection you do not want sitting on someone else’s servers.

Citations

  1. Wikipedia — Book scanning (methods, destructive vs non-destructive)
  2. Wikipedia — Image scanner (flatbed and sheet-feed capture)
  3. Wikipedia — Optical character recognition (adding a searchable text layer)
  4. Wikipedia — PDF (the archival container)

Finish your scanned book — in your browser

Make your scans searchable, crop the pages, and compress the result with ScoutMyTool — free, no signup, and client-side so your personal library never leaves your computer.

Open the PDF tools →