6 min read
How to extract bookmarks from a PDF as an outline
By ScoutMyTool Editorial Team ยท Last updated: 2026-05-21
Introduction
I needed a table-of-contents page for a 120-page report that already had a perfectly good bookmark panel, and I started typing the section titles out by hand before it hit me: the structure I needed was already in the file โ I just had to extract it. Pulling a PDFโs bookmark outline out as text turns the clickable side panel into data you can reshape: a printed contents page, a document map, a reusable list, or the split points for breaking the file into sections. This guide covers extracting bookmarks as an outline โ the formats you can get, what to do when there are no bookmarks yet, the page-number gotcha, and how to turn the extracted structure into something useful.
What you can do with an extracted outline
| Use case | What you get | Why |
|---|---|---|
| Build a TOC page | Titles + page numbers | Turn the outline into a printed contents page |
| Document map / overview | Indented text outline | See the whole structure at a glance |
| Reuse in another doc | Plain or Markdown list | Seed a new document or a summary |
| Audit / QA structure | Full bookmark tree | Check nesting, naming, broken targets |
| Split by section | One file per bookmark | Break a doc along its outline |
| Feed an index / search | Structured titles + pages | Power a navigation or search feature |
Step by step โ extract a PDF outline
- Confirm the PDF has bookmarks. Open it and look for an outline panel. If there are none, create them first โ see bookmarking sections (auto-from-headings or from a pattern), then extract.
- Extract the outline. Run Extract Bookmarks to pull the titles, nesting, and target pages out as text.
- Choose a format that keeps hierarchy. An indented or Markdown outline preserves parent/child structure; a title+page list suits feeding another tool. A flat list loses the relationships, so prefer a structured form.
- Mind the page numbering. Extracted targets use physical page positions; if the document has unnumbered front matter, reconcile against the printed numbers before you publish a TOC.
- Turn it into a TOC if needed. Use the extracted titles and pages to build a contents page โ see adding a table of contents.
- Reuse or audit. Drop the outline into a new document, or review it for consistent naming and correct nesting across the whole file.
- Split along the outline if useful. Break the document into per-section files with Split by Bookmarks.
Related reading and tools
- Bookmark sections in a PDF: creating the outline in the first place.
- PDF bookmarks explained: how outlines work.
- Add a table of contents: turning an outline into a TOC page.
- PDF for IT teams: navigable runbooks and docs.
- Academic PDF workflows: structuring long documents.
- Extract Bookmarks tool: export the outline in your browser.
- All ScoutMyTool PDF tools: the full toolkit.
FAQ
- What exactly am I extracting โ bookmarks or a table of contents?
- Bookmarks (the PDF spec calls them "outlines") are the clickable navigation tree in the side panel of a reader, each entry pointing to a destination in the document. A table of contents is a printed page of titles and page numbers inside the document body. Extracting bookmarks pulls out that outline structure โ the titles, their nesting levels, and the pages they point to โ as text you can use elsewhere. From that extracted outline you can, among other things, generate a TOC page. So you are extracting the navigation structure; what you do with it (a contents page, a document map, a reusable list) is up to you.
- In what formats can I get the extracted outline?
- Typically as a plain text list, an indented outline that preserves the nesting (parent sections with children beneath), or a structured form like Markdown or a simple data list with title and page-number pairs. The indented or structured forms are the most useful because they keep the hierarchy, which is the whole value of the outline โ a flat list of titles loses the relationship between sections. Pick the format by destination: indented text for a human-readable map, Markdown to drop into docs, or a title+page list to feed a tool that builds a TOC or an index.
- Why would I extract an outline instead of just looking at it?
- Because text you can manipulate is far more useful than a panel you can only click. Extracting lets you generate a printed table-of-contents page from the existing bookmarks, reuse the structure to seed a new document or a summary, audit the outline for consistent naming and correct nesting across a long document, or feed the titles and pages into a search or index feature. It also lets you review the whole structure at once rather than expanding nodes one by one. In short, extraction turns navigation into data you can reshape, check, and reuse.
- What if the PDF has no bookmarks to extract?
- Then there is no outline to pull out โ but you can usually create one first. If the document has real heading styles, generate a bookmark outline automatically from those headings, then extract it. If it has section titles in a consistent pattern, generate bookmarks from the pattern. Once the bookmarks exist, extraction works normally. So "extract bookmarks" on a document with none becomes a two-step job: build the outline, then export it. A document with neither headings nor bookmarks has no structure to extract until you add some.
- Do the extracted page numbers match the printed page numbers?
- Extracted bookmark targets refer to the PDF's physical page positions (the first page of the file is 1), which can differ from the printed page numbers if the document has unnumbered front matter โ a cover and a contents page can offset things by a page or two. When you turn an extracted outline into a TOC, decide which numbering you want the reader to see and adjust accordingly. If your bookmarks were generated from the document's own structure, the relationship is usually consistent, but it is worth a quick check against a couple of known sections.
- Can I split the PDF using its bookmarks after extracting them?
- Yes, and it is a natural follow-on. Once you can see the outline, splitting the document into one file per top-level bookmark turns a big combined PDF into per-section files โ useful for distributing chapters separately or reorganising a document. The bookmarks define the split points, so a clean, well-structured outline produces a clean split. Extraction and splitting are complementary: extract to understand and reuse the structure, split to physically divide the document along it.
- Is it safe to extract an outline from a confidential PDF online?
- The outline itself is just structure, but the document may be sensitive, so prefer a tool that processes files locally. ScoutMyTool extracts bookmarks entirely in your browser tab, so the document never leaves your machine. For anything you would not publish openly, confirm the tool does not upload before using it.
Citations
- Wikipedia โ โPDFโ (ISO 32000), describing document outlines (bookmarks) and destinations. en.wikipedia.org/wiki/PDF
- Wikipedia โ โTable of contents,โ the structure an extracted outline can generate. en.wikipedia.org/wiki/Table_of_contents
- Wikipedia โ โPDF/UAโ (ISO 14289), accessibility expectations for navigable document structure. en.wikipedia.org/wiki/PDF/UA
Turn your PDF's structure into data
Extract a PDFโs bookmark outline as text with ScoutMyToolโs in-browser tool โ the document never leaves your machine, and the structure becomes yours to reuse.
Open Extract Bookmarks โ