6 min read
PDF metadata best practices — title, author, keywords
By ScoutMyTool Editorial Team · Last updated: 2026-05-20
PDF metadata is the cheapest SEO win available — set six fields once during export and your document shows up in search results with a real title and description instead of "Microsoft Word — Document1". Most authors set metadata accidentally (whatever the export tool defaults to) and the cost shows up in lower click-through, weaker DMS indexing, and PDFs that look unprofessional everywhere they surface. This article maps the six fields that matter, what each is for, and the workflow for setting them correctly on both new and existing PDFs.
Six metadata fields and what each is for
| Field | Purpose | Best practice |
|---|---|---|
| Title | Displayed in SERP titles, browser tabs, DMS indexes | Match H1 of document; under 60 chars; include primary keyword |
| Author | Displayed in document properties; SERP "by" attribution | Real organisation or individual name; never "Microsoft Word" / "Adobe InDesign" |
| Subject | Used as SERP description when no on-page summary detected | 120–160 chars; one-paragraph summary; primary keyword once naturally |
| Keywords | Indexed by Google (lightly); used by some DMS for tagging | 3–7 phrases comma-separated; relevant to body content; no keyword stuffing |
| Producer | Identifies the software used to create the PDF | Usually auto-set; leave alone unless removing for privacy |
| Creation date | Used by Google for "freshness" signals; DMS sorting | Set explicitly on re-publish so Google sees a fresh date |
Step by step — set metadata correctly
- Set in the source document before export. Word: File → Info → Properties. Docs: File → Document properties. Set Title, Author, Subject, Keywords.
- Export to PDF with "Document properties" / "Document metadata" preservation enabled. Most exporters preserve by default.
- Verify in the exported PDF. Acrobat: File → Properties. Confirm each field reads what you intended.
- If wrong, edit without re-exporting using PDF Metadata Editor — change the fields, save, no re-export needed.
- Publish and submit to Google Search Console for indexing. Confirm the SERP title and description match what you set after first re-crawl.
XMP metadata — the deeper layer beneath the basic fields
Beyond the six basic fields (Title, Author, Subject, Keywords, Producer, Creation date) PDFs can carry XMP (eXtensible Metadata Platform) data — a richer XML-based metadata stream that includes copyright, licensing, language, version, geolocation, and arbitrary custom fields. Adobe products write XMP by default; most other authoring tools do not unless explicitly enabled. XMP is what tools like Adobe Bridge, Lightroom, and DAM (digital asset management) systems use to organise large file libraries. For solo and small-team use, the basic six fields are usually enough; for organisations managing thousands of PDF assets, XMP is worth setting up.
ScoutMyTool PDF Metadata Editor reads and writes both the basic Document Information Dictionary fields and XMP. For most workflows you only edit the basic fields; XMP is exposed for power users. The two layers should agree (Title in both sets to the same string) — most tools sync them automatically, but if you edit one and not the other, you can end up with a PDF reporting different titles in different contexts.
Removing metadata for privacy
Sometimes the right action is to strip metadata rather than set it. PDFs created by anonymous authors, redacted court filings, leaked-document analyses, and sanitised distribution copies should not carry author names, creation tools, or timestamps. Run ScoutMyTool PDF Metadata Editor → "Strip all" or use `exiftool -all= file.pdf` (command line) to remove every metadata field. The stripped PDF carries no identifying breadcrumbs for forensic analysis. For high-stakes privacy contexts, also flatten the PDF and re-export through a clean tool to remove any residual XMP that the strip might miss.
Note that "Producer" metadata (the tool that created the PDF) is often the forensic giveaway. A PDF labelled "Microsoft Word 365" tells a recipient you authored on Word; "Adobe InDesign 2025" reveals you used Adobe. For anonymous distribution, either strip Producer or set it to a generic value like "PDF". Most recipients never look at Producer, but in adversarial contexts (whistleblowing, anonymous leaks, redacted public-records releases) Producer can be the breadcrumb that re-identifies the source.
Related reading
- PDF best practices for SEO: the broader on-PDF SEO checklist.
- PDF metadata editor: tool for editing metadata on existing PDFs.
- Searchable PDF: text-layer is what makes metadata indexing actually useful.
- PDF naming conventions: filenames and metadata should align.
- Make a PDF look professional: metadata is part of the polish.
FAQ
- What is the single most important PDF metadata field?
- The Title field. Google displays it as the search result title; PDF viewers show it in the title bar and tab; document-management systems index it as the document name; bookmarks and reading lists use it as the label. A PDF with Title="Document1.pdf" looks unprofessional everywhere it surfaces. Set the Title explicitly to a meaningful sentence matching the document's H1 — every authoring tool exposes this in its File → Properties → Title field, and ScoutMyTool PDF Metadata Editor lets you set it on an existing PDF without re-exporting.
- How do I see what metadata a PDF currently has?
- Three ways. Acrobat Reader: File → Properties → Description tab. Apple Preview: Tools → Show Inspector → General tab. Command line: `pdfinfo file.pdf` (from poppler-utils, free). All show Title, Author, Subject, Keywords, Producer, Creator, Creation Date, Modified Date. Cross-check against the file's intended purpose — defaults like "Microsoft Word - Document1" signal an unprofessional PDF and likely hurt SEO indexing.
- Does Google really weight PDF metadata for ranking?
- Yes, but selectively. Title is heavily weighted — it appears as the SERP title and likely influences click-through-rate-driven re-ranking. Subject (used as SERP description) influences click-through. Author appears in some "by" attributions. Keywords are weakly weighted today (the field was abused in the late 1990s and Google heavily discounts it now); set them correctly anyway for DMS indexing. Creation date and modification date feed Google's freshness signals — keep them honest (set on re-publish, not artificially bumped on every minor edit).
- Can I edit PDF metadata without re-exporting from the source?
- Yes. Acrobat Pro: File → Properties → Description → edit. ScoutMyTool PDF Metadata Editor: drop the PDF, edit fields, save. Command line: `exiftool -Title="My Title" file.pdf`. All three modify the PDF's Document Information Dictionary (per ISO 32000-1) without changing page content. Useful when you receive a PDF with bad metadata and cannot re-export from source.
- What should I put in the Keywords field?
- Three to seven relevant phrases, comma-separated, matching the content of the document. For a mortgage-calculator guide: "mortgage calculator, loan calculator, APR, amortization, mortgage payment". Avoid keyword stuffing (15+ terms, repetitive variants) — Google discounts it and DMS noise hurts more than it helps. Match phrases to what users would actually search for; do not invent terms. For internal DMS use, keywords double as a tagging system — coordinate with your team on a controlled vocabulary if multiple people contribute documents.
Citations
- ISO 32000-1:2008 — "Document management — Portable document format" — Document Information Dictionary specification.
- Adobe XMP Specification — extensible metadata platform for PDF and other formats.
- Google Search Central — PDF indexing documentation.
- ExifTool documentation — open-source metadata editor for PDF and many other formats.
Edit PDF metadata in your browser
ScoutMyTool PDF Metadata Editor runs client-side. Set the fields, save, and re-publish — no upload, no account.
Open Metadata Editor →