How to Remove Metadata from a PDF
PDF files silently store your name, software, GPS coordinates, and edit history. Here's what's inside — and how to strip it out before you share the file.
What is PDF metadata?
Every PDF contains two layers of information: the visible content you can read, and a hidden metadata layer stored in the document's header. Most people share PDFs without ever checking this layer.
Metadata is stored in two places inside a PDF file:
- DocInfo dictionary — author name, title, subject, keywords, and the application used to create the file, plus creation and last-modified timestamps.
- XMP packet — an extended XML block that can include GPS coordinates (when the source document contained mobile photos), camera settings, company and department names, and custom application-specific data.
What metadata does a PDF typically contain?
Open any PDF in Acrobat (File → Properties → Description) and you will commonly find:
- Author — usually your OS username or the name registered in your Office or Acrobat license
- Creator — the application that produced the source document (e.g. "Microsoft Word 16.85", "Adobe InDesign 2024")
- Producer — the PDF rendering engine (e.g. "macOS Version 14.4 Quartz PDFContext", "Adobe PDF Library 23.6")
- Created / Modified — exact timestamps, often down to the second and including timezone
- Title / Subject / Keywords — frequently auto-populated from the document's properties or filename
Some PDFs carry more sensitive data still:
- GPS coordinates via XMP when the original document contained photos shot on a smartphone
- Revision history in Microsoft Office–originated PDFs saved with "track changes" metadata intact
- Printer name and driver version from desktop print-to-PDF workflows
- Company and department fields populated automatically from Active Directory profiles
When does metadata become a privacy risk?
For everyday document sharing it is usually harmless. But in certain contexts it matters:
- Legal proceedings — opposing counsel can inspect metadata to establish authorship timelines or dispute document authenticity.
- Journalism and whistleblowing — metadata can re-identify the source of a leaked document even after the visible content has been carefully anonymized.
- Business proposals and negotiations — revealing edit history or the author's real name can undermine your position.
- Academic blind review — author identity embedded in the file can compromise the anonymity of peer-review submissions.
- GDPR and data minimization — sharing more personal data than necessary — including hidden metadata — may create compliance exposure.
Step-by-step: remove metadata from a PDF with PrivaPDF
Go to PrivaPDF's Secure PDF Converter. All processing runs locally in your browser — nothing is ever uploaded.
Drag your PDF onto the tool or click to browse. You can confirm nothing leaves your device by opening DevTools → Network tab before dropping the file — you will see zero outbound requests.
Run the conversion. The output PDF is rebuilt locally, which strips Author, Creator, Producer, and timestamp fields from the DocInfo dictionary and XMP packet.
Download the output PDF. Open File → Properties in Acrobat, Preview, or any PDF viewer to check that the metadata fields are now empty. On macOS you can also right-click the file → Get Info → More Info to see the embedded fields.
How to verify metadata was removed
The most reliable check is to open the cleaned PDF in a plain text editor (not a PDF viewer). XMP metadata appears as readable XML near the beginning of the file between <?xpacket> tags. If the block is absent or empty, the metadata has been removed. Alternatively, open File → Properties → Description in Acrobat — all fields should be blank.
Frequently asked questions
Does re-saving a PDF in Acrobat strip metadata?
Not automatically. Acrobat's standard Save and Save As preserve all existing metadata. To remove it in Acrobat, use Tools → Redact → Sanitize Document — but this requires Acrobat Pro (paid). Re-exporting through PrivaPDF's converter is a free alternative that achieves the same result.
Can I remove metadata without converting the PDF?
Yes, but it requires a tool that specifically targets the XMP and DocInfo blocks and zeroes them out. Simply re-exporting to a new PDF (as PrivaPDF does locally) is the most cross-platform approach and avoids any upload risk.
Does removing metadata also remove hidden text or redactions?
No. Metadata removal targets the document properties layer only. If a PDF contains "redacted" content that is actually just drawn over with a black rectangle, the underlying text layer still exists and is still searchable. Use a dedicated redaction tool to permanently remove the text layer for sensitive content.
Is there metadata I cannot remove?
Standard DocInfo and XMP fields can all be cleared. However, digital signatures and some watermarks embed identity data in the content layer itself rather than the metadata — those require a different approach (removing the signature or watermark entirely).
Will removing metadata break the PDF?
No. Metadata is purely informational and has no effect on the document's content, structure, or rendering. The cleaned file will look and behave identically to the original.