How to Redact Sensitive Information from PDFs Properly
Black boxes aren't enough. Here's why most redaction methods fail — and how to actually protect your private data before sharing documents.

Look, I'm just going to say it: most people are terrible at redacting PDFs.
I've seen resumes sent with salaries "hidden" using a black highlighter (you can just select the text underneath). Legal documents with client names covered by a rectangle (right-click, delete annotation, boom — name revealed). Even government agencies have accidentally published documents where you could copy-paste the "redacted" social security numbers.
The problem? People think redaction is just about making text look invisible. But in a PDF, that text is still there — lurking in the file structure, waiting to be discovered.
Here's what you need to know to actually remove sensitive information, not just hide it.
Why Simple Methods Don't Work
Let's start with what doesn't count as redaction:
- Black highlighter or marker tools — These add a layer on top of the text. The text underneath is still selectable, searchable, and copy-able.
- Drawing shapes over text — Same problem. The shape is just an annotation floating above the original content.
- White text on white background — Come on. This one's embarrassing. Just select all and it shows up clear as day.
- Changing text color to match background — Again, selecting text reveals it immediately.
All of these methods leave the underlying data intact. And here's the thing — anyone with basic PDF knowledge (or even just curiosity) can reveal what you tried to hide in about 10 seconds.
In 2020, a major tech company accidentally published a legal filing where they'd "redacted" confidential financial data using black boxes. Journalists just opened the PDF in a text editor and found all the numbers sitting right there in plain text. Oops.
What Real Redaction Actually Means
True redaction means permanently removing the data from the PDF file. Not covering it up. Not hiding it. Deleting it from the file structure entirely.
Think of it like this: if you write something embarrassing in a Word doc and then delete it before saving, that text is gone (mostly — let's not get into document forensics). But if you just change the font color to white, the text is still there. Redaction is the "delete" version — the text needs to disappear from the actual file data.
Proper redaction tools will:
- Remove the text from the PDF content stream
- Replace it with a solid black box (or whatever color you choose)
- Make the change irreversible — once it's redacted, it's gone
- Optionally strip metadata, comments, and hidden data
This is why Adobe Acrobat Pro has a separate "Redact" tool that's different from the regular annotation tools. (Though even Adobe's redaction feature has had security issues in the past, so always test your output.)
The Right Way to Redact a PDF
Here's my recommended process:
Step 1: Make a copy
Never work on your only copy. Seriously. Redaction is permanent, and if you mess up, you'll want that original file.
Step 2: Use a proper redaction tool
You need software that actually removes data, not just covers it. Options include:
- Adobe Acrobat Pro (the paid version — Reader doesn't have redaction)
- Foxit PhantomPDF (has a dedicated redaction feature)
- PDF-XChange Editor (cheaper alternative with redaction)
- Open-source tools like pdftk or qpdf (command-line, but very reliable)
For quick browser-based work, you can use tools like KokoConvert's PDF tools to handle the rest of your workflow — merging, compressing, or converting files after redaction.
Step 3: Mark what needs to be redacted
Don't rush this. Go through the document carefully and mark every instance of sensitive data:
- Names, addresses, phone numbers
- Social security numbers, account numbers, IDs
- Email addresses (if applicable)
- Proprietary information, trade secrets
- Signatures (if required)
Use the "Search & Redact" feature if your tool has it — this lets you find every instance of a specific word or pattern (like SSN format) and redact them all at once. Way faster than hunting manually.
Step 4: Apply the redactions
In most tools, marking areas for redaction is a two-step process: first you mark them (they'll show up as colored boxes), then you apply the redactions. This gives you a chance to review before making it permanent.
Once you click "Apply Redactions," the text is gone forever from that file. Double-check your markings first.
Step 5: Remove hidden data
This is the part most people forget. PDFs can contain:
- Metadata (author name, company, revision history)
- Comments and annotations from other reviewers
- Hidden layers or overlapping text
- Embedded files or attachments
- Form field data
Adobe Acrobat has a "Sanitize Document" feature specifically for this. In other tools, look for "Remove Hidden Information" or "Clean Document" options. Run this after redacting visible text.
Step 6: Test the redacted PDF
Before you send that file out into the world, test it:
- Try to select text in the redacted areas (you shouldn't be able to)
- Use Ctrl+F (or Cmd+F) to search for terms you redacted — they shouldn't be found
- Check the file properties for metadata
- Open the PDF in a different viewer (not just the one you used to redact) to make sure it looks right
If you're really paranoid (or dealing with truly sensitive data), you can even open the PDF in a text editor and search for the redacted terms. They absolutely should not appear anywhere in the raw file data.
The Nuclear Option: Flatten Everything
When you absolutely, positively need to make sure nothing is recoverable, there's one more trick: print to PDF.
After you've redacted everything properly, print the PDF to a new PDF file. This essentially creates a "screenshot" of each page and builds a new document from scratch.
Why does this work? Because the new file only contains the visual representation of the pages — no underlying text, no metadata from the original, no hidden layers. It's like taking a photo of the document.
The downside: the new PDF won't have selectable text anymore (unless you run OCR on it), and the file size might increase. But if you're dealing with classified information or legal documents where even a tiny leak could be catastrophic, this method is bulletproof.
You can use PDF compression tools afterward to reduce the file size if needed.
Special Cases and Gotchas
Scanned documents
If your PDF is a scan (basically just images of pages), redacting is actually simpler in one way — there's no underlying text to worry about. But you need to make sure you're truly covering the pixels, not just adding an annotation. Again, flattening the PDF after redaction is your friend here.
OCR'd scans
Here's where it gets tricky. Some scanned PDFs have been run through OCR (Optical Character Recognition), which creates a hidden text layer for searching. You might redact the visible image but forget about the hidden text layer underneath. Always use tools that handle both layers, or flatten the PDF after redacting.
Partially redacted sentences
When you redact part of a sentence, be aware that context clues can sometimes reveal what was hidden. "The suspect, ████████, was arrested at 3 PM" might be obvious if there's only one person mentioned earlier. This is more of an editorial issue than a technical one, but worth considering.
Images within PDFs
If sensitive info appears in an embedded image (like a screenshot showing an email address), you need to redact the image itself, not just cover it. Some tools let you redact image content; others require you to edit the image separately first.
Don't Forget About Backups
One more thing: after you've carefully redacted a PDF and sent it off, make sure the unredacted version isn't sitting in obvious places like:
- Your email "Sent" folder with the original attached
- Cloud storage with file versioning turned on (Google Drive keeps old versions)
- Your desktop or Downloads folder
- Backup drives or services that auto-sync
The whole point of redaction is controlling who sees what. If the original file is casually accessible in five different places, you haven't really accomplished much.
When to Use Password Protection Instead
Sometimes redaction isn't what you need — you just need to control access. If you want some people to see the full document and others not to see it at all, password protection is the way to go.
But here's the key difference: password-protected PDFs still contain all the data. If someone cracks the password (or if you accidentally send them the password), they see everything. Redaction is for when you want to share a version of the document where certain data is simply gone.
Think of it this way: redaction is a Sharpie on paper. Password protection is a locked filing cabinet. Both have their uses, but they're not interchangeable.
The Bottom Line
Redacting PDFs properly isn't hard, but it does require using the right tools and following the right process. Don't rely on visual tricks or annotation layers. Use actual redaction features that remove data permanently, sanitize hidden content, and always test your output before sharing.
And if you're working with truly sensitive data — financial records, legal filings, medical documents, classified information — seriously consider the "print to PDF" flattening method as a final safeguard. A few extra megabytes of file size is worth sleeping soundly knowing that your redactions actually stick.
Because at the end of the day, the only thing worse than not redacting is thinking you redacted when you didn't.