Learn

284 articlesCategory: All
Metadata

How to Recheck Metadata After Removal

Metadata can remain even when you think you have removed it.

In images, PDFs, Office documents, video, and audio files, the author name, capture time, GPS, edit history, application name, file name, and embedded information may be stored in multiple places.

It is dangerous to assume that a file is safe just because you pressed a "remove" button once.

For anonymity, what matters is not that you performed removal, but that you check, within the range you can verify after removal, whether anything remains.

This article organizes what you should recheck after removing metadata.

Why Rechecking Is Necessary

Metadata removal tools and apps are useful.

However, they cannot remove metadata from every format, every tag, and every kind of embedded information in the same way.

ReasonExplanation
Storage locations differ by formatJPEG, PDF, Office, and video files have different internal structures
Only some information is removedGPS may be removed while the creation application name remains
Resaving can add new informationThe editing software name or update time may be added
Visible information is not removedBackgrounds, reflections, text, and voices must be checked separately
Tool results are easy to overtrustYou may publish while thinking removal was complete

Rechecking is not cleanup after the removal work.

It is part of the removal work itself.

Compare Before and After Removal

When rechecking, compare the file before removal with the file after removal.

This makes it easier to judge what disappeared and what remained than looking only at the post-removal file.

StageWhat to look atReason
Before removalWhich metadata is presentUnderstand the risky items
Removal processWhich tool was used and what was doneRecord the processing details
After removalWhether the same items remainConfirm the removal result
Check with another toolCheck from another perspectiveReduce misses by a single tool
Visual reviewCheck backgrounds and textLook for clues other than metadata

For high-risk files, clearly separate the original file from the publication copy.

The basic rule is not to upload the original file as-is.

Items to Check When Rechecking

The items to check differ by format.

FormatItems to check
ImageGPS, capture time, device model, editing software, thumbnail
PDFAuthor, creation application, annotations, embedded files, update history
OfficeAuthor, organization name, comments, revision history, hidden sheets
AudioID3 tags, recording time, creation application, environmental sound
VideoCapture time, GPS, creation application, audio, background, text inside frames

Check the file name as well as the metadata.

A file name such as 田中_履歴書_final.pdf leaves personal information even if the metadata has been removed.

Rechecking Means Different Things by Format

Even under the word metadata, how information remains differs by file format.

In images, GPS and capture time are central.

In PDFs, the author, creation application, annotations, and embedded files become issues.

In Office documents, the author, company name, revision history, comments, and hidden sheets become issues.

In audio and video, voices and background sounds matter as well as tags.

FormatWhat to emphasize when recheckingReason
JPEG image, GPS, thumbnailCapture location and capture time tend to remain
PNG imageText chunks, creation applicationScreenshot and editing information may remain
PDFAuthor, annotations, embedded contentInformation not visible in appearance remains
Office documentComments, revision history, hidden elementsWork process and organization names may remain
Audio and videoTags, creation application, audio contentBoth metadata and content need to be reviewed

If you treat every format the same way with only one checking method, you increase the chance of missing something.

Recheck after learning what tends to remain in each format.

Why Check Locally

As a rule, avoid uploading files where anonymity matters to external sites for checking.

When you upload to an external site, the file contents and access logs are passed to that site.

For highly confidential files, prioritize local tools.

ExifTool is a representative tool that can check metadata locally.

URL : https://exiftool.org/

However, even if you use a local tool, its usefulness is limited if the device itself is not safe.

When working on a workplace device, school device, shared PC, or cloud-synced folder, also consider logs on the device and storage location.

Rechecking Flow

The practical flow can be organized as follows.

OrderTaskReason
1Store the original file and create a publication copyDo not mix the original and publication copy
2Check metadata in the publication copyUnderstand what remains
3Perform necessary removal or regenerationReduce risky information
4Recheck the same items after removalConfirm the processing result
5Review appearance from another perspectiveCheck backgrounds, reflections, text, and audio
6Check the file name and storage locationPrevent leakage from information outside the file contents

This flow may look tedious.

However, once a file is published, it may be copied, saved, and redistributed.

It is far easier to check before publishing than to notice after publication.

Risks That Remain After Rechecking

Rechecking does not remove every risk.

Even if metadata is gone, clues remain in the content itself.

Review writing style, document content, image backgrounds, audio voices, video environmental sound, and quotations or timelines inside PDFs separately.

Logs also remain in the route used to send a file.

Cloud sharing, email, social media DMs, upload forms, and whistleblowing tools each involve different records.

Whether to Keep a Record of Rechecking

In high-risk work, you may want to keep a personal record of what you checked.

However, that record itself can become a new risk.

RecordCaution
Screenshot of check resultsFile names, paths, or user names may appear
Work notesYou may write too much about real names, places, or circumstances
Cloud storageIt can connect to sync history or an account
Consultation by chatThe consultation partner and sending time are recorded
Paper notesThe storage location and disposal method become issues

Even when records are necessary, avoid placing them in an environment or personal cloud used with a real-name account.

For anonymity, also consider traces of the checking work itself.

Summary

When removing metadata, rechecking after removal is important.

Using a removal tool alone is not enough to judge that every piece of information has disappeared.

Images, PDFs, Office documents, audio, and video files store information in different places and forms.

You need to compare before and after removal, check from another perspective, and also inspect the file name and visible information.

For anonymity, "I think I removed it" is not treated as safe.

Create a publication copy, remove information, recheck it, and finally check the appearance and sending route.

Related tools

Archive check

Wayback Machine

An external resource related to this article. Open it only when it fits your situation and threat model.

Why it is listed: It can help with the article topic, but it is outside Anonymity Sense and should be checked before use.

URL : https://web.archive.org/

Open external site
Metadata inspection

ExifTool

An external resource related to this article. Open it only when it fits your situation and threat model.

Why it is listed: It can help with the article topic, but it is outside Anonymity Sense and should be checked before use.

URL : https://exiftool.org/

Open external site
Metadata removal

MAT2

An external resource related to this article. Open it only when it fits your situation and threat model.

Why it is listed: It can help with the article topic, but it is outside Anonymity Sense and should be checked before use.

URL : https://0xacab.org/jvoisin/mat2

Open external site

Related articles