Learn

284 articlesCategory: All
Text and content

What is content correlation?

Anonymity is not determined only by the communication environment.

The text you post. The background visible in an image. The contents of a file. The way you choose topics. Personal experiences. Specialized fields.

This content itself also becomes material for inferring identity.

Content correlation means that published content connects with past information, real-name accounts, affiliations, places where someone regularly spends time, or the memories of people involved.

Content speaks about the person

People reveal their own experience through what they write.

The area where they live. The industry where they work. The school they attended. Trouble they experienced. Places they often visit. Topics they care about. Matters known only to insiders.

Even without writing a name, the content narrows the candidate set.

ContentWhat becomes visible
Personal experienceCandidates for the person or people involved
Specialized termsOccupation, industry, affiliated field
Regional topicsRegular activity area, travel range
Inside informationAffiliated organization or people involved
Image backgroundPlace, school, workplace
File contentsCreator, organization, work-related content

Content correlation is different from technical tracking. It happens in the form of "people who know can tell when they read it."

Personal experiences become strong clues

Personal experiences are persuasive. However, for anonymity, they are very strong clues.

Events known only to a small number of people are especially dangerous.

"Trouble that happened in the department last year" "A story known only to some people inside the school" "Something I saw at a specific event" "A timeline known only to people involved"

Even if outsiders do not understand these details, people involved do.

Even if an anonymous post does not spread widely, candidates can be narrowed if it reaches people involved.

When writing about personal experience, separate the facts readers need from the details that narrow down the person.

For a story that "harassment happened at work," the exact department name, meeting date, supervisor's verbal habits, number of participants, and internal system name are not always necessary. On the other hand, to communicate the structure of the problem, it may be necessary to show the industry or position to some degree.

DetailHow to handle it
Date and timeUse a period rather than a date
PlaceUse a broad category rather than a specific name
Number of peopleGeneralize it if the group is small
Job titleGeneralize it if it narrows candidates
ConversationAvoid wording unique to the person

For anonymity, reduce the level of detail needed for identification while keeping persuasiveness.

Look at image and file contents too

Content correlation is not only about text.

Image backgrounds show places and affiliations. PDFs and Office documents may retain internal terms, department names, case names, and comments in the body. Screenshots may show notifications, tab names, account names, and bookmarks.

Even if metadata is deleted, it is meaningless if the visible content remains.

TypeExampleCaution
MetadataCreator, date/time, location informationEasy to check with tools
Body contentInside information, personal experiences, proper nounsHumans infer by reading
Image contentBackgrounds, reflections, signs, uniformsNeeds to be checked by zooming in
ScreenshotNotifications, tabs, account namesEdge information is easy to miss

For anonymity, check both hidden file information and what is visibly shown.

It connects with past posts

Even if the current post does not contain a real name, anonymity weakens if it connects with past posts.

You wrote the same personal experience on an old blog. You used the same image on past social media. You posted under your real name about the same specialized field. You wrote the same argument in the same order.

These overlaps can be found through search or memory.

When thinking about anonymity, looking only at the current post is not enough. You need to check whether your past self has put out the same information.

Generalizing the wrong way does not help

When generalizing content, simply removing proper nouns is not enough.

Even if you remove a place name, writing about a regional event reveals the place. Even if you remove a company name, candidates narrow if the industry, role, number of people, and timing are detailed. Even if you remove a school name, the grade, event, club activity, and region can combine to reveal it.

Risky wordingSafer way to thinkReason
Remove only the company nameMake the industry, number of people, and timing rougher tooThe combination reveals the affiliation
Remove only the station nameReview the route and places you regularly spend time tooThe travel range remains
Hide only the nameGeneralize the relationship and role tooPeople involved can tell
Remove only the dateLook at surrounding events tooThe timeline narrows candidates

The purpose of generalizing is not to make the writing vague. It is to reduce the level of detail needed for identification.

What to check before posting

To avoid content correlation, check the following before posting.

  • Whether you wrote about events known only to you or people involved
  • Whether a region, workplace, school, or places you regularly spend time are visible
  • Whether you reused the same personal experience as in a past post
  • Whether an image background or reflection contains clues
  • Whether a screenshot contains notifications or account names
  • Whether a file body retains internal terms or comments
  • Whether the combined generalized information still allows identification

When checking, think about how it would look not to you, but to "someone who knows you."

For anonymity, people close to you may sometimes pose a greater identification risk than outside strangers.

Some information is visible only to people involved

Text that looks abstract to outside readers can look specific to people involved.

Even without writing a company name, if internal terms, unique system names, names for meetings, or project timing appear, people inside the company can tell. Even without writing a school name, an event name, the way a teacher is addressed, or club achievements may reveal it.

For content correlation, the standard is not "would a general public reader understand?" but "would someone who was there understand?"

Choose where to keep specificity

If you make everything vague for anonymity, the writing becomes hard to read. To help readers understand the problem, some specificity is necessary.

What matters is choosing where to place that specificity.

For example, if you want to communicate a problem with a system, write specifically about the mechanism rather than personal names or department names. If you want to communicate a regional danger, explain the broad region or situation rather than the nearest station.

What you want to communicateSpecificity to keepSpecificity to remove
Workplace problemMechanism, impact, conditions for recurrenceDepartment name, meeting date, comments from a small group
Regional problemStructure of transportation or environmentNearest station, shop near home
School problemSystem or response flowClass, homeroom teacher name, event date
Personal harmType of harm and responseDetails that identify the other person

Writing that protects anonymity is not simply thin writing. It keeps the information readers need and removes information used for identification.

Summary

Content correlation means that published content connects with past information, real-name accounts, affiliations, places where someone regularly spends time, or the memories of people involved.

The contents of text, images, screenshots, PDFs, Office documents, videos, and audio become clues.

Even if you remove names and place names, candidates narrow if personal experiences, specialized terms, timelines, backgrounds, or information known only to people involved remains.

For anonymity, you need to check not only the communication route and metadata, but also the content you publish. What matters is not deleting information, but reducing the level of detail needed for identification.

Related tools

OSINT directory

OSINT Framework

An external resource related to this article. Open it only when it fits your situation and threat model.

Why it is listed: It can help with the article topic, but it is outside Anonymity Sense and should be checked before use.

URL : https://osintframework.com/

Open external site
Reverse image search

Google Lens

An external resource related to this article. Open it only when it fits your situation and threat model.

Why it is listed: It can help with the article topic, but it is outside Anonymity Sense and should be checked before use.

URL : https://lens.google/

Open external site
Metadata inspection

ExifTool

An external resource related to this article. Open it only when it fits your situation and threat model.

Why it is listed: It can help with the article topic, but it is outside Anonymity Sense and should be checked before use.

URL : https://exiftool.org/

Open external site
Metadata removal

MAT2

An external resource related to this article. Open it only when it fits your situation and threat model.

Why it is listed: It can help with the article topic, but it is outside Anonymity Sense and should be checked before use.

URL : https://0xacab.org/jvoisin/mat2

Open external site
Audio and video

FFmpeg

An external resource related to this article. Open it only when it fits your situation and threat model.

Why it is listed: It can help with the article topic, but it is outside Anonymity Sense and should be checked before use.

URL : https://ffmpeg.org/

Open external site

Related articles