Question 1

What does HIPAA Safe Harbor require?

Accepted Answer

HIPAA Safe Harbor specifies 18 identifiers that must be removed from health information to achieve de-identification: names, medical record numbers, dates (except year), addresses, and more. anonym.today detects and removes all Safe Harbor identifiers, with audit documentation for compliance verification.

Question 2

Can IRBs accept hashed/pseudonymized data?

Accepted Answer

Yes. IRBs accept de-identified data when anonymization is done according to recognized standards (HIPAA Safe Harbor, GDPR principles). Providing detailed anonymization reports and documenting your methods strengthens IRB approval. anonym.today generates these reports automatically.

Question 3

How do I maintain subject IDs for longitudinal studies?

Accepted Answer

Use deterministic pseudonymization (consistent hashing). The same original identifier always maps to the same pseudonym (e.g., 'subj_123' → 'SUBJ_00045'). This preserves within-subject relationships for repeated measures and follow-up analyses while removing direct identifiers.

Question 4

Is my anonymized data truly safe from re-identification?

Accepted Answer

No single technique guarantees absolute protection. Risk depends on direct identifiers removed, quasi-identifiers retained, and external data availability. anonym.today removes all direct identifiers and provides guidance on suppressing quasi-identifiers. For high-risk data, differential privacy and k-anonymity techniques provide additional protection against linkage attacks.

Question 5

Can I track consent withdrawals in anonymized data?

Accepted Answer

With consistent pseudonymization, yes. Store a separate mapping file (kept secure, separate from research data) linking original IDs to pseudonyms. When a participant withdraws consent, you can identify and remove their data using the pseudonym, then securely destroy the mapping. This preserves the utility of remaining data.

Question 6

What's the difference between anonymization and pseudonymization?

Accepted Answer

Anonymization makes it impossible to identify a person; pseudonymization replaces identifiers with consistent codes while maintaining relationships. For research, pseudonymization is usually preferred because it enables longitudinal analyses and withdrawal tracking. Both are acceptable for IRB approval if properly documented.

Question 7

How do I share anonymized data with collaborators?

Accepted Answer

Export your anonymized dataset from anonym.today along with the anonymization report detailing all methods used. Share the data directly (no mapping file needed for collaborators—they only need the de-identified data). This enables secondary research while maintaining participant privacy and avoiding data access coordination.

Method	Use Case	Preserves Relationships	Reversible
Hash (MD5, SHA-256)	Generate consistent pseudonyms		No
Encryption (AES)	Reversible masking
Suppression (Redaction)	Remove sensitive fields	N/A	No
Generalization	Reduce precision (ZIP code, age group)		No
Synthetic Data	Maximum privacy with AI-generated data		N/A

Dataset Anonymization forEthical Research

The Challenge of Research Data Privacy

IRB Compliance Burden

Re-identification Risk

Reproducibility Loss

Consistent, Reproducible Anonymization

Consistent Hashing

Pseudonymization

Detection & Review

Audit Trail

Research-Grade Anonymization Workflow

Import Data

Scan & Detect

Configure Rules

Apply & Review

Export & Report

Research-Specific Benefits

IRB Compliance

Reproducibility

Participant Protection

Data Utility

Ethical Data Sharing

Meta-Analysis Ready

Common Research Scenarios

Clinical Trial Data

Survey & Behavioral Research

Genomics & Biomarker Studies

Educational & Social Sciences

Research FAQs