Supported Entity Types
Review the available entity types that NeMo Safe Synthesizer can detect and redact in your data.
NeMo Safe Synthesizer uses machine learning models, regular expressions, and custom patterns to detect personally identifiable information (PII) and sensitive data.
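To make the regular-expression part of that detection concrete, here is a minimal sketch in plain Python. It is not NeMo Safe Synthesizer's implementation: the label names (`ssn`, `email`, `ipv4`), the patterns, and the `find_entities` and `redact` helpers are all assumptions made up for illustration, and the patterns are deliberately simple.

```python
import re

# Hypothetical labels and patterns for this sketch only; they are not
# confirmed NeMo Safe Synthesizer entity names or detection rules.
PATTERNS = {
    "ssn": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
    "email": re.compile(r"\b[\w.+-]+@[\w-]+(?:\.[\w-]+)+\b"),
    "ipv4": re.compile(r"\b(?:\d{1,3}\.){3}\d{1,3}\b"),
}


def find_entities(text):
    """Return (label, matched_text, start, end) tuples sorted by position."""
    hits = []
    for label, pattern in PATTERNS.items():
        for match in pattern.finditer(text):
            hits.append((label, match.group(), match.start(), match.end()))
    return sorted(hits, key=lambda hit: hit[2])


def redact(text):
    """Replace each match with a <LABEL> placeholder.

    Replacements run right to left so earlier offsets stay valid.
    Overlapping matches are not handled in this sketch.
    """
    for label, _, start, end in reversed(find_entities(text)):
        text = text[:start] + f"<{label.upper()}>" + text[end:]
    return text


sample = "Contact john@company.com from 192.168.1.1, SSN 123-45-6789"
print(redact(sample))
```

Pattern matching of this kind only covers values with a rigid format. Free-form entities such as names or job titles are where the machine learning models mentioned above would be needed.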
Entity Names
Use the entity names shown in the tables below when configuring PII detection and replacement. These are the standardized names used across all configuration files, API calls, and transformations. The system has been fine-tuned on these entity types, although the PII replacement component will still attempt to classify any other entity type you specify.
Core Entity Types
The following entity types are available by default:
Personal Identifiers
| Entity Type | Description | Example Values |
|---|---|---|
| | Full person names | “John Smith”, “Maria García” |
| | Given names | “John”, “Maria”, “Alice” |
| | Family names | “Smith”, “García”, “Johnson” |
Contact Information
| Entity Type | Description | Example Values |
|---|---|---|
| | Email addresses | “john@company.com”, “user@domain.org” |
| | Phone numbers | “555-123-4567”, “+1-800-555-0199” |
| | Fax numbers | “555-123-4568” |
Address Information
| Entity Type | Description | Example Values |
|---|---|---|
| | Complete addresses | “123 Main St, Anytown, CA 90210” |
| | Street addresses | “123 Main St”, “456 Oak Avenue” |
| | City names | “New York”, “Los Angeles” |
| | County names | “Harris”, “Maricopa”, “Orange” |
| | State/province names | “California”, “NY” |
| | Postal/ZIP codes | “90210”, “12345” |
| | Country names | “United States”, “Canada” |
Government Identifiers
| Entity Type | Description | Example Values |
|---|---|---|
| | Social Security Numbers | “123-45-6789” |
| | National ID numbers | “AB123456C” |
| | Tax identification numbers | “12-3456789” |
| | Certificate/license numbers | “LIC123456”, “CERT-789012” |
Financial Information
| Entity Type | Description | Example Values |
|---|---|---|
| | Payment card numbers | “4111-1111-1111-1111” |
| | Card verification values | “123”, “4567” |
| | Personal identification numbers | “1234”, “5678” |
| | Bank account numbers | “1234567890” |
| | Bank routing numbers | “123456789” |
| | SWIFT/BIC codes | “CHASUS33”, “DEUTDEFF” |
| | International Bank Account Numbers | “GB29 NWBK 6016 1331 9268 19” |
Technical Identifiers
| Entity Type | Description | Example Values |
|---|---|---|
| | Web URLs | “https://example.com” |
| | IPv4 addresses | “192.168.1.1” |
| | IPv6 addresses | “2001:db8::1” |
| | Hardware MAC addresses | “00:1B:44:11:3A:B7” |
| | API keys and tokens | “sk_test_123abc…” |
| | Usernames | “jsmith”, “user123” |
| | Passwords | “MyP@ssw0rd!”, “secret123” |
| | HTTP cookies | “sessionId=abc123” |
| | Device IDs | “iPhone12,1”, “SM-G975F” |
Vehicle Identifiers
| Entity Type | Description | Example Values |
|---|---|---|
| | Vehicle identification numbers (VINs) | “1HGCM82633A123456” |
| | License plates | “ABC-1234”, “CA 1ABC123” |
Medical Information
| Entity Type | Description | Example Values |
|---|---|---|
| | Medical record numbers | “MRN123456”, “H123456789” |
| | Insurance IDs | “INS-123456789”, “BCBS-987654321” |
| | Biometric data references | “FP-123456”, “DNA-SAMPLE-789” |
Geographic and Temporal Information
| Entity Type | Description | Example Values |
|---|---|---|
| | Latitude coordinates | “37.7749”, “40.7128” |
| | Longitude coordinates | “-122.4194”, “-74.0060” |
| | Coordinate pairs | “(37.7749, -122.4194)” |
Other Identifiers
| Entity Type | Description | Example Values |
|---|---|---|
| | Generic unique IDs | “ID123456”, “UUID-abc-def” |
| | Customer identifiers | “CUST001”, “C-123456” |
| | Employee identifiers | “EMP001”, “E-789012” |
Quasi-Identifiers
Quasi-identifiers are attributes that may not directly identify individuals but can be combined with other data for identification:
| Entity Type | Description | Example Values |
|---|---|---|
| | Date values | “2023-01-15”, “01/15/2023” |
| | Date and time values | “2023-01-15 14:30:00”, “01/15/2023 2:30 PM” |
| | Birth dates | “1985-03-15”, “March 15, 1985” |
| | Time values | “14:30:00”, “2:30 PM” |
| | Ages | “18”, “72” |
| | Blood type information | “A+”, “O-”, “AB+” |
| | Gender information | “male”, “female”, “non-binary” |
| | Sexual orientation | “heterosexual”, “gay”, “lesbian” |
| | Political affiliations | “Democrat”, “Republican”, “Independent” |
| | Race & ethnicity information | “Asian”, “Caucasian”, “Hispanic” |
| | Religious affiliations | “Christian”, “Muslim”, “Jewish” |
| | Language preferences | “English”, “Spanish”, “Mandarin” |
| | Education level | “Bachelor’s Degree”, “High School”, “PhD” |
| | Professional titles | “Software Engineer”, “Manager”, “Director” |
| | Employment information | “Full-time”, “Part-time”, “Unemployed” |
| | Organization names | “ACME Corp”, “Tech Solutions Inc” |
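The way quasi-identifiers combine can be illustrated with a small, self-contained sketch. The toy records, the column names (`age`, `gender`), and the `unique_combinations` helper are hypothetical and are not part of NeMo Safe Synthesizer. The idea is that no single column singles anyone out, but the combination does.

```python
from collections import Counter

# Toy records; the column names are hypothetical and chosen for this sketch.
records = [
    {"age": "34", "gender": "female"},
    {"age": "34", "gender": "male"},
    {"age": "72", "gender": "female"},
    {"age": "72", "gender": "male"},
    {"age": "34", "gender": "female"},
]


def unique_combinations(rows, columns):
    """Return the value combinations that occur in exactly one row."""
    counts = Counter(tuple(row[col] for col in columns) for row in rows)
    return [combo for combo, n in counts.items() if n == 1]


# Each column on its own is shared by at least two records.
print(unique_combinations(records, ["age"]))
print(unique_combinations(records, ["gender"]))

# Taken together, the same two columns isolate three individual records.
print(unique_combinations(records, ["age", "gender"]))
```

A check like this is one rough way to decide which quasi-identifier columns are worth transforming: the more combinations that map to a single record, the higher the re-identification risk.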