utils.fuzzy_dedup_utils.id_mapping#

Module Contents#

Functions#

convert_str_id_to_int

Converts the legacy id format “dataset_name-0000034” type of ID into 2 int based ID’s

int_ids_to_str

Converts int id’s generated via convert_str_id_to_int back to a string ID

API#

utils.fuzzy_dedup_utils.id_mapping.convert_str_id_to_int(
df: pandas.DataFrame | utils.fuzzy_dedup_utils.id_mapping.cudf,
id_column: str = 'id',
) pandas.DataFrame | utils.fuzzy_dedup_utils.id_mapping.cudf#

Converts the legacy id format “dataset_name-0000034” type of ID into 2 int based ID’s

utils.fuzzy_dedup_utils.id_mapping.int_ids_to_str(
df: pandas.DataFrame | utils.fuzzy_dedup_utils.id_mapping.cudf,
id_column: str = 'id',
) pandas.DataFrame | utils.fuzzy_dedup_utils.id_mapping.cudf#

Converts int id’s generated via convert_str_id_to_int back to a string ID