Entity linking is the process of matching concepts mentioned in natural language to their unique IDs and canonical forms stored
in a knowledge base. For example, an entity linking model might match the phrase "blood thinners" mentioned in conversation
to the knowledge base concept UID45623 anticoagulant. Entity linking applications range from automating the ingestion of
large amounts of data to assisting in real-time concept normalization.
Within NeMo we use the entity linking approach described in Liu et al.’s NAACL 2021 paper “Self-Alignment Pretraining for Biomedical Entity Representations” [nlp-entity_linking1].
The main idea behind this approach is to reshape an initial concept embedding space such that synonyms of the same concept are
pulled closer together and unrelated concepts are pushed further apart. The concept embeddings from this reshaped space can then
be used to build a knowledge base embedding index.
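
As a rough illustration of how a knowledge base embedding index might be queried, the sketch below links a mention to its nearest concept by cosine similarity. It is not NeMo's implementation: the three-dimensional vectors are invented, the IDs UID00001 and UID00002 and the helper names `cosine_similarity` and `link_entity` are hypothetical, and a real index would hold embeddings produced by the trained encoder, typically behind an approximate nearest-neighbor library.

```python
import math

# Hypothetical toy index: concept ID -> (canonical name, embedding).
# Only UID45623 comes from the text above; the other IDs and all vectors
# are made up for illustration.
KB_INDEX = {
    "UID45623": ("anticoagulant", [0.9, 0.1, 0.0]),
    "UID00001": ("analgesic", [0.1, 0.9, 0.1]),
    "UID00002": ("antibiotic", [0.0, 0.2, 0.9]),
}


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def link_entity(query_embedding, index):
    """Return (concept_id, canonical_name, score) for the closest concept."""
    best_id = max(
        index,
        key=lambda cid: cosine_similarity(query_embedding, index[cid][1]),
    )
    name, embedding = index[best_id]
    return best_id, name, cosine_similarity(query_embedding, embedding)


# Stand-in for the encoder's embedding of the mention "blood thinners".
# In the reshaped space, a synonym is assumed to land near its concept.
mention_embedding = [0.8, 0.2, 0.1]

concept_id, canonical_name, score = link_entity(mention_embedding, KB_INDEX)
print(concept_id, canonical_name, round(score, 3))
```

The exhaustive scan is only practical for a toy index like this one. The example relies on the property described above: because synonyms are pulled toward their concept and unrelated concepts are pushed away, a simple nearest-neighbor lookup is enough to recover the concept ID.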