TY - GEN
T1 - Joint learning of the embedding of words and entities for named entity disambiguation
AU - Yamada, Ikuya
AU - Shindo, Hiroyuki
AU - Takeda, Hideaki
AU - Takefuji, Yoshiyasu
N1 - Publisher Copyright:
© 2016 Association for Computational Linguistics.
PY - 2016
Y1 - 2016
N2 - Named Entity Disambiguation (NED) refers to the task of resolving multiple named entity mentions in a document to their correct references in a knowledge base (KB) (e.g., Wikipedia). In this paper, we propose a novel embedding method specifically designed for NED. The proposed method jointly maps words and entities into the same continuous vector space. We extend the skip-gram model by using two models. The KB graph model learns the relatedness of entities using the link structure of the KB, whereas the anchor context model aims to align vectors such that similar words and entities occur close to one another in the vector space by leveraging KB anchors and their context words. By combining contexts based on the proposed embedding with standard NED features, we achieved state-of-the-art accuracy of 93.1% on the standard CoNLL dataset and 85.2% on the TAC 2010 dataset.
AB - Named Entity Disambiguation (NED) refers to the task of resolving multiple named entity mentions in a document to their correct references in a knowledge base (KB) (e.g., Wikipedia). In this paper, we propose a novel embedding method specifically designed for NED. The proposed method jointly maps words and entities into the same continuous vector space. We extend the skip-gram model by using two models. The KB graph model learns the relatedness of entities using the link structure of the KB, whereas the anchor context model aims to align vectors such that similar words and entities occur close to one another in the vector space by leveraging KB anchors and their context words. By combining contexts based on the proposed embedding with standard NED features, we achieved state-of-the-art accuracy of 93.1% on the standard CoNLL dataset and 85.2% on the TAC 2010 dataset.
UR - http://www.scopus.com/inward/record.url?scp=85021631436&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85021631436&partnerID=8YFLogxK
U2 - 10.18653/v1/k16-1025
DO - 10.18653/v1/k16-1025
M3 - Conference contribution
AN - SCOPUS:85021631436
T3 - CoNLL 2016 - 20th SIGNLL Conference on Computational Natural Language Learning, Proceedings
SP - 250
EP - 259
BT - CoNLL 2016 - 20th SIGNLL Conference on Computational Natural Language Learning, Proceedings
PB - Association for Computational Linguistics (ACL)
T2 - 20th SIGNLL Conference on Computational Natural Language Learning, CoNLL 2016
Y2 - 11 August 2016 through 12 August 2016
ER -