On Approximately Searching for Similar Word Embeddings - Yahoo! JAPAN R&D

Publications

CONFERENCE (INTERNATIONAL) On Approximately Searching for Similar Word Embeddings

Kohei Sugawara, Hayato Kobayashi and Masajiro Iwasaki

the annual meeting of the Association for Computational Linguistics (ACL2016)

August 07, 2016

We discuss an approximate similarity search for word embeddings, which is an operation to approximately find embeddings close to a given vector. We compared several metric-based search algorithms with hash-, tree-, and graph- based indexing from different aspects. Our experimental results showed that a graph-based indexing exhibits robust performance and additionally provided useful information, e.g., vector normalization achieves an efficient search with cosine similarity.

Natural Language Processing
Information Retrieval
Machine Learning

Paper : On Approximately Searching for Similar Word Embeddings (external link)

PDF : On Approximately Searching for Similar Word Embeddings