TY - JOUR
T1 - Word vectorization using relations among words for neural network
AU - Hotta, Hajime
AU - Kittaka, Masanobu
AU - Hagiwara, Masafumi
PY - 2010
Y1 - 2010
N2 - In this paper, we propose a new vectorization method for a new generation of computational intelligence, including neural networks and natural language processing. In recent years, various techniques for word vectorization have been proposed, many of which rely on the preparation of dictionaries. However, these techniques do not consider the symbol grounding problem for unknown types of data, which is one of the most fundamental issues in artificial intelligence. To avoid the symbol grounding problem, pattern-processing-based methods, such as neural networks, are often used in studies on self-directive systems and algorithms, and the merits of neural networks hold in natural language processing as well. The proposed method is a converter from one word input to one real-valued vector, whose algorithm is inspired by neural network architecture. The merits of the method are as follows: (1) the method requires no specific knowledge of linguistics, such as word classes or grammar; (2) the method is a sequence learning technique and can learn additional knowledge. The experiment showed the efficiency of word vectorization in terms of similarity measurement.
AB - In this paper, we propose a new vectorization method for a new generation of computational intelligence, including neural networks and natural language processing. In recent years, various techniques for word vectorization have been proposed, many of which rely on the preparation of dictionaries. However, these techniques do not consider the symbol grounding problem for unknown types of data, which is one of the most fundamental issues in artificial intelligence. To avoid the symbol grounding problem, pattern-processing-based methods, such as neural networks, are often used in studies on self-directive systems and algorithms, and the merits of neural networks hold in natural language processing as well. The proposed method is a converter from one word input to one real-valued vector, whose algorithm is inspired by neural network architecture. The merits of the method are as follows: (1) the method requires no specific knowledge of linguistics, such as word classes or grammar; (2) the method is a sequence learning technique and can learn additional knowledge. The experiment showed the efficiency of word vectorization in terms of similarity measurement.
KW - Natural language processing
KW - Neural network
KW - Self-organizing map
KW - Thesaurus
KW - Vectorization
UR - http://www.scopus.com/inward/record.url?scp=77956798167&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=77956798167&partnerID=8YFLogxK
U2 - 10.1541/ieejeiss.130.75
DO - 10.1541/ieejeiss.130.75
M3 - Article
AN - SCOPUS:77956798167
SN - 0385-4221
VL - 130
SP - 75
EP - 82
JO - IEEJ Transactions on Electronics, Information and Systems
JF - IEEJ Transactions on Electronics, Information and Systems
IS - 1
ER -