Image estimation of words based on adjective Co-occurrences

Kouhei Shimizu, Masafumi Hagiwara

Research output: Contribution to journalArticlepeer-review

Abstract

In natural language, words convey various impressions such as "kurai (dark)-afajrai (bright)" or "kitanai (dirty)-utsukushii (beautiful)." Impressions play an important role in inferring the speaker's intentions or feelings. Systems that use natural language to support conceptualization or retrieve information by teaching the computer the impressions of words are growing significantly. Therefore, in this paper, the authors propose a technique for estimating the impressions or images that words convey. The proposed technique consists of two processes: similarity measurement and spatial positioning. Similarity measurement uses the enormous amount of text on the Internet to obtain similarities by measuring the co-occurrence frequency of words. Co-occurrence conditions, which are limited to the joint co-occurrences of adjectives or adjectival nouns, are applied to image estimation. Spatial positioning positions words in space by quantifying the similarities that were obtained according to images. The authors used a simulation of an electrical circuit for quantification. The results of evaluation experiments verified that the proposed system could estimate the images of words with high precision. It was also apparent that the proposed system could estimate the images with sufficient precision even for words that were difficult to analyze because of the insufficient number of samples in an existing corpus, including new words and colloquial words or slang.

Original languageEnglish
Pages (from-to)14-24
Number of pages11
JournalSystems and Computers in Japan
Volume38
Issue number12
DOIs
Publication statusPublished - 2007 Nov 15

Keywords

  • Co-occurrence
  • Corpus
  • Natural language processing
  • World wide Web

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Information Systems
  • Hardware and Architecture
  • Computational Theory and Mathematics

Fingerprint

Dive into the research topics of 'Image estimation of words based on adjective Co-occurrences'. Together they form a unique fingerprint.

Cite this