A term-based cross-cultural computing system for multilingual analysis with phonological-semantic vector spaces

Totok Suhardijanto, Ali Ridho Barakbah, Yasushi Kiyoki

Research output: Contribution to journal › Article

3 Citations (Scopus)

Abstract

This paper proposes a cross-cultural computing system for multilingual analysis. The system compares cultural aspects on the basis of basic linguistic elements. Its most important task is to realize cross-cultural computation within a correlation-computation framework, using vectorized numeric data that express cultural aspects of concepts and objects with regard to speech sounds. The key technology of the system is a cross-cultural semantic distance computation in phonological-semantic metadata spaces that involve the phonological aspects of sound, syllabic and lexical composition features. The phonological-semantic metadata of multiple languages are extracted based on two main aspects of language: form and meaning. Form refers to speech sound, and meaning refers to the semantics of language. We compare language units (or terms) with the same meaning from different cultures, focusing on the speech sound characteristics of the terms. The speech sound metadata are extracted from a term and separated based on the phonological aspects of sound, syllabic and lexical composition features. These metadata are converted into vectorized numeric data to create phonological-semantic vector spaces. Using these spaces, we conducted similarity and weighting computations to perform a comparative analysis of language-related metadata. Our research goal is to perform a language similarity analysis through a term-based distance calculation in phone (sound) and meaning spaces, and to reconstruct an inheritance relationship among languages via agglomerative hierarchical clustering based on an inter-term distance calculation. Our system clusters the phonological-semantic vector space and presents a 2D visualization of cultural differentiation to further analyze the interconnectedness across languages. In this paper, we apply the proposed cross-cultural computing system experimentally to linguistic data from 32 different Asian-Oceanic languages.
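
The abstract describes a pipeline of term vectors, inter-term distances, and agglomerative hierarchical clustering. A minimal Python sketch of that idea, under stated assumptions, might look like the following. The feature vectors, the language labels (`lang_A` to `lang_D`), the Euclidean metric, and the average-linkage rule are all illustrative assumptions; the abstract does not specify the paper's actual features, weighting, or distance function.

```python
from itertools import combinations
from math import sqrt

# Made-up toy data: language -> meaning -> phonological feature vector.
# The three dimensions are placeholders, NOT the feature set used in the paper.
TERMS = {
    "lang_A": {"water": [0.6, 2.0, 0.4], "stone": [0.5, 2.0, 0.5]},
    "lang_B": {"water": [0.6, 2.0, 0.4], "stone": [0.5, 3.0, 0.5]},
    "lang_C": {"water": [0.2, 1.0, 0.8], "stone": [0.3, 1.0, 0.7]},
    "lang_D": {"water": [0.2, 1.0, 0.8], "stone": [0.3, 1.0, 0.6]},
}


def term_distance(u, v):
    """Euclidean distance between two term vectors (assumed metric)."""
    return sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def language_distance(lang1, lang2, terms=TERMS):
    """Mean inter-term distance over the meanings both languages share."""
    shared = sorted(set(terms[lang1]) & set(terms[lang2]))
    total = sum(term_distance(terms[lang1][m], terms[lang2][m]) for m in shared)
    return total / len(shared)


def agglomerate(languages, terms=TERMS):
    """Average-linkage agglomerative clustering; returns the merge history."""
    clusters = [(lang,) for lang in languages]
    merges = []
    while len(clusters) > 1:
        def linkage(pair):
            c1, c2 = pair
            pairwise = sum(language_distance(a, b, terms) for a in c1 for b in c2)
            return pairwise / (len(c1) * len(c2))

        # Merge the two clusters with the smallest average distance.
        c1, c2 = min(combinations(clusters, 2), key=linkage)
        merges.append((c1, c2, linkage((c1, c2))))
        clusters = [c for c in clusters if c not in (c1, c2)] + [c1 + c2]
    return merges


merges = agglomerate(sorted(TERMS))
for left, right, dist in merges:
    print(left, right, round(dist, 3))
```

The merge history plays the role of a dendrogram: languages whose same-meaning terms sound alike are joined first, which is the intuition behind reconstructing an inheritance relationship from inter-term distances.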

Original language: English
Pages (from-to): 20-38
Number of pages: 19
Journal: Frontiers in Artificial Intelligence and Applications
Volume: 237
DOI: 10.3233/978-1-60750-992-9-20
Publication status: Published - 2012

Keywords

  • multilingual analysis
  • phonological-semantic vector space
  • term-based cross-cultural computing system

ASJC Scopus subject areas

  • Artificial Intelligence

Cite this

A term-based cross-cultural computing system for multilingual analysis with phonological-semantic vector spaces. / Suhardijanto, Totok; Barakbah, Ali Ridho; Kiyoki, Yasushi.

In: Frontiers in Artificial Intelligence and Applications, Vol. 237, 2012, p. 20-38.

@article{0ca823fe39624cbd887e6d5ef60f3207,
title = "A term-based cross-cultural computing system for multilingual analysis with phonological-semantic vector spaces",
abstract = "This paper proposes a cross-cultural computing system for multilingual analysis. The system compares cultural aspects on the basis of basic linguistic elements. Its most important task is to realize cross-cultural computation within a correlation-computation framework, using vectorized numeric data that express cultural aspects of concepts and objects with regard to speech sounds. The key technology of the system is a cross-cultural semantic distance computation in phonological-semantic metadata spaces that involve the phonological aspects of sound, syllabic and lexical composition features. The phonological-semantic metadata of multiple languages are extracted based on two main aspects of language: form and meaning. Form refers to speech sound, and meaning refers to the semantics of language. We compare language units (or terms) with the same meaning from different cultures, focusing on the speech sound characteristics of the terms. The speech sound metadata are extracted from a term and separated based on the phonological aspects of sound, syllabic and lexical composition features. These metadata are converted into vectorized numeric data to create phonological-semantic vector spaces. Using these spaces, we conducted similarity and weighting computations to perform a comparative analysis of language-related metadata. Our research goal is to perform a language similarity analysis through a term-based distance calculation in phone (sound) and meaning spaces, and to reconstruct an inheritance relationship among languages via agglomerative hierarchical clustering based on an inter-term distance calculation. Our system clusters the phonological-semantic vector space and presents a 2D visualization of cultural differentiation to further analyze the interconnectedness across languages. In this paper, we apply the proposed cross-cultural computing system experimentally to linguistic data from 32 different Asian-Oceanic languages.",
keywords = "multilingual analysis, phonological-semantic vector space, term-based cross-cultural computing system",
author = "Suhardijanto, Totok and Barakbah, {Ali Ridho} and Kiyoki, Yasushi",
year = "2012",
doi = "10.3233/978-1-60750-992-9-20",
language = "English",
volume = "237",
pages = "20--38",
journal = "Frontiers in Artificial Intelligence and Applications",
issn = "0922-6389",
publisher = "IOS Press",
}

TY - JOUR

T1 - A term-based cross-cultural computing system for multilingual analysis with phonological-semantic vector spaces

AU - Suhardijanto, Totok

AU - Barakbah, Ali Ridho

AU - Kiyoki, Yasushi

PY - 2012

Y1 - 2012

N2 - This paper proposes a cross-cultural computing system for multilingual analysis. The system compares cultural aspects on the basis of basic linguistic elements. Its most important task is to realize cross-cultural computation within a correlation-computation framework, using vectorized numeric data that express cultural aspects of concepts and objects with regard to speech sounds. The key technology of the system is a cross-cultural semantic distance computation in phonological-semantic metadata spaces that involve the phonological aspects of sound, syllabic and lexical composition features. The phonological-semantic metadata of multiple languages are extracted based on two main aspects of language: form and meaning. Form refers to speech sound, and meaning refers to the semantics of language. We compare language units (or terms) with the same meaning from different cultures, focusing on the speech sound characteristics of the terms. The speech sound metadata are extracted from a term and separated based on the phonological aspects of sound, syllabic and lexical composition features. These metadata are converted into vectorized numeric data to create phonological-semantic vector spaces. Using these spaces, we conducted similarity and weighting computations to perform a comparative analysis of language-related metadata. Our research goal is to perform a language similarity analysis through a term-based distance calculation in phone (sound) and meaning spaces, and to reconstruct an inheritance relationship among languages via agglomerative hierarchical clustering based on an inter-term distance calculation. Our system clusters the phonological-semantic vector space and presents a 2D visualization of cultural differentiation to further analyze the interconnectedness across languages. In this paper, we apply the proposed cross-cultural computing system experimentally to linguistic data from 32 different Asian-Oceanic languages.

KW - multilingual analysis

KW - phonological-semantic vector space

KW - term-based cross-cultural computing system

UR - http://www.scopus.com/inward/record.url?scp=84869663405&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84869663405&partnerID=8YFLogxK

U2 - 10.3233/978-1-60750-992-9-20

DO - 10.3233/978-1-60750-992-9-20

M3 - Article

AN - SCOPUS:84869663405

VL - 237

SP - 20

EP - 38

JO - Frontiers in Artificial Intelligence and Applications

JF - Frontiers in Artificial Intelligence and Applications

SN - 0922-6389

ER -