Two stages refinement of query translation for pivot language approach to cross lingual information retrieval: A trial at CLEF 200

Kazuaki Kishida, Noriko Kando

Research output: Chapter in Book/Report/Conference proceedingConference contribution

3 Citations (Scopus)

Abstract

This paper reports experimental results of cross-lingual information retrieval from German to Italian. The authors are concerned with CLIR in the case that available language resources are very limited. Thus transitive translation of queries using English as a pivot language was used to search Italian document collections for German queries without any direct bilingual dictionary or MT system of these two languages. In order to remove irrelevant translations produced by the transitive translation, we propose a disambiguation technique, in which two stages of refinement of query translation are executed. Basically, this refinement is based on the idea of pseudo rele-vance feedback. In the first stage, for each source query term, we select a translation candidate that appears most frequently in the set of top-ranked documents searched for a set of terms pro-vided via transitive translation of the source query. Next, in the second stage, a standard query expansion based on pseudo relevance feedback is conducted. Our experiment result showed that the two stages refinement method is able to improve significantly search performance of bilingual IR using a pivot language. However, it also turned out that performance of this method is inferior to that of machine translation method.

Original languageEnglish
Title of host publicationCLEF 2003 - Working Notes for CLEF 2003 Workshop, co-located with the 7th European Conference on Digital Libraries, ECDL 2003
PublisherCEUR-WS
Volume1169
Publication statusPublished - 2003
Externally publishedYes
Event2003 Cross Language Evaluation Forum Workshop, CLEF 2003, co-located with the 7th European Conference on Digital Libraries, ECDL 2003 - Trondheim, Norway
Duration: 2003 Aug 212003 Aug 22

Other

Other2003 Cross Language Evaluation Forum Workshop, CLEF 2003, co-located with the 7th European Conference on Digital Libraries, ECDL 2003
CountryNorway
CityTrondheim
Period03/8/2103/8/22

Fingerprint

Information retrieval
Feedback
Glossaries
Experiments

ASJC Scopus subject areas

  • Computer Science(all)

Cite this

Kishida, K., & Kando, N. (2003). Two stages refinement of query translation for pivot language approach to cross lingual information retrieval: A trial at CLEF 200. In CLEF 2003 - Working Notes for CLEF 2003 Workshop, co-located with the 7th European Conference on Digital Libraries, ECDL 2003 (Vol. 1169). CEUR-WS.

Two stages refinement of query translation for pivot language approach to cross lingual information retrieval : A trial at CLEF 200. / Kishida, Kazuaki; Kando, Noriko.

CLEF 2003 - Working Notes for CLEF 2003 Workshop, co-located with the 7th European Conference on Digital Libraries, ECDL 2003. Vol. 1169 CEUR-WS, 2003.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Kishida, K & Kando, N 2003, Two stages refinement of query translation for pivot language approach to cross lingual information retrieval: A trial at CLEF 200. in CLEF 2003 - Working Notes for CLEF 2003 Workshop, co-located with the 7th European Conference on Digital Libraries, ECDL 2003. vol. 1169, CEUR-WS, 2003 Cross Language Evaluation Forum Workshop, CLEF 2003, co-located with the 7th European Conference on Digital Libraries, ECDL 2003, Trondheim, Norway, 03/8/21.
Kishida K, Kando N. Two stages refinement of query translation for pivot language approach to cross lingual information retrieval: A trial at CLEF 200. In CLEF 2003 - Working Notes for CLEF 2003 Workshop, co-located with the 7th European Conference on Digital Libraries, ECDL 2003. Vol. 1169. CEUR-WS. 2003
Kishida, Kazuaki ; Kando, Noriko. / Two stages refinement of query translation for pivot language approach to cross lingual information retrieval : A trial at CLEF 200. CLEF 2003 - Working Notes for CLEF 2003 Workshop, co-located with the 7th European Conference on Digital Libraries, ECDL 2003. Vol. 1169 CEUR-WS, 2003.
@inproceedings{4a89ec7aec214632b041a8686b349e5f,
title = "Two stages refinement of query translation for pivot language approach to cross lingual information retrieval: A trial at CLEF 200",
abstract = "This paper reports experimental results of cross-lingual information retrieval from German to Italian. The authors are concerned with CLIR in the case that available language resources are very limited. Thus transitive translation of queries using English as a pivot language was used to search Italian document collections for German queries without any direct bilingual dictionary or MT system of these two languages. In order to remove irrelevant translations produced by the transitive translation, we propose a disambiguation technique, in which two stages of refinement of query translation are executed. Basically, this refinement is based on the idea of pseudo rele-vance feedback. In the first stage, for each source query term, we select a translation candidate that appears most frequently in the set of top-ranked documents searched for a set of terms pro-vided via transitive translation of the source query. Next, in the second stage, a standard query expansion based on pseudo relevance feedback is conducted. Our experiment result showed that the two stages refinement method is able to improve significantly search performance of bilingual IR using a pivot language. However, it also turned out that performance of this method is inferior to that of machine translation method.",
author = "Kazuaki Kishida and Noriko Kando",
year = "2003",
language = "English",
volume = "1169",
booktitle = "CLEF 2003 - Working Notes for CLEF 2003 Workshop, co-located with the 7th European Conference on Digital Libraries, ECDL 2003",
publisher = "CEUR-WS",

}

TY - GEN

T1 - Two stages refinement of query translation for pivot language approach to cross lingual information retrieval

T2 - A trial at CLEF 200

AU - Kishida, Kazuaki

AU - Kando, Noriko

PY - 2003

Y1 - 2003

N2 - This paper reports experimental results of cross-lingual information retrieval from German to Italian. The authors are concerned with CLIR in the case that available language resources are very limited. Thus transitive translation of queries using English as a pivot language was used to search Italian document collections for German queries without any direct bilingual dictionary or MT system of these two languages. In order to remove irrelevant translations produced by the transitive translation, we propose a disambiguation technique, in which two stages of refinement of query translation are executed. Basically, this refinement is based on the idea of pseudo rele-vance feedback. In the first stage, for each source query term, we select a translation candidate that appears most frequently in the set of top-ranked documents searched for a set of terms pro-vided via transitive translation of the source query. Next, in the second stage, a standard query expansion based on pseudo relevance feedback is conducted. Our experiment result showed that the two stages refinement method is able to improve significantly search performance of bilingual IR using a pivot language. However, it also turned out that performance of this method is inferior to that of machine translation method.

AB - This paper reports experimental results of cross-lingual information retrieval from German to Italian. The authors are concerned with CLIR in the case that available language resources are very limited. Thus transitive translation of queries using English as a pivot language was used to search Italian document collections for German queries without any direct bilingual dictionary or MT system of these two languages. In order to remove irrelevant translations produced by the transitive translation, we propose a disambiguation technique, in which two stages of refinement of query translation are executed. Basically, this refinement is based on the idea of pseudo rele-vance feedback. In the first stage, for each source query term, we select a translation candidate that appears most frequently in the set of top-ranked documents searched for a set of terms pro-vided via transitive translation of the source query. Next, in the second stage, a standard query expansion based on pseudo relevance feedback is conducted. Our experiment result showed that the two stages refinement method is able to improve significantly search performance of bilingual IR using a pivot language. However, it also turned out that performance of this method is inferior to that of machine translation method.

UR - http://www.scopus.com/inward/record.url?scp=84961323996&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84961323996&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:84961323996

VL - 1169

BT - CLEF 2003 - Working Notes for CLEF 2003 Workshop, co-located with the 7th European Conference on Digital Libraries, ECDL 2003

PB - CEUR-WS

ER -