TY - JOUR
T1 - Spatio-temporal pseudo relevance feedback for scientific data retrieval
AU - Takeuchi, Shin'ichi
AU - Sugiura, Komei
AU - Akahoshi, Yuhei
AU - Zettsu, Koji
PY - 2017/1/1
Y1 - 2017/1/1
N2 - We consider the problem of searching scientific data from vast heterogeneous scientific data repositories. This problem is challenging because scientific data contain relatively little text information compared to other search targets such as web pages. On the other hand, the metadata in scientific data contain other characteristic information such as spatio-temporal information. Although using this information make it possible to improve the search performance, many widely adopted scientific data search engines use this information exclusively for narrowing down search results. In this paper, we propose a novel query generation method using spatial, temporal, and text information based on pseudo relevance feedback. The proposed method generates new spatio-temporal queries from the initial search results. By using these queries, the search results are reranked such that more related results obtain higher rank. The experimental results show that the proposed method outperforms a baseline method when search targets do not have rich text information.
AB - We consider the problem of searching scientific data from vast heterogeneous scientific data repositories. This problem is challenging because scientific data contain relatively little text information compared to other search targets such as web pages. On the other hand, the metadata in scientific data contain other characteristic information such as spatio-temporal information. Although using this information make it possible to improve the search performance, many widely adopted scientific data search engines use this information exclusively for narrowing down search results. In this paper, we propose a novel query generation method using spatial, temporal, and text information based on pseudo relevance feedback. The proposed method generates new spatio-temporal queries from the initial search results. By using these queries, the search results are reranked such that more related results obtain higher rank. The experimental results show that the proposed method outperforms a baseline method when search targets do not have rich text information.
KW - Pseudo relevance feedback
KW - information retrieval
KW - query generation
KW - scientific data
KW - spatio-temporal and text information
UR - http://www.scopus.com/inward/record.url?scp=85003571674&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85003571674&partnerID=8YFLogxK
U2 - 10.1002/tee.22352
DO - 10.1002/tee.22352
M3 - Article
AN - SCOPUS:85003571674
VL - 12
SP - 124
EP - 131
JO - IEEJ Transactions on Electrical and Electronic Engineering
JF - IEEJ Transactions on Electrical and Electronic Engineering
SN - 1931-4973
IS - 1
ER -