TY - GEN
T1 - Dynamic semantic retrieval space reconstruction for WWW environments
AU - Chen, Xing
AU - Kiyoki, Yasushi
PY - 2007/12/1
Y1 - 2007/12/1
N2 - Semantic retrieval for Web pages is an important issue in WWW environments. One of the typical methods for implementing semantic retrieval is to map Web pages onto an orthogonal space according to contents of Web pages. In this paper, we represent a space creation method for semantic retrieval for WWW environments. In this method, the orthogonal space, which is referred to as the retrieval space, is created by using sample Web pages. A sample Web page is selected from a Web page cluster as the typical Web page to stand all the Web pages in the cluster. As Web pages change rapidly every day, that is, new Web pages appear and some existing Web pages disappear, the sample Web pages have to be reselected dynamically according to the changes of Web pages. Following the changes of sample Web pages, the retrieval space has to be reconstructed dynamically. In general, the retrieval space reconstruction is a time consuming process because all the retrieval candidates in the space have to be re-mapped to the reconstructed retrieval space. In this paper, we present a mechanism for reducing the processing overhead of the retrieval space reconstruction. In our mechanism, we only reconstruct some parts of the retrieval space which are related to the modified sample Web pages. Furthermore, we present a fast search method to search Web pages which are needed to be re-mapped onto the retrieval space. By using these methods, our system is efficient for the semantic retrieval for Web pages in the WWW environment.
AB - Semantic retrieval for Web pages is an important issue in WWW environments. One of the typical methods for implementing semantic retrieval is to map Web pages onto an orthogonal space according to contents of Web pages. In this paper, we represent a space creation method for semantic retrieval for WWW environments. In this method, the orthogonal space, which is referred to as the retrieval space, is created by using sample Web pages. A sample Web page is selected from a Web page cluster as the typical Web page to stand all the Web pages in the cluster. As Web pages change rapidly every day, that is, new Web pages appear and some existing Web pages disappear, the sample Web pages have to be reselected dynamically according to the changes of Web pages. Following the changes of sample Web pages, the retrieval space has to be reconstructed dynamically. In general, the retrieval space reconstruction is a time consuming process because all the retrieval candidates in the space have to be re-mapped to the reconstructed retrieval space. In this paper, we present a mechanism for reducing the processing overhead of the retrieval space reconstruction. In our mechanism, we only reconstruct some parts of the retrieval space which are related to the modified sample Web pages. Furthermore, we present a fast search method to search Web pages which are needed to be re-mapped onto the retrieval space. By using these methods, our system is efficient for the semantic retrieval for Web pages in the WWW environment.
KW - Content analysis and indexing
KW - Data mining and database theory
KW - Semantic retrieval
KW - Vector space
UR - http://www.scopus.com/inward/record.url?scp=56149127139&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=56149127139&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:56149127139
SN - 9780889866560
T3 - Proceedings of the 3rd IASTED International Conference on Advances in Computer Science and Technology, ACST 2007
SP - 69
EP - 77
BT - Proceedings of the 3rd IASTED International Conference on Advances in Computer Science and Technology, ACST 2007
T2 - 3rd IASTED International Conference on Advances in Computer Science and Technology, ACST 2007
Y2 - 2 April 2007 through 4 April 2007
ER -