Abstract
Inverted index is a core element of current text retrieval systems. They can be dynamically constructed using online indexing approaches in the environment which even a small delay in timeliness cannot be tolerated, and the index must always be queryable and up to date. Recently, efficient online index construction schemes have been proposed, however, previous works have not focused on scalability with the modern commodity hardware resources such as multi-core CPUs. In this paper, we propose a scalable online index construction method that better utilizes multicore CPUs. Using experiments on 30 GB of web data, we demonstrate the efficiency of our method in practice, showing that it dramatically reduces online index construction time without sacrificing query performance.
Original language | English |
---|---|
Pages (from-to) | 29-36 |
Number of pages | 8 |
Journal | Conferences in Research and Practice in Information Technology Series |
Volume | 104 |
Publication status | Published - 2010 Dec 1 |
Event | 21st Australasian Database Conference, ADC 2010 - Brisbane, QLD, Australia Duration: 2010 Jan 18 → 2010 Jan 21 |
Keywords
- Index maintenance
- Information retrieval
- Inverted index
- Online index construction
- Text databases
ASJC Scopus subject areas
- Computer Networks and Communications
- Computer Science Applications
- Hardware and Architecture
- Information Systems
- Software