TY - GEN
T1 - Node-Centric Random Walk for Fast Index-Free Personalized PageRank
AU - Tsuchida, Kohei
AU - Matsumoto, Naoki
AU - Kaneko, Kunitake
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - Personalized PageRank (PPR) is a popular graph computation in various real-world applications. Since massive real-world graphs are evolving rapidly, PPR computation methods require index-free and fast. In general, index-free methods go through Forward Push phase and random walk Monte-Carlo simulation phase respectively. While existing methods have succeeded in accelerating the Forward Push phase, there is a space for running-time improvements in the second phase that performs a large number of sequential random walks. Through this sequential process, each random walk needs to obtain neighbor nodes for every single step, which causes redundant operation at each node as a result. Our proposal is a node-centric random walk that aggregates random walks at each node and minimizes the total number of obtaining neighbor nodes in the second phase. Most of the random walks can be aggregated while maintaining theoretical guarantees because they do not need to memorize the starting node. In addition, we review the expected running time of random walk Monte-Carlo simulation focusing on the total number of obtaining neighbor nodes. We conducted extensive experiments using four real-world graphs. Experimental results showed that our proposed method is up to 3.3x faster than the existing methods.
AB - Personalized PageRank (PPR) is a popular graph computation in various real-world applications. Since massive real-world graphs are evolving rapidly, PPR computation methods require index-free and fast. In general, index-free methods go through Forward Push phase and random walk Monte-Carlo simulation phase respectively. While existing methods have succeeded in accelerating the Forward Push phase, there is a space for running-time improvements in the second phase that performs a large number of sequential random walks. Through this sequential process, each random walk needs to obtain neighbor nodes for every single step, which causes redundant operation at each node as a result. Our proposal is a node-centric random walk that aggregates random walks at each node and minimizes the total number of obtaining neighbor nodes in the second phase. Most of the random walks can be aggregated while maintaining theoretical guarantees because they do not need to memorize the starting node. In addition, we review the expected running time of random walk Monte-Carlo simulation focusing on the total number of obtaining neighbor nodes. We conducted extensive experiments using four real-world graphs. Experimental results showed that our proposed method is up to 3.3x faster than the existing methods.
KW - Index-Free
KW - Personalized PageRank
KW - Random Walk
UR - http://www.scopus.com/inward/record.url?scp=85147653298&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85147653298&partnerID=8YFLogxK
U2 - 10.1109/ICOIN56518.2023.10049056
DO - 10.1109/ICOIN56518.2023.10049056
M3 - Conference contribution
AN - SCOPUS:85147653298
T3 - International Conference on Information Networking
SP - 194
EP - 199
BT - 37th International Conference on Information Networking, ICOIN 2023
PB - IEEE Computer Society
T2 - 37th International Conference on Information Networking, ICOIN 2023
Y2 - 11 January 2023 through 14 January 2023
ER -