TY - GEN
T1 - Generating social network features for link-based classification
AU - Karamon, Jun
AU - Matsuo, Yutaka
AU - Yamamoto, Hikaru
AU - Ishizuka, Mitsuru
PY - 2007
Y1 - 2007
N2 - There have been numerous attempts at the aggregation of attributes for relational data mining. Recently, an increasing number of studies have been undertaken to process social network data, partly because of the fact that so much social network data has become available. Among the various tasks in link mining, a popular task is link-based classification, by which samples are classified using the relations or links that are present among them. On the other hand, we sometimes employ traditional analytical methods in the field of social network analysis using e.g., centrality measures, structural holes, and network clustering. Through this study, we seek to bridge the gap between the aggregated features from the network data and traditional indices used in social network analysis. The notable feature of our algorithm is the ability to invent several indices that are well studied in sociology. We first define general operators that are applicable to an adjacent network. Then the combinations of the operators generate new features, some of which correspond to traditional indices, and others which are considered to be new. We apply our method for classification to two different datasets, thereby demonstrating the effectiveness of our approach.
AB - There have been numerous attempts at the aggregation of attributes for relational data mining. Recently, an increasing number of studies have been undertaken to process social network data, partly because of the fact that so much social network data has become available. Among the various tasks in link mining, a popular task is link-based classification, by which samples are classified using the relations or links that are present among them. On the other hand, we sometimes employ traditional analytical methods in the field of social network analysis using e.g., centrality measures, structural holes, and network clustering. Through this study, we seek to bridge the gap between the aggregated features from the network data and traditional indices used in social network analysis. The notable feature of our algorithm is the ability to invent several indices that are well studied in sociology. We first define general operators that are applicable to an adjacent network. Then the combinations of the operators generate new features, some of which correspond to traditional indices, and others which are considered to be new. We apply our method for classification to two different datasets, thereby demonstrating the effectiveness of our approach.
UR - http://www.scopus.com/inward/record.url?scp=38049108148&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=38049108148&partnerID=8YFLogxK
U2 - 10.1007/978-3-540-74976-9_15
DO - 10.1007/978-3-540-74976-9_15
M3 - Conference contribution
AN - SCOPUS:38049108148
SN - 9783540749752
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 127
EP - 139
BT - Knowledge Discovery in Database
PB - Springer Verlag
T2 - 11th European Conference on Principles and Practice of Knowledge Discovery in Databases, PKDD 2007
Y2 - 17 September 2007 through 21 September 2007
ER -