TY - GEN
T1 - SPIT callers detection with unsupervised Random Forests classifier
AU - Toyoda, Kentaroh
AU - Sasase, Iwao
PY - 2013/1/1
Y1 - 2013/1/1
N2 - As VoIP (Voice over IP) grows rapidly, it is expected to prevail tremendous unsolicited advertisement calls, which type of calls is referred to SPIT (SPam over Internet Telephony). SPIT detection is more difficult to execute than email SPAM detection since the callee or SPIT detection system does not tell whether it is SPIT or legitimate call until he/she actually takes a call. Recently, many SPIT detection techniques are proposed by finding outliers of call patterns. However, most of these techniques suffer from setting a threshold to distinguish that the caller is legitimate or not and this could cause to high false negative rate or low true positive rate. This is because these techniques analyse call pattern by a single feature e.g. call frequency or average call duration. In this paper, we propose a multi-feature call pattern analysis with unsupervised Random Forests classifier, which is one of the excellent classification algorithms. We also propose two simple but helpful features for better classification. We show the effectiveness of Random Forests based classification without supervised training data and which features contribute to classification.
AB - As VoIP (Voice over IP) grows rapidly, it is expected to prevail tremendous unsolicited advertisement calls, which type of calls is referred to SPIT (SPam over Internet Telephony). SPIT detection is more difficult to execute than email SPAM detection since the callee or SPIT detection system does not tell whether it is SPIT or legitimate call until he/she actually takes a call. Recently, many SPIT detection techniques are proposed by finding outliers of call patterns. However, most of these techniques suffer from setting a threshold to distinguish that the caller is legitimate or not and this could cause to high false negative rate or low true positive rate. This is because these techniques analyse call pattern by a single feature e.g. call frequency or average call duration. In this paper, we propose a multi-feature call pattern analysis with unsupervised Random Forests classifier, which is one of the excellent classification algorithms. We also propose two simple but helpful features for better classification. We show the effectiveness of Random Forests based classification without supervised training data and which features contribute to classification.
UR - http://www.scopus.com/inward/record.url?scp=84891355747&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84891355747&partnerID=8YFLogxK
U2 - 10.1109/ICC.2013.6654830
DO - 10.1109/ICC.2013.6654830
M3 - Conference contribution
AN - SCOPUS:84891355747
SN - 9781467331227
T3 - IEEE International Conference on Communications
SP - 2068
EP - 2072
BT - 2013 IEEE International Conference on Communications, ICC 2013
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2013 IEEE International Conference on Communications, ICC 2013
Y2 - 9 June 2013 through 13 June 2013
ER -