This research experimentally investigates the performance of conventional rule interestingness measures and discusses their usefulness for supporting KDD through human-system interaction in medical domain. We compared the evaluation results by a medical expert and those by selected sixteen kinds of interestingness measures for the rules discovered in a dataset on hepatitis. χ2 measure, recall, and accuracy demonstrated the highest performance, and specificity and prevalence did the lowest. The interestingness measures showed a complementary relationship for each other. These results indicated that some interestingness measures have the possibility to predict really interesting rules at a certain level and that the combinational use of interestingness measures will be useful. We then discussed how to combinationally utilize interestingness measures and proposed a post-processing user interface utilizing them, which supports KDD through human-system interaction.