Nowadays, numerous home electric appliances with voice guidance functions are available. These functions not only tell us how to operate the product but also make us feel attachment and familiarity as we would have for a human. On the other hand, at the beginning of the design process, an image of the product is clearly decided. It is necessary for a good design to have elements that are consistent with the image. However, the images that the users have by voice guidance with different characters have not yet been reported. In this study, we made 3-D CG videos of home electric appliances (a microwave) with different images and combined them with varying voice guidance. Participants evaluated each combination using the consistency with the product image. In the result, we reveal the requirements necessary for voice guidance to increase consistency and the relation between voice guidance functions and various images.