TY - GEN
T1 - People Removal Using Edge and Depth Information
AU - Yae, Shunsuke
AU - Ikehara, Masaaki
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - In this paper, we propose a people removal method from a single image for privacy and other reasons using a three-stage network of depth estimation, semantic segmentation, and inpainting, as shown in Fig. 1. In this three-stage network, we improve semantic segmentation for detecting people. We focus on a special situation of a person and construct a network. It is known that the accuracy of conventional methods can be improved by using edge information. The accuracy of segmentation can be further improved by increasing the accuracy of the edge map. In addition, edge detection does not work well when the person and the background are of the similar color, because edge detects the brightness change of the image. Therefore, in this paper, an adversarial loss function for edge maps is proposed. In addition, since an image with people is expected to have a depth difference from the background image, we use a trained depth estimation network to include the depth image in the input. In this way, it is possible to construct a network for people removal with a high accuracy both quantitatively and qualitatively.
AB - In this paper, we propose a people removal method from a single image for privacy and other reasons using a three-stage network of depth estimation, semantic segmentation, and inpainting, as shown in Fig. 1. In this three-stage network, we improve semantic segmentation for detecting people. We focus on a special situation of a person and construct a network. It is known that the accuracy of conventional methods can be improved by using edge information. The accuracy of segmentation can be further improved by increasing the accuracy of the edge map. In addition, edge detection does not work well when the person and the background are of the similar color, because edge detects the brightness change of the image. Therefore, in this paper, an adversarial loss function for edge maps is proposed. In addition, since an image with people is expected to have a depth difference from the background image, we use a trained depth estimation network to include the depth image in the input. In this way, it is possible to construct a network for people removal with a high accuracy both quantitatively and qualitatively.
KW - Depth estimation
KW - Inpainting
KW - Semantic segmentation
UR - http://www.scopus.com/inward/record.url?scp=85127045722&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85127045722&partnerID=8YFLogxK
U2 - 10.1109/ICCE53296.2022.9730122
DO - 10.1109/ICCE53296.2022.9730122
M3 - Conference contribution
AN - SCOPUS:85127045722
T3 - Digest of Technical Papers - IEEE International Conference on Consumer Electronics
BT - 2022 IEEE International Conference on Consumer Electronics, ICCE 2022
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2022 IEEE International Conference on Consumer Electronics, ICCE 2022
Y2 - 7 January 2022 through 9 January 2022
ER -