TY - GEN
T1 - Building a Video-and-Language Dataset with Human Actions for Multimodal Logical Inference
AU - Suzuki, Riko
AU - Yanaka, Hitomi
AU - Mineshima, Koji
AU - Bekki, Daisuke
N1 - Funding Information:
This work was partially supported by JST CREST Grant Number JPMJCR20D2, Japan. Thanks to the anonymous reviewers for helpful comments. We would also like to thank Mai Yokozeki and Natsuki Murakami for their contributions.
Publisher Copyright:
© 2021 Association for Computational Linguistics
PY - 2021
Y1 - 2021
N2 - This paper introduces a new video-and-language dataset with human actions for multimodal logical inference, which focuses on intentional and aspectual expressions that describe dynamic human actions. The dataset consists of 200 videos, 5,554 action labels, and 1,942 action triplets of the form (subject, predicate, object) that can be translated into logical semantic representations. The dataset is expected to be useful for evaluating multimodal inference systems between videos and semantically complicated sentences including negation and quantification.
AB - This paper introduces a new video-and-language dataset with human actions for multimodal logical inference, which focuses on intentional and aspectual expressions that describe dynamic human actions. The dataset consists of 200 videos, 5,554 action labels, and 1,942 action triplets of the form (subject, predicate, object) that can be translated into logical semantic representations. The dataset is expected to be useful for evaluating multimodal inference systems between videos and semantically complicated sentences including negation and quantification.
UR - http://www.scopus.com/inward/record.url?scp=85138285564&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85138285564&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85138285564
T3 - MMSR 2021 - Multimodal Semantic Representations, Proceedings of the 1st Workshop
SP - 102
EP - 107
BT - MMSR 2021 - Multimodal Semantic Representations, Proceedings of the 1st Workshop
A2 - Donatelli, Lucia
A2 - Krishnaswamy, Nikhil
A2 - Lai, Kenneth
A2 - Pustejovsky, James
PB - Association for Computational Linguistics (ACL)
T2 - 1st Workshop on Multimodal Semantic Representations, MMSR 2021
Y2 - 16 June 2021
ER -