Publication No.: US20240338962A1
Publication Date: 2024-10-10
Application No.: US18747599
Filing Date: 2024-06-19
Inventor: Haiwei WANG, Zhongwen ZHANG, Gang LI
IPC: G06V30/414, G06V30/418
CPC classification number: G06V30/414, G06V30/418
Abstract: The present disclosure provides an image-based human-computer interaction method, which includes: acquiring a to-be-analyzed image, and determining image layout information and image content information of the to-be-analyzed image, where the to-be-analyzed image includes data of multiple modalities, the image layout information represents the distribution of image elements of a preset granularity in the to-be-analyzed image, and the image content information represents the content expressed by the modal data in the to-be-analyzed image; and determining, in response to acquiring question information, response information corresponding to the question information according to the image layout information and the image content information, where the question information represents a question posed by a user about the to-be-analyzed image, and the response information represents an answer to the question information. Extracting layout information and content information from the image improves the accuracy of the answers and the user experience of human-computer interaction.
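The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the `LayoutElement` schema, the reading-order serialization, and the `answer_fn` callback (standing in for whatever model produces the response information) are all assumptions introduced here for illustration, and the layout and content extraction steps are taken as already done.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class LayoutElement:
    """One image element at the preset granularity (hypothetical schema)."""
    kind: str                        # e.g. "title", "table", "figure"
    bbox: Tuple[int, int, int, int]  # (x0, y0, x1, y1) in pixels: layout information
    content: str                     # what the element expresses: content information


def build_context(elements: List[LayoutElement]) -> str:
    """Serialize layout and content, ordered top to bottom, then left to right."""
    ordered = sorted(elements, key=lambda e: (e.bbox[1], e.bbox[0]))
    return "\n".join(f"[{e.kind} @ {e.bbox}] {e.content}" for e in ordered)


def respond(question: str,
            elements: List[LayoutElement],
            answer_fn: Callable[[str], str]) -> str:
    """Combine the question with layout and content, then delegate to answer_fn.

    answer_fn is an assumed placeholder for the component that generates
    the response information; the abstract does not specify it.
    """
    prompt = f"{build_context(elements)}\n\nQuestion: {question}"
    return answer_fn(prompt)
```

A caller would first run its own layout and content extraction to produce the `LayoutElement` list, then pass the user's question and a model-backed `answer_fn` to `respond`. Keeping the bounding box next to each element's content is one way to let the answering component use both kinds of information at once.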