-
公开(公告)号:US20240394942A1
公开(公告)日:2024-11-28
申请号:US18323029
申请日:2023-05-24
Applicant: Adobe Inc.
Inventor: Anant Shankhdhar , Samyak Sanjay Mehta , Shreya Singh , K. V. Vikram , Tripti Shukla , Srikrishna Karanam , Balaji Vasan Srinivasan , Vishwa Vinay , Niyati Himanshu Chhaya
IPC: G06T11/60 , G06F16/58 , G06F40/211 , G06V30/418
Abstract: The present disclosure relates to systems, non-transitory computer-readable media, and methods for expanding a digital document including a sequence of informational data via supplemental multimodal digital content. In particular, the system expands digital documents with multimodal granular details to dynamically integrate supplemental in-depth information to the digital document. For example, in response to a selection of a specific portion of a digital document, the system generates expanded multimodal content (e.g., text and image content) for the selected portion of the digital document from external text and image sources. Indeed, the system uses existing content from the digital document to select images and combine the selected images with text into image-text pairs that are textually and visually consistent with the digital document. Moreover, the system expands the digital document by inserting the image-text pairs in connection with the selected portion of the digital document.