-
公开(公告)号:US20240046916A1
公开(公告)日:2024-02-08
申请号:US18491890
申请日:2023-10-23
发明人: Manda MILLER , Kirk GOLDMAN , Jon A. HOFFMAN , John PISTONE , Dimple NANWANI , Theodore CLARK
IPC分类号: G10L13/027
CPC分类号: G10L13/027 , G06V10/10
摘要: The present disclosure provides techniques for graphics translation. A subset of a plurality of natural language image descriptions for an image of a product is received. A set of shared natural language descriptors is identified in the subset of the plurality of natural language image descriptions. The set of shared natural language descriptors is aggregated, and a description for the first image is generated based on the aggregated set of shared natural language image descriptions.
-
公开(公告)号:US20230037100A1
公开(公告)日:2023-02-02
申请号:US17386328
申请日:2021-07-27
发明人: Manda MILLER , Kirk GOLDMAN , Jon A. HOFFMAN , John PISTONE , Dimple NANWANI , Theodore CLARK
IPC分类号: G10L13/027
摘要: The present disclosure provides techniques for graphics translation. A plurality of natural language image descriptions is collected for an image of a product. An overall description for the image is generated using one or more models, based on the plurality of natural language image descriptions, by: identifying a set of shared descriptors used in at least a subset of the plurality of natural language image descriptions, and aggregating the set of shared descriptors to form the overall description. A first request to provide a description of the first image is received, and the overall description is returned in response to the first request, where the overall description is output using one or more text-to-speech techniques.
-