-
公开(公告)号:US20240394840A1
公开(公告)日:2024-11-28
申请号:US18669939
申请日:2024-05-21
Applicant: Google LLC
Inventor: Elchonon Zeav Lapin , Xibing Yang , Amit Handa , Apurv Suman , Siddhant Mittal , Ashish Dilipchand Bora , Thorne Wolfenbarger , Naga Sreenivas Meruva , Yudong Sun , Rahul Guin , Arie Sharon , Beatriz Alessio Robles Orozco , Yuanzhen Li , Zhongyue Zheng , Mohammad Izadi
Abstract: Using artificial intelligence (AI), imagery may be created for content in response to verbal or textual input. The imagery includes an object, such as a product, and a quality of the image is improved using pre-processing techniques before the image is generated and post-processing techniques after the image is generated. The pre-processing may include upscaling the object in the original image, segmenting the object from its background in the captured image, adding an outline or border stroke to the object. The post-processing techniques may include removing the object from the AI-generated background while keeping shadows and other effects in place, blurring portions of the AI-generated background where the object will be positioned, removing the outline from the object, and re-positioning the object in the AI-generated background with the outline removed.
-
2.
公开(公告)号:US11170772B2
公开(公告)日:2021-11-09
申请号:US16269275
申请日:2019-02-06
Applicant: Google LLC
Inventor: Ulas Kirazci , Adam Coimbra , Abraham Lee , Wei Dong , Thushan Amarasiriwardena , Yudong Sun , Xiao Gao
IPC: G10L15/22 , G06F3/0485 , G06F3/16 , G10L13/02
Abstract: Techniques are described herein for multi-modal interaction between users, automated assistants, and other computing services. In various implementations, a user may engage with the automated assistant in order to further engage with a third party computing service. In some implementations, the user may advance through dialog state machines associated with third party computing service using both verbal input modalities and input modalities other than verbal modalities, such as visual/tactile modalities.
-
3.
公开(公告)号:US11200893B2
公开(公告)日:2021-12-14
申请号:US16269275
申请日:2019-02-06
Applicant: Google LLC
Inventor: Ulas Kirazci , Adam Coimbra , Abraham Lee , Wei Dong , Thushan Amarasiriwardena , Yudong Sun , Xiao Gao
IPC: G10L15/22 , G06F3/0485 , G06F3/16 , G10L13/02
Abstract: Techniques are described herein for multi-modal interaction between users, automated assistants, and other computing services. In various implementations, a user may engage with the automated assistant in order to further engage with a third party computing service. In some implementations, the user may advance through dialog state machines associated with third party computing service using both verbal input modalities and input modalities other than verbal modalities, such as visual/tactile modalities.
-
4.
公开(公告)号:US20190341040A1
公开(公告)日:2019-11-07
申请号:US16269275
申请日:2019-02-06
Applicant: Google LLC
Inventor: Ulas Kirazci , Adam Coimbra , Abraham Lee , Wei Dong , Thushan Amarasiriwardena , Yudong Sun , Xiao Gao
IPC: G10L15/22 , G10L13/02 , G06F3/16 , G06F3/0485
Abstract: Techniques are described herein for multi-modal interaction between users, automated assistants, and other computing services. In various implementations, a user may engage with the automated assistant in order to further engage with a third party computing service. In some implementations, the user may advance through dialog state machines associated with third party computing service using both verbal input modalities and input modalities other than verbal modalities, such as visual/tactile modalities.
-
-
-