-
Publication No.: US20240311421A1
Publication Date: 2024-09-19
Application No.: US18182467
Application Date: 2023-03-13
Applicant: Google LLC
Inventor: Utsav Lathia, Sundeep Vaddadi
IPC: G06F16/532, G06F16/538, G06F16/583, G06V10/77
CPC classification number: G06F16/532, G06F16/538, G06F16/583, G06V10/7715
Abstract: Systems and methods disclosed herein can leverage an embedding model to generate an image embedding for image data. The image embedding can then be utilized to determine relevant search results in each of a plurality of datasets. The systems and methods may include a pure embedding search for one dataset and a multimodal search for another dataset. One or more of the datasets may be selected for search based on one or more contexts associated with the user and/or the image. The search results may then be provided simultaneously to a user computing system.
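The flow in this abstract (one image embedding, several datasets, a different search mode per dataset, dataset selection by context) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the application: the toy two-dimensional embeddings, the `DATASETS` layout, the function names, cosine similarity as the ranking measure, and word overlap as the second modality.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0

def embedding_search(query_emb, items, k=1):
    """Pure embedding search: rank items by similarity to the image embedding."""
    scored = [(cosine(query_emb, it["emb"]), it["id"]) for it in items]
    return [item_id for _, item_id in sorted(scored, reverse=True)[:k]]

def multimodal_search(query_emb, query_text, items, k=1, text_weight=0.5):
    """Hypothetical multimodal search: embedding similarity plus word overlap."""
    words = set(query_text.lower().split())
    scored = []
    for it in items:
        overlap = len(words & set(it["text"].lower().split()))
        scored.append((cosine(query_emb, it["emb"]) + text_weight * overlap, it["id"]))
    return [item_id for _, item_id in sorted(scored, reverse=True)[:k]]

def search_all(query_emb, query_text, datasets, context):
    """Search only the datasets selected for this context; return all results together."""
    results = {}
    for name, spec in datasets.items():
        if context not in spec["contexts"]:
            continue  # dataset not selected for this context
        if spec["mode"] == "embedding":
            results[name] = embedding_search(query_emb, spec["items"])
        else:
            results[name] = multimodal_search(query_emb, query_text, spec["items"])
    return results

# Toy data (hypothetical): the same image embedding is reused for every dataset.
DATASETS = {
    "products": {"mode": "embedding", "contexts": {"shopping"},
                 "items": [{"id": "p1", "emb": [1.0, 0.0]},
                           {"id": "p2", "emb": [0.0, 1.0]}]},
    "web": {"mode": "multimodal", "contexts": {"shopping", "general"},
            "items": [{"id": "w1", "emb": [1.0, 0.0], "text": "red running shoe"},
                      {"id": "w2", "emb": [0.9, 0.1], "text": "blue sandal"}]},
    "landmarks": {"mode": "embedding", "contexts": {"travel"},
                  "items": [{"id": "l1", "emb": [0.0, 1.0]}]},
}

results = search_all([1.0, 0.0], "red shoe", DATASETS, "shopping")
print(results)
```

With the "shopping" context, the "landmarks" dataset is skipped and the other two are searched with their respective modes.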
-
Publication No.: US20200211288A1
Publication Date: 2020-07-02
Application No.: US16620264
Application Date: 2019-10-07
Applicant: Google LLC
Inventor: Bryan Woods, Jianing Wei, Sundeep Vaddadi, Cheng Yang, Konstantine Tsotsos, Keith Schaefer, Leon Wong, Keir Banks Mierle, Matthias Grundmann
IPC: G06T19/00, G06T19/20, G06F3/0481
Abstract: In a general aspect, a method can include receiving data defining an augmented reality (AR) environment including a representation of a physical environment, and changing tracking of an AR object within the AR environment between region-tracking mode and plane-tracking mode.
-
Publication No.: US20240362279A1
Publication Date: 2024-10-31
Application No.: US18306638
Application Date: 2023-04-25
Applicant: Google LLC
Inventor: Harshit Kharbanda, Belinda Luna Zeng, Viviana Caso Corella, Christopher James Kelley, Jessica Lee, Pendar Yousefi, Dounia Berrada, Sundeep Vaddadi, Kai Yu, Balint Miklos, Severin Heiniger, Louis Wang
IPC: G06F16/9532, G06F16/538, G06F40/40
CPC classification number: G06F16/9532, G06F16/538, G06F40/40
Abstract: A multimodal search system is described. The system can receive image data captured by a camera of a user device. Additionally, the system can receive audio data associated with the image data. The audio data can be captured by a microphone of the user device. Moreover, the system can process the image data to generate visual features. Furthermore, the system can process the audio data to generate a plurality of words. The system can generate a plurality of search terms based on the plurality of words and the visual features. Subsequently, the system can determine one or more search results associated with the plurality of search terms and provide the one or more search results as an output.
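The step of combining recognized words with visual features into search terms can be sketched as follows. This is only an illustration under stated assumptions: the visual features are represented as hypothetical `(label, confidence)` pairs, the words are assumed to come from an upstream speech recognizer, and the `STOPWORDS` set, the `min_conf` threshold, and the function name are invented for the example and do not appear in the application.

```python
# Hypothetical stopword list; a real system would use something richer.
STOPWORDS = {"what", "is", "this", "the", "a", "an", "where", "can", "i"}

def build_search_terms(words, visual_features, min_conf=0.5):
    """Merge confident visual labels with the informative spoken words."""
    terms = []
    # Keep visual labels whose (assumed) confidence clears the threshold.
    for label, conf in visual_features:
        if conf >= min_conf and label not in terms:
            terms.append(label)
    # Add spoken words that are not stopwords and not already present.
    for word in words:
        w = word.lower()
        if w not in STOPWORDS and w not in terms:
            terms.append(w)
    return terms

# Toy inputs standing in for the outputs of the image and audio processing steps.
words = "where can I buy this lamp".split()
visual_features = [("brass floor lamp", 0.9), ("sofa", 0.3)]

terms = build_search_terms(words, visual_features)
print(terms)
```

The resulting list would then be passed to whatever search backend determines the results; that part is outside this sketch.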
-
Publication No.: US11494990B2
Publication Date: 2022-11-08
Application No.: US16620264
Application Date: 2019-10-07
Applicant: Google LLC
Inventor: Bryan Woods, Jianing Wei, Sundeep Vaddadi, Cheng Yang, Konstantine Tsotsos, Keith Schaefer, Leon Wong, Keir Banks Mierle, Matthias Grundmann
IPC: G06T19/00, G06F3/04815, G06T19/20
Abstract: In a general aspect, a method can include receiving data defining an augmented reality (AR) environment including a representation of a physical environment, and changing tracking of an AR object within the AR environment between region-tracking mode and plane-tracking mode.
-