Patent search ap:("Google LLC") AND inv:"Jessica Lee" Page 2

11.

发明授权
Object filtering and information display in an augmented-reality experience 有权

公开(公告)号：US12230030B2

公开(公告)日：2025-02-18

申请号：US18084710

申请日：2022-12-20

Applicant: Google LLC

Inventor： Jessica Lee , Christopher James Kelley , Alok Aggarwal , Harshit Kharbanda

IPC: G06V20/20 , G06F16/9535 , G06T11/00 , G06V10/94

Abstract: Systems and methods for providing scene understanding can include obtaining a plurality of images, stitching images associated with the scene, detecting objects in the scene, and providing information associated with the objects in the scene. The systems and methods can include determining filter tags or query tags that can be selected to filter the plurality of objects, which can then be provided as information to the user to provide further insight on the scene. The information may be provided in an augmented-reality experience via text or other user-interface elements anchored to objects in the images.

12.

发明公开
Visual and Audio Multimodal Searching System 审中-公开

公开(公告)号：US20240362279A1

公开(公告)日：2024-10-31

申请号：US18306638

申请日：2023-04-25

Applicant: Google LLC

Inventor： Harshit Kharbanda , Belinda Luna Zeng , Viviana Caso Corella , Christopher James Kelley , Jessica Lee , Pendar Yousefi , Dounia Berrada , Sundeep Vaddadi , Kai Yu , Balint Miklos , Severin Heiniger , Louis Wang

IPC: G06F16/9532 , G06F16/538 , G06F40/40

CPC classification number: G06F16/9532 , G06F16/538 , G06F40/40

Abstract: A multimodal search system is described. The system can receive image data captured by a camera of a user device. Additionally, the system can receive audio data associated with the image data. The audio data can be captured by a microphone of the user device. Moreover, the system can process the image data to generate visual features. Furthermore, the system can process the audio data to generate a plurality of words. The system can generate a plurality of search terms based on the plurality of words and the visual features. Subsequently, the system can determine one or more search results associated with the plurality of search terms and provide the one or more search results as an output.

13.

发明公开
Object Filtering and Information Display in an Augmented-Reality Experience 审中-公开

公开(公告)号：US20230368527A1

公开(公告)日：2023-11-16

申请号：US18084710

申请日：2022-12-20

Applicant: Google LLC

Inventor： Jessica Lee , Christopher James Kelley , Alok Aggarwal , Harshit Kharbanda

IPC: G06V20/20 , G06T11/00 , G06V10/94 , G06F16/9535

CPC classification number: G06V20/20 , G06T11/00 , G06V10/945 , G06F16/9535 , G06T2200/24

Abstract: Systems and methods for providing scene understanding can include obtaining a plurality of images, stitching images associated with the scene, detecting objects in the scene, and providing information associated with the objects in the scene. The systems and methods can include determining filter tags or query tags that can be selected to filter the plurality of objects, which can then be provided as information to the user to provide further insight on the scene. The information may be provided in an augmented-reality experience via text or other user-interface elements anchored to objects in the images.

14.

发明授权
Dynamically adjusting instructions in an augmented-reality experience 有权

公开(公告)号：US12254785B2

公开(公告)日：2025-03-18

申请号：US17969303

申请日：2022-10-19

Applicant: Google LLC

Inventor： Jessica Lee , David Trotter Oleson , Fabian Roth , Nils Grimsmo

IPC: G09B7/04 , G06F3/04845 , G06F40/205 , G06T11/60 , G06V10/94 , G06V20/70 , G06V30/12 , G06V30/19

Abstract: Systems and methods for augmented-reality tutoring can utilize optical character recognition, natural language processing, and/or augmented-reality rendering for providing real-time notifications for completing a determined task. The systems and methods can include utilizing one or more machine-learned models trained for quantitative reasoning and can include providing a plurality of different user interface elements at different times.

15.

发明申请
Systems and Methods for Analyzing Text Extracted from Images and Performing Appropriate Transformations on the Extracted Text 有权

公开(公告)号：US20250087207A1

公开(公告)日：2025-03-13

申请号：US18736113

申请日：2024-06-06

Applicant: Google LLC

Inventor： Harshit Kharbanda , Jessica Lee , Christopher James Kelley , Fabian Roth , Dounia Berrada , Samer Hassan Hassan , Afroz Mohiuddin , Misha Khalman , Ali Essam Ali Elqursh , Belinda Luna Zeng

IPC: G10L15/183 , G06F16/583 , G06V10/778 , G06V30/14 , G06V30/148 , G10L15/22 , G10L15/30

Abstract: The present disclosure provides computer-implemented methods, systems, and devices for responding to requests associated with an image. A computing system obtains, wherein the image depicts a first set of textual content. The computing system determines one or more characteristics of the first set of textual content. The computing system determines a response type from a plurality of response types based on the one or more characteristics. The computing system generates a model input, wherein the model input comprises data descriptive of the first set of textual content and a prompt associated with the response type. The computing system provides providing the model input as an input to a machine-learned language model. The computing system receives a second set of text as an output of the machine-learned language model as a result of the machine-learned language model processing the model input. The computing system provides the second set of text for display to a user, wherein the second set of textual content is associated with the response type.

16.

发明申请
Visual Citations for Information Provided in Response to Multimodal Queries 有权

公开(公告)号：US20240378237A1

公开(公告)日：2024-11-14

申请号：US18314663

申请日：2023-05-09

Applicant: Google LLC

Inventor： Harshit Kharbanda , Jessica Lee , Christopher James Kelley , Belinda Luna Zeng , Louis Wang

IPC: G06F16/583 , G06V10/74

Abstract: Result images are retrieved based on a similarity to a query image. A set of textual inputs is processed with a machine-learned language model to obtain a language output comprising textual content, wherein the set of textual inputs comprises textual content from source documents that include the result images, and a prompt associated with the query image. The language output and the result images are provided to a user computing device. Information is received descriptive of an indication by a user that a first result image is visually dissimilar to the query image. Textual content associated with the source document that includes the first result image from the set of textual inputs is removed. The set of textual inputs is processed with the machine-learned language model to obtain a refined language output. The refined language output is provided to the user computing device.

17.

外观设计
Display screen or portion thereof with transitional graphical user interface 有权

公开(公告)号：USD1042529S1

公开(公告)日：2024-09-17

申请号：US29869040

申请日：2022-12-20

Applicant: Google LLC

Designer： Jessica Lee , Bálint Miklos , Harshit Kharbanda , Severin Heiniger

Abstract: FIG. 1 is a front view of a display screen or portion thereof with a transitional graphical user interface showing a first image of the claimed design; and,
FIG. 2 is a second image thereof.
The outermost evenly spaced broken lines show an electronic device that forms no part of the claimed design. The dot-dash broken lines show a display screen or portion thereof and form no part of the claimed design. The remaining broken lines and all lined-through text, show portions of the transitional graphical user interface, and form no part of the claimed design.
The appearance of the transitional image sequentially transitions between the views of FIGS. 1-2. The process or period in which an image transitions to another forms no part of the claimed design.

18.

发明授权
Systems and methods for analyzing text extracted from images and performing appropriate transformations on the extracted text 有权

公开(公告)号：US12033620B1

公开(公告)日：2024-07-09

申请号：US18463951

申请日：2023-09-08

Applicant: Google LLC

Inventor： Harshit Kharbanda , Jessica Lee , Christopher James Kelley , Fabian Roth , Dounia Berrada , Samer Hassan Hassan , Afroz Mohiuddin , Mikhail Khalman , Ali Essam Ali Elqursh , Belinda Luna Zeng

IPC: G06F3/0483 , G06F16/30 , G06F16/33 , G06F16/583 , G06V10/778 , G06V30/14 , G06V30/148 , G10L15/183 , G10L15/22 , G10L15/30

CPC classification number: G10L15/183 , G06F16/5846 , G06V10/778 , G06V30/1456 , G06V30/153 , G10L15/22 , G10L15/30

Abstract: The present disclosure provides computer-implemented methods, systems, and devices for responding to requests associated with an image. A computing system obtains, wherein the image depicts a first set of textual content. The computing system determines one or more characteristics of the first set of textual content. The computing system determines a response type from a plurality of response types based on the one or more characteristics. The computing system generates a model input, wherein the model input comprises data descriptive of the first set of textual content and a prompt associated with the response type. The computing system provides providing the model input as an input to a machine-learned language model. The computing system receives a second set of text as an output of the machine-learned language model as a result of the machine-learned language model processing the model input. The computing system provides the second set of text for display to a user, wherein the second set of textual content is associated with the response type.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification