Patent search ap:("Amazon Technologies Page Inc.") AND inv:"Gokhan Tur"

1.

发明申请
OBJECT TRACKING AND ENTITY RESOLUTION 有权

公开(公告)号：US20250028321A1

公开(公告)日：2025-01-23

申请号：US18907880

申请日：2024-10-07

Applicant: Amazon Technologies, Inc.

Inventor： Gunnar Atli Sigurdsson , Robinson Piramuthu , Gokhan Tur

IPC: G05D1/00 , G06T7/73 , G10L13/08 , G10L15/18 , G10L15/22

Abstract: Described herein is a system for tracking objects and performing dynamic entity resolution using image data. For example, the system may build an environment map and populate the map with objects present in the environment. As the devices move about the environment it may capture image data and, based on its position and/or configuration of its components, may determine updated locations of objects that move in the environment. Upon receiving a query from a user, based on the location of the objects relative to the device/user, the system can interpret gestures and voice commands to infer which object is specified by the voice command. To build the environment map, the system performs object detection to generate bounding boxes associated with an object, then clusters the bounding boxes into a three-dimensional (3D) object associated with 3D coordinates. As the system tracks the object using the 3D coordinates while maintaining two-dimensional (2D) information (e.g., bounding boxes and other features), the system can use existing 2D models to process objects in 3D.

2.

发明授权
Natural language processing of declarative statements 有权

公开(公告)号：US12008985B2

公开(公告)日：2024-06-11

申请号：US16907680

申请日：2020-06-22

Applicant: Amazon Technologies, Inc.

Inventor： Qiaozi Gao , Divyanshu Brijmohan Verma , Govindarajan Sundaram Thattai , Qing Ping , Joel Joseph Chengottusseriyil , Ivan Vitomir Stojanovic , Feiyang Niu , Gokhan Tur , Charles J Allen

IPC: G10L15/22 , G10L15/18 , G10L15/26 , G10L13/08

CPC classification number: G10L15/1815 , G10L15/1822 , G10L15/22 , G10L15/26 , G10L13/08 , G10L2015/223 , G10L2015/225 , G10L2015/227 , G10L2015/228

Abstract: Devices and techniques are generally described for learning personalized responses to declarative natural language inputs. In various examples, a first natural language input may be received. The first natural language input may correspond to intent data corresponding to a declarative user input. In some examples, a dialog session may be initiated with the first user. An action intended by the first user for the first natural language input may be determined based at least in part on the dialog session. In various examples, first data representing the action may be stored in association with second data representing a state described by at least a portion of the first natural language input.

3.

发明授权
Virtual conversational companion 有权

公开(公告)号：US12205577B1

公开(公告)日：2025-01-21

申请号：US17217031

申请日：2021-03-30

Applicant: Amazon Technologies, Inc.

Inventor： Taehwan Kim , Sanqiang Zhao , Robinson Piramuthu , Seokhwan Kim , Yang Liu , Gokhan Tur , Eshan Bhatnagar

IPC: G10L15/22 , G06T13/20 , G06T13/40 , G06T13/80 , G10L13/08 , G10L15/18 , G10L25/57

Abstract: Techniques for rendering visual content, in response to one or more utterances, are described. A device receives one or more utterances that define a parameter(s) for desired output content. A system (or the device) identifies natural language data corresponding to the desired content, and uses natural language generation processes to update the natural language data based on the parameter(s). The system (or the device) then generates an image based on the updated natural language data. The system (or the device) also generates video data of an avatar. The device displays the image and the avatar, and synchronizes movements of the avatar with output of synthesized speech of the updated natural language data. The device may also display subtitles of the updated natural language data, and cause a word of the subtitles to be emphasized when synthesized speech of the word is being output.

4.

发明授权
Object tracking and entity resolution 有权

公开(公告)号：US12117838B1

公开(公告)日：2024-10-15

申请号：US17218621

申请日：2021-03-31

Applicant: Amazon Technologies, Inc.

Inventor： Gunnar Atli Sigurdsson , Robinson Piramuthu , Gokhan Tur

IPC: G06T7/70 , G05D1/00 , G05D1/02 , G06T7/73 , G10L13/08 , G10L15/18 , G10L15/22

CPC classification number: G05D1/0219 , G05D1/0088 , G05D1/0251 , G05D1/0274 , G06T7/73 , G10L13/08 , G10L15/1807 , G10L15/22 , G10L2015/223

Abstract: Described herein is a system for tracking objects and performing dynamic entity resolution using image data. For example, the system may build an environment map and populate the map with objects present in the environment. As the devices move about the environment it may capture image data and, based on its position and/or configuration of its components, may determine updated locations of objects that move in the environment. Upon receiving a query from a user, based on the location of the objects relative to the device/user, the system can interpret gestures and voice commands to infer which object is specified by the voice command. To build the environment map, the system performs object detection to generate bounding boxes associated with an object, then clusters the bounding boxes into a three-dimensional (3D) object associated with 3D coordinates. As the system tracks the object using the 3D coordinates while maintaining two-dimensional (2D) information (e.g., bounding boxes and other features), the system can use existing 2D models to process objects in 3D.

5.

发明授权
Natural language processing 有权

公开(公告)号：US11978437B1

公开(公告)日：2024-05-07

申请号：US17119099

申请日：2020-12-11

Applicant: Amazon Technologies, Inc.

Inventor： Govindarajan Sundaram Thattai , Qing Ping , Feiyang Niu , Joel Joseph Chengottusseriyil , Prashanth Rajagopal , Qiaozi Gao , Aishwarya Naresh Reganti , Gokhan Tur , Dilek Hakkani-Tur , Rohit Prasad , Premkumar Natarajan

IPC: G10L15/00 , G06F16/22 , G06F21/62 , G10L15/18 , G10L15/22 , G10L15/30 , G10L15/183 , G10L15/19

CPC classification number: G10L15/1815 , G06F16/22 , G06F21/6218 , G10L15/22 , G10L15/30 , G10L15/1822 , G10L15/183 , G10L15/19 , G10L2015/223

Abstract: Devices and techniques are generally described for learning personalized concepts for natural language processing. In various examples, a first natural language input may be received. In some examples, a determination may be made that the first natural language input comprises non-actionable slot data. A dialog session may be initiated with the user. In some examples, first slot data that is indicated by the user during the dialog session may be determined. In various examples, data representing the first slot data may be stored in a database in association with the first natural language input.

6.

发明申请
NATURAL LANGUAGE PROCESSING 有权

公开(公告)号：US20210398524A1

公开(公告)日：2021-12-23

申请号：US16907680

申请日：2020-06-22

Applicant: Amazon Technologies, Inc.

Inventor： Qiaozi Gao , Divyanshu Brijmohan Verma , Govindarajan Sundaram Thattai , Qing Ping , Joel Joseph Chengottusseriyil , Ivan Vitomir Stojanovic , Feiyang Niu , Gokhan Tur , Charles J. Allen

IPC: G10L15/18 , G10L15/22 , G10L15/26

Abstract: Devices and techniques are generally described for learning personalized responses to declarative natural language inputs. In various examples, a first natural language input may be received. The first natural language input may correspond to intent data corresponding to a declarative user input. In some examples, a dialog session may be initiated with the first user. An action intended by the first user for the first natural language input may be determined based at least in part on the dialog session. In various examples, first data representing the action may be stored in association with second data representing a state described by at least a portion of the first natural language input.

7.

发明授权
Autonomously motile device with command processing 有权

公开(公告)号：US12002458B1

公开(公告)日：2024-06-04

申请号：US17012257

申请日：2020-09-04

Applicant: Amazon Technologies, Inc.

Inventor： Qiaozi Gao , Govindarajan Sundaram Thattai , Qing Ping , Joel Joseph Chengottusseriyil , Feiyang Niu , Gokhan Tur , Dilek Hakkani-Tur

IPC: G10L15/22

CPC classification number: G10L15/22

Abstract: A device capable of autonomous motion may move in an environment and may receive audio data from a microphone. If the device receives a command represented in the audio data that is absent from a set of known commands, the device may prompt the user to explain how to perform the command. The device may save a command template corresponding to the command, which may be used to perform future commands.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification