Patent search ap:("Amazon Technologies Page Inc.") AND inv:"Pradeep Natarajan"

1.

发明授权
Natural language processing 有权

公开(公告)号：US12165636B1

公开(公告)日：2024-12-10

申请号：US17984511

申请日：2022-11-10

Applicant: Amazon Technologies, Inc.

Inventor： Kiana Hajebi , Vivek Yadav , Pradeep Natarajan

IPC: G10L15/22 , G10L15/18

Abstract: Devices and techniques are generally described for inference reduction in natural language processing using semantic similarity-based caching. In various examples, first automatic speech recognition (ASR) data representing a first natural language input may be determined. A cache may be searched using the first ASR data. A first skill associated with the first ASR data may be determined from the cache. In some examples, first intent data representing a semantic interpretation of the first natural language input data may be determined by using a first natural language process associated with the first skill.

2.

发明授权
Gaze prediction 有权

公开(公告)号：US11681364B1

公开(公告)日：2023-06-20

申请号：US17361939

申请日：2021-06-29

Applicant: Amazon Technologies, Inc.

Inventor： Xu Zhang , Yue Wu , Varsha Hedau , Shih-Fu Chang , Pradeep Natarajan

IPC: G06F3/01 , G06T7/73 , G06N3/049 , G06F40/40 , G06V40/18 , G06V40/16

CPC classification number: G06F3/013 , G06F40/40 , G06N3/049 , G06T7/74 , G06V40/171 , G06V40/18

Abstract: An image processing system may receive image data from a camera of a user device and perform gaze prediction processing of the image data to predict one or more gaze patterns. The gaze prediction processing may include processing the image data using a neural network to detect faces and/or objects and generate an image feature map. The gaze prediction processing may include performing gaze direction prediction operations using the feature map and detected faces and/or objects to determine gaze direction probability data. The gaze prediction processing may include predicting a gaze pattern based on the gaze direction probability data and the image feature map. The gaze pattern may be short-term (e.g., atomic-level) or long-term (e.g., event-level).

3.

发明申请
ACTIVE SPEAKER DETECTION USING IMAGE DATA 有权

公开(公告)号：US20230068798A1

公开(公告)日：2023-03-02

申请号：US17465143

申请日：2021-09-02

Applicant: Amazon Technologies, Inc.

Inventor： Tyler Jerel Etchart , Vivek Yadav , Pradeep Natarajan

IPC: G10L15/25 , G06K9/00 , G06T7/70 , G10L15/22 , G10L25/78

Abstract: A system can operate a speech-controlled device to perform active speaker detection to detect an utterance using image data showing a user speaking the utterance. This enables the device to perform utterance detection using the image data and/or determine which user is speaking the utterance. To perform active speaker detection, the device processes the image data to determine expression parameters associated with the user's face and generates facial measurements based on the expression parameters. For example, the device can use the expression parameters to generate a 3D model including an agnostic facial representation and determine a mouth aspect ratio by measuring a mouth height and a mouth width of the agnostic facial representation. As the mouth aspect ratio changes when the user is speaking, the device can determine that the user is speaking and/or detect an utterance based on an amount of variation of the mouth aspect ratio.

4.

发明授权
Natural language processing 有权

公开(公告)号：US11532301B1

公开(公告)日：2022-12-20

申请号：US17113823

申请日：2020-12-07

Applicant: Amazon Technologies, Inc.

Inventor： Kiana Hajebi , Vivek Yadav , Pradeep Natarajan

IPC: G10L15/22 , G10L15/18

Abstract: Devices and techniques are generally described for inference reduction in natural language processing using semantic similarity-based caching. In various examples, first automatic speech recognition (ASR) data representing a first natural language input may be determined. A cache may be searched using the first ASR data. A first skill associated with the first ASR data may be determined from the cache. In some examples, first intent data representing a semantic interpretation of the first natural language input data may be determined by using a first natural language process associated with the first skill.

5.

发明申请
TASK-BASED IMAGE MASKING 有权

公开(公告)号：US20210406589A1

公开(公告)日：2021-12-30

申请号：US16913837

申请日：2020-06-26

Applicant: Amazon Technologies, Inc.

Inventor： Vivek Yadav , Aayush Gupta , Yue Wu , Pradeep Natarajan , Ayush Jaiswal

IPC: G06K9/62 , G06N20/00 , G06N3/08

Abstract: Techniques for masking images based on a particular task are described. A system masks portions of an image that are not relevant to a particular task, thus, reducing the amount of data used by applications for image processing tasks. For example, images to be processed using a hair color classification model are masked so that only portions that show the person's hair are available for the model to analyze. The system configures different masker components to mask images for different tasks. A masker component can be implemented at a user device to mask images prior to sending to an application/task-specific model.

6.

发明授权
Task-based image masking 有权

公开(公告)号：US11854116B2

公开(公告)日：2023-12-26

申请号：US17740533

申请日：2022-05-10

Applicant: Amazon Technologies, Inc.

Inventor： Vivek Yadav , Aayush Gupta , Yue Wu , Pradeep Natarajan , Ayush Jaiswal

IPC: G06T11/00 , G06N20/00 , G06N3/08 , G06F18/2431 , G06F18/214 , G06V10/772 , G06V10/20

CPC classification number: G06T11/00 , G06F18/214 , G06F18/2431 , G06N3/08 , G06N20/00 , G06V10/20 , G06V10/772

Abstract: Techniques for masking images based on a particular task are described. A system masks portions of an image that are not relevant to a particular task, thus, reducing the amount of data used by applications for image processing tasks. For example, images to be processed using a hair color classification model are masked so that only portions that show the person's hair are available for the model to analyze. The system configures different masker components to mask images for different tasks. A masker component can be implemented at a user device to mask images prior to sending to an application/task-specific model.

7.

发明申请
DIALOG MANAGEMENT FOR MULTIPLE USERS 有权

公开(公告)号：US20220093093A1

公开(公告)日：2022-03-24

申请号：US17112227

申请日：2020-12-04

Applicant: Amazon Technologies, Inc.

Inventor： Prakash Krishnan , Arindam Mandal , Nikko Strom , Pradeep Natarajan , Ariya Rastrow , Shiv Naga Prasad Vitaladevuni , David Chi-Wai Tang , Aaron Challenner , Xu Zhang , Krishna Anisetty , Josey Diego Sandoval , Rohit Prasad , Premkumar Natarajan

IPC: G10L15/22 , G10L15/08 , G10L15/24 , G06K9/46 , G06K9/62 , G06K9/00 , G10L15/02

Abstract: A system can operate a speech-controlled device in a mode where the speech-controlled device determines that an utterance is directed at the speech-controlled device using image data showing the user speaking the utterance. If the user is directing the user's gaze at the speech-controlled device while speaking, the system may determine the utterance is system directed and thus may perform further speech processing based on the utterance. If the user's gaze is directed elsewhere, the system may determine the utterance is not system directed (for example directed at another user) and thus the system may not perform further speech processing based on the utterance and may take other actions, for example discarding audio data of the utterance.

8.

发明授权
Image processing using multiple aspect ratios 有权
Title translation: 使用多个宽高比的图像处理

公开(公告)号：US09418283B1

公开(公告)日：2016-08-16

申请号：US14463961

申请日：2014-08-20

Applicant: Amazon Technologies, Inc.

Inventor： Pradeep Natarajan , Avnish Sikka , Rohit Prasad

IPC: G06K9/00 , G06T3/40 , G06K9/64

CPC classification number: G06K9/00463 , G06K9/3258 , G06K9/42 , G06K9/64 , G06K9/6857 , G06T3/40

Abstract: A system to recognize text, objects, or symbols in a captured image using machine learning models reduces computational overhead by generating a plurality of thumbnail versions of the image at different downscaled resolutions and aspect ratios, and then processing the downscaled images instead of the entire image, or sections of the entire image. The downscaled images are processed to produce a combine feature vector characterizing the overall image. The combined feature vector is processed using the machine learning model.

Abstract translation: 使用机器学习模型识别拍摄图像中的文本，对象或符号的系统通过以不同的缩小分辨率和高宽比生成图像的多个缩略图版本，然后处理缩小的图像而不是整个图像来减少计算开销，或整个图像的部分。处理缩小的图像以产生表征整体图像的组合特征向量。使用机器学习模型处理组合特征向量。

9.

发明授权
Class-agnostic object detection 有权

公开(公告)号：US11775617B1

公开(公告)日：2023-10-03

申请号：US17201358

申请日：2021-03-15

Applicant: Amazon Technologies, Inc.

Inventor： Ayush Jaiswal , Yue Wu , Pradeep Natarajan , Premkumar Natarajan

IPC: G06K9/00 , G06F18/2413 , G06F16/53 , G06F40/20 , G06V10/40 , G06F18/22 , G06F18/2132

CPC classification number: G06F18/2413 , G06F16/53 , G06F18/2132 , G06F18/22 , G06F40/20 , G06V10/40

Abstract: Devices and techniques are generally described for class-agnostic object detection. In some examples, a first frame of image data comprising a first plurality of pixels may be received. First class-agnostic feature data representing the first plurality of pixels may be generated. A first object detection component may be used to determine that the first plurality of pixels corresponds to an arbitrary object represented in the first frame of image data based at least in part on the first class-agnostic feature data. Class-agnostic data indicating that the first plurality of pixels in the first frame of image data corresponds to the arbitrary object may be generated.

10.

发明授权
Natural language processing 有权

公开(公告)号：US11626107B1

公开(公告)日：2023-04-11

申请号：US17114116

申请日：2020-12-07

Applicant: Amazon Technologies, Inc.

Inventor： Kiana Hajebi , Vivek Yadav , Pradeep Natarajan

IPC: G10L15/22 , G10L15/18

Abstract: Devices and techniques are generally described for inference reduction in natural language processing using semantic similarity-based caching. In various examples, first automatic speech recognition (ASR) data representing a first natural language input may be determined. A cache may be searched using the first ASR data. A first skill associated with the first ASR data may be determined from the cache. In some examples, first intent data representing a semantic interpretation of the first natural language input data may be determined by using a first natural language process associated with the first skill.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification