Patent search ap:("QUALCOMM Incorporated") AND inv:"Stephane VILLETTE" Page 1

1.

发明公开
AVATAR FACIAL EXPRESSIONS BASED ON SEMANTICAL CONTEXT 审中-公开

公开(公告)号：US20240078732A1

公开(公告)日：2024-03-07

申请号：US17930244

申请日：2022-09-07

Applicant: QUALCOMM Incorporated

Inventor： Scott BEITH , Suzana ARELLANO , Michel Adib SARKIS , Matthew FISCHLER , Ke-Li CHENG , Stephane VILLETTE

IPC: G06T13/40 , G06F3/01 , G06V20/40 , G06V40/16

CPC classification number: G06T13/40 , G06F3/012 , G06V20/41 , G06V40/174

Abstract: A device includes a memory and one or more processors configured to process sensor data to determine a semantical context associated with the sensor data. The one or more processors are also configured to generate adjusted face data based on the determined semantical context and face data. The adjusted face data includes an avatar facial expression that corresponds to the semantical context.

2.

发明公开
MEDIA SEGMENT PREDICTION FOR MEDIA GENERATION 审中-公开

公开(公告)号：US20240127838A1

公开(公告)日：2024-04-18

申请号：US18047572

申请日：2022-10-18

Applicant: QUALCOMM Incorporated

Inventor： Stephane VILLETTE , Sen LI , Pravin Kumar RAMADAS , Daniel Jared SINDER

IPC: G10L21/01 , G10L17/02 , G10L25/54

CPC classification number: G10L21/01 , G10L17/02 , G10L25/54

Abstract: A device includes one or more processors configured to input one or more segments of an input media stream into a feature extractor. The one or more processors are further configured to pass an output of the feature extractor into an utterance classifier to produce at least one representation of at least one utterance class of a plurality of utterance classes. The one or more processors are further configured to pass the output of the feature extractor and the at least one representation into a segment matcher to produce a media output segment identifier.

3.

发明公开
MEDIA SEGMENT REPRESENTATION USING FIXED WEIGHTS 审中-公开

公开(公告)号：US20240127809A1

公开(公告)日：2024-04-18

申请号：US18047562

申请日：2022-10-18

Applicant: QUALCOMM Incorporated

Inventor： Stephane VILLETTE , Sen LI , Daniel Jared SINDER

IPC: G10L15/22 , G10L15/04 , G10L15/16 , G10L25/78

CPC classification number: G10L15/22 , G10L15/04 , G10L15/16 , G10L25/78

Abstract: A device includes a memory configured to store a collection of sets of weights, each of the sets of weights representing a respective media segment. The device also includes one or more processors configured to generate data representing the detected first input speech segment and to pass the data representing the detected first input speech segment into a collection of memory units. Each memory unit of the collection of memory units includes a set of weights from the collection of sets of weights. The one or more processors are also configured to generate a first estimate of an associated media segment that represents the detected first input speech segment. The associated media segment corresponds to a first memory unit in the collection of memory units.

4.

发明公开
AVATAR REPRESENTATION AND AUDIO GENERATION 审中-公开

公开(公告)号：US20240078731A1

公开(公告)日：2024-03-07

申请号：US17930257

申请日：2022-09-07

Applicant: QUALCOMM Incorporated

Inventor： Scott BEITH , Suzana ARELLANO , Michel Adib SARKIS , Matthew FISCHLER , Ke-Li CHENG , Stephane VILLETTE

IPC: G06T13/20 , G06T13/40 , G06V40/16

CPC classification number: G06T13/205 , G06T13/40 , G06V40/174

Abstract: A device includes a memory and one or more processors configured to process image data corresponding to a user's face to generate face data. The one or more processors are configured to process sensor data to generate feature data and to generate a representation of an avatar based on the face data and the feature data. The one or more processors are also configured to generate an audio output for the avatar based on the sensor data.

5.

发明公开
MATCHING AUDIO USING MACHINE LEARNING BASED AUDIO REPRESENTATIONS 审中-公开

公开(公告)号：US20240127827A1

公开(公告)日：2024-04-18

申请号：US18047565

申请日：2022-10-18

Applicant: QUALCOMM Incorporated

Inventor： Stephane VILLETTE , Sen LI , Pravin Kumar RAMADAS , Daniel Jared SINDER

IPC: G10L19/00 , H04L65/70

CPC classification number: G10L19/00 , H04L65/70

Abstract: Systems and techniques are described herein for encoding and/or decoding audio information. For example, a process can process an input audio segment to generate a representation of the input audio segment, and can compare the representation of the input audio segment to representations stored in a memory. The representations represent a plurality of audio segments. The process can determine, based on the comparison, target representation(s) of target audio segment(s) from the representations stored in the memory. The process can determine one or more indices associated with the target audio segment(s). The process can then packetize the one or more indices and transmit the one or more packetized indices (e.g., to a decoder configured to decode the packetized indices).

Patent Agency Ranking