Patent search ap:("QUALCOMM Incorporated") AND inv:"Byeonggeun Kim" Page 1

1.

发明授权
Systems and methods of image processing based on gaze detection 有权

公开(公告)号：US11798204B2

公开(公告)日：2023-10-24

申请号：US17685278

申请日：2022-03-02

Applicant: QUALCOMM Incorporated

Inventor： Hyunsin Park , Juntae Lee , Simyung Chang , Byeonggeun Kim , Jaewon Choi , Kyu Woong Hwang

IPC: G06T11/00 , G06V40/16 , G06F3/01 , G06V40/18

CPC classification number: G06T11/00 , G06F3/013 , G06V40/174 , G06V40/18

Abstract: Imaging systems and techniques are described. An imaging system receives image data representing at least a portion (e.g., a face) of a first user as captured by a first image sensor. The imaging system identifies that a gaze of the first user as represented in the image data is directed toward a displayed representation of at least a portion (e.g., a face) of a second user. The imaging system identifies an arrangement of representations of users for output. The imaging system generates modified image data based on the gaze and the arrangement at least in part by modifying the image data to modify at least the portion of the first user in the image data to be visually directed toward a direction corresponding to the second user based on the gaze and the arrangement. The imaging system outputs the modified image data arranged according to the arrangement.

2.

发明授权
Relaxed instance frequency normalization for neural-network-based audio processing 有权

公开(公告)号：US12266379B2

公开(公告)日：2025-04-01

申请号：US17937765

申请日：2022-10-03

Applicant: QUALCOMM Incorporated

Inventor： Byeonggeun Kim , Seunghan Yang , Hyunsin Park , Juntae Lee , Simyung Chang

IPC: G10L21/034 , G10L17/04 , G10L17/18 , G10L25/30 , G10L25/51

Abstract: Techniques and apparatus for training a neural network to classify audio into one of a plurality of categories and using such a trained neural network. An example method generally includes receiving a data set including a plurality of audio samples. A relaxed feature-normalized data set is generated by normalizing each audio sample of the plurality of audio samples. A neural network is trained to classify audio into one of a plurality of categories based on the relaxed feature-normalized data set, and the trained neural network is deployed.

3.

发明授权
Target keyword selection 有权

公开(公告)号：US12039968B2

公开(公告)日：2024-07-16

申请号：US17038887

申请日：2020-09-30

Applicant: QUALCOMM Incorporated

Inventor： Wonil Chang , Jinseok Lee , Mingu Lee , Jinkyu Lee , Byeonggeun Kim , Dooyong Sung , Jae-Won Choi , Kyu Woong Hwang

IPC: G10L15/00 , G10L15/02 , G10L15/22 , G10L15/08

CPC classification number: G10L15/02 , G10L15/22 , G10L15/08

Abstract: System and method for operating an always-on ASR (automatic speech recognition) system by selecting target keywords and continuously detecting the selected target keywords in voice commands in a mobile device are provided. In the mobile device, a processor is configured to collect keyword candidates, collect usage frequency data for keywords in the keyword candidates, collect situational usage frequency data for the keywords in the keyword candidates, select target keywords from the keyword candidates based on the usage frequency data and the situational usage frequency data, and detect one or more of the target keywords in a voice command using continuous detection of the target keywords.

4.

发明授权
Task agnostic open-set prototypes for few-shot open-set recognition 有权

公开(公告)号：US12019641B2

公开(公告)日：2024-06-25

申请号：US18153899

申请日：2023-01-12

Applicant: QUALCOMM Incorporated

Inventor： Byeonggeun Kim , Juntae Lee , Simyung Chang

IPC: G06F7/00 , G06F16/2458 , G06F16/28

CPC classification number: G06F16/2462 , G06F16/285

Abstract: Systems and techniques are provided for processing one or more data samples. For example, a neural network classifier can be trained to perform few-shot open-set recognition (FSOSR) based on a task-agnostic open-set prototype. A process can include determining one or more prototype representations for each class included in a plurality of support samples. A task-agnostic open-set prototype representation can be determined, in a same learned metric space as the one or more prototype representations. One or more distance metrics can be determined for each query sample of one or more query samples, based on the one or more prototype representations and the task-agnostic open-set prototype representation. Based on the one or more distance metrics, each query sample can be classified into one of classes associated with the one or more prototype representations or an open-set class associated with the task-agnostic open-set prototype representation.

5.

发明授权
On-device self training in a two-stage wakeup system comprising a system on chip which operates in a reduced-activity mode 有权

公开(公告)号：US11664012B2

公开(公告)日：2023-05-30

申请号：US16830029

申请日：2020-03-25

Applicant: Qualcomm Incorporated

Inventor： Young Mo Kang , Sungrack Yun , Kyu Woong Hwang , Hye Jin Jang , Byeonggeun Kim

IPC: G10L15/08 , G10L15/22 , G10L25/27 , G10L15/32 , G06F15/78 , G10L15/26

CPC classification number: G10L15/08 , G06F15/7807 , G10L15/26 , G10L15/32 , G10L2015/088

Abstract: In one embodiment, an electronic device includes an input device configured to provide an input stream, a first processing device, and a second processing device. The first processing device is configured to use a keyword-detection model to determine if the input stream comprises a keyword, wake up the second processing device in response to determining that a segment of the input stream comprises the keyword, and modify the keyword-detection model in response to a training input received from the second processing device. The second processing device is configured to use a first neural network to determine whether the segment of the input stream comprises the keyword and provide the training input to the first processing device in response to determining that the segment of the input stream does not comprise the keyword.

6.

发明授权
Activating speech recognition based on hand patterns detected using plurality of filters 有权

公开(公告)号：US11437031B2

公开(公告)日：2022-09-06

申请号：US16526608

申请日：2019-07-30

Applicant: QUALCOMM Incorporated

Inventor： Sungrack Yun , Young Mo Kang , Hye Jin Jang , Byeonggeun Kim , Kyu Woong Hwang

IPC: G06F3/01 , G06K9/62 , B60H1/00 , G06F13/42 , G10L17/22 , G06F3/16 , G10L15/22 , G06F1/3231 , G10L15/08 , G06V40/10

Abstract: A device to process an audio signal representing input sound includes a hand detector configured to generate a first indication responsive to detection of at least a portion of a hand over at least a portion of the device. The device also includes an automatic speech recognition system configured to be activated, responsive to the first indication, to process the audio signal.

7.

发明授权
Method and apparatus for activating speech recognition 有权

公开(公告)号：US11205433B2

公开(公告)日：2021-12-21

申请号：US16547263

申请日：2019-08-21

Applicant: QUALCOMM Incorporated

Inventor： Byeonggeun Kim , Young Mo Kang , Sungrack Yun , Kyu Woong Hwang , Hye Jin Jang

IPC: G10L25/00 , G10L15/00 , G10L17/00 , G10L15/22

Abstract: A device to process an audio signal representing input sound includes a user voice verifier configured to generate a first indication based on whether the audio signal represents a user's voice. The device includes a speaking target detector configured to generate a second indication based on whether the audio signal represents at least one of a command or a question. The device includes an activation signal unit configured to selectively generate an activation signal based on the first indication and the second indication. The device also includes an automatic speech recognition engine configured to be activated, responsive to the activation signal, to process the audio signal.

8.

发明申请
ACTIVATING SPEECH RECOGNITION 有权

公开(公告)号：US20210035571A1

公开(公告)日：2021-02-04

申请号：US16526608

申请日：2019-07-30

Applicant: QUALCOMM Incorporated

Inventor： Sungrack Yun , Young Mo Kang , Hye Jin Jang , Byeonggeun Kim , Kyu Woong Hwang

IPC: G10L15/22 , G10L15/08 , G06F1/3231 , G06K9/00

Abstract: A device to process an audio signal representing input sound includes a hand detector configured to generate a first indication responsive to detection of at least a portion of a hand over at least a portion of the device. The device also includes an automatic speech recognition system configured to be activated, responsive to the first indication, to process the audio signal.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification