Patent search ap:("GOOGLE LLC") AND inv:"Dominik Roblek" Page 5

41.

发明授权
Segment content displayed on a computing device into regions based on pixels of a screenshot image that captures the content 有权

公开(公告)号：US10147197B2

公开(公告)日：2018-12-04

申请号：US15839797

申请日：2017-12-12

Applicant: Google LLC

Inventor： Dominik Roblek , David Petrou , Matthew Sharifi

IPC: G06T7/30 , G06F17/30 , G06T7/90 , G06F3/0484 , G06F3/0488

Abstract: Methods and apparatus directed to segmenting content displayed on a computing device into regions. The segmenting of content displayed on the computing device into regions is accomplished via analysis of pixels of a “screenshot image” that captures at least a portion of (e.g., all of) the displayed content. Individual pixels of the screenshot image may be analyzed to determine one or more regions of the screenshot image and to optionally assign a corresponding semantic type to each of the regions. Some implementations are further directed to generating, based on one or more of the regions, interactive content to provide for presentation to the user via the computing device.

42.

发明申请
SEGMENT-BASED SPEAKER VERIFICATION USING DYNAMICALLY GENERATED PHRASES 审中-公开

公开(公告)号：US20180308492A1

公开(公告)日：2018-10-25

申请号：US16017690

申请日：2018-06-25

Applicant: Google LLC

Inventor： Dominik Roblek , Matthew Sharifi

IPC: G10L17/24 , G10L17/04 , G10L15/02

CPC classification number: G10L17/24 , G10L15/02 , G10L17/04 , G10L2015/025

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for verifying an identity of a user. The methods, systems, and apparatus include actions of receiving a request for a verification phrase for verifying an identity of a user. Additional actions include, in response to receiving the request for the verification phrase for verifying the identity of the user, identifying subwords to be included in the verification phrase and in response to identifying the subwords to be included in the verification phrase, obtaining a candidate phrase that includes at least some of the identified subwords as the verification phrase. Further actions include providing the verification phrase as a response to the request for the verification phrase for verifying the identity of the user.

43.

发明申请
GENERATING AUDIO WAVEFORMS USING ENCODER AND DECODER NEURAL NETWORKS 有权

公开(公告)号：US20250078848A1

公开(公告)日：2025-03-06

申请号：US18952607

申请日：2024-11-19

Applicant: Google LLC

Inventor： Yunpeng Li , Marco Tagliasacchi , Dominik Roblek , Félix de Chaumont Quitry , Beat Gfeller , Hannah Raphaelle Muckenhirn , Victor Ungureanu , Oleg Rybakov , Karolis Misiunas , Zalán Borsos

IPC: G10L19/022 , G06N3/045

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for processing an input audio waveform using a generator neural network to generate an output audio waveform. In one aspect, a method comprises: receiving an input audio waveform; processing the input audio waveform using an encoder neural network to generate a set of feature vectors representing the input audio waveform; and processing the set of feature vectors representing the input audio waveform using a decoder neural network to generate an output audio waveform that comprises a respective output audio sample for each of a plurality of output time steps.

44.

发明授权
Generating coded data representations using neural networks and vector quantizers 有权

公开(公告)号：US12198710B2

公开(公告)日：2025-01-14

申请号：US18400992

申请日：2023-12-29

Applicant: Google LLC

Inventor： Neil Zeghidour , Marco Tagliasacchi , Dominik Roblek

IPC: G10L19/038 , G06N3/045 , G06N3/08 , G10L19/00 , G10L25/30

Abstract: Methods, systems and apparatus, including computer programs encoded on computer storage media. According to one aspect, there is provided a method comprising: receiving a new input; processing the new input using an encoder neural network to generate a feature vector representing the new input; and generating a coded representation of the feature vector using a sequence of vector quantizers that are each associated with a respective codebook of code vectors, wherein the coded representation of the feature vector identifies a plurality of code vectors, including a respective code vector from the codebook of each vector quantizer, that define a quantized representation of the feature vector.

45.

发明授权
Personalized entity repository 有权

公开(公告)号：US12108314B2

公开(公告)日：2024-10-01

申请号：US18227751

申请日：2023-07-28

Applicant: GOOGLE LLC

Inventor： Matthew Sharifi , Jorge Pereira , Dominik Roblek , Julian Odell , Cong Li , David Petrou

IPC: H04W4/021 , G06F16/23 , G06F16/2457 , G06F16/248 , G06F16/587 , G06F16/907 , G06F16/9535 , G06V20/62 , H04L67/50 , H04W4/029 , H04W4/18 , H04W4/60

CPC classification number: H04W4/60 , G06F16/23 , G06F16/235 , G06F16/2358 , G06F16/24578 , G06F16/248 , G06F16/587 , G06F16/907 , G06F16/9535 , G06V20/62 , H04L67/535 , H04W4/029 , H04W4/18

Abstract: Systems and methods are provided for a personalized entity repository. For example, a computing device comprises a personalized entity repository having fixed sets of entities from an entity repository stored at a server, a processor, and memory storing instructions that cause the computing device to identify fixed sets of entities that are relevant to a user based on context associated with the computing device, rank the fixed sets by relevancy, and update the personalized entity repository using selected sets determined based on the rank and on set usage parameters applicable to the user. In another example, a method includes generating fixed sets of entities from an entity repository, including location-based sets and topic-based sets, and providing a subset of the fixed sets to a client, the client requesting the subset based on the client's location and on items identified in content generated for display on the client.

46.

发明授权
Automated mining of real-world audio training data 有权

公开(公告)号：US12106748B2

公开(公告)日：2024-10-01

申请号：US17769624

申请日：2019-11-18

Applicant: Google LLC

Inventor： Dominik Roblek

IPC: G10L15/06 , G10L15/02 , G10L15/08 , H04R1/40 , H04R3/00

CPC classification number: G10L15/063 , G10L15/02 , G10L15/08 , H04R1/406 , H04R3/005 , G10L2015/088

Abstract: Methods, systems, and apparatus, for generated labeled training examples for machine learning. In one aspect, a method includes receiving sets of audio recordings by a user device. For each set of audio recordings, each audio recording in the set is recorded over a respective separate microphone in the user device during a particular time interval, and each particular time interval is different for each set of audio recordings. For each set of audio recordings, a detector determines whether an audio recording in the set of audio recordings includes a particular audio feature, and whether another one of the audio recordings does not include the particular audio feature. For each set of audio recordings determined to include an audio recording that includes the particular audio feature and to include another audio recording that does not include the particular audio feature, a labeled training example is generated.

47.

发明授权
Aggregation of related media content 有权

公开(公告)号：US12033668B2

公开(公告)日：2024-07-09

申请号：US17745252

申请日：2022-05-16

Applicant: Google LLC

Inventor： Yossi Matias , Matthew Sharifi , Thomas Bugnon , Dominik Roblek , Annie Chen

IPC: G11B27/031 , G06F16/44 , G06V20/40 , G11B27/034 , G11B27/10 , G11B27/28 , G11B27/30 , H04N23/698 , H04N5/04

CPC classification number: G11B27/031 , G06F16/447 , G06V20/41 , G11B27/034 , G11B27/10 , G11B27/28 , G11B27/3081 , H04N23/698 , H04N5/04

Abstract: Systems and methods for media aggregation are disclosed herein. The system includes a media system that can transform media items into one aggregated media item. A synchronization component synchronizes media items with respect to time. The synchronized media items can be analyzed and transformed into an aggregated media item for storage and/or display. In one implementation, the aggregated media item is capable of being displayed in multiple ways to create an enhanced and customizable viewing and/or listening experience.

48.

发明公开
Machine Learning Based Enhancement of Audio for a Voice Call 审中-公开

公开(公告)号：US20240153514A1

公开(公告)日：2024-05-09

申请号：US18548949

申请日：2021-03-05

Applicant: Google LLC

Inventor： Omer Ahmed Siddig Osman , Dominik Roblek , Yunpeng Li , Marco Tagliasacchi , Oleg Rybakov , Victor Ungureanu , Eric Giguere

IPC: G10L19/06 , G10L19/16 , G10L25/30 , G10L25/69

CPC classification number: G10L19/06 , G10L19/167 , G10L25/30 , G10L25/69

Abstract: Apparatus and methods related to enhancement of audio content are provided. An example method includes receiving, by a computing device and via a communications network interface, a compressed audio data frame, wherein the compressed audio data frame is received after transmission over a communications network, The method further includes decompressing the compressed audio data frame to extract an audio waveform. The method also includes predicting, by applying a neural network to the audio waveform, an enhanced version of the audio waveform, wherein the neural network has been trained on (i) a ground truth sample comprising unencoded audio waveforms prior to compression by an audio encoder, and (ii) a training dataset comprising decoded audio waveforms after compression of the unencoded audio waveforms by the audio encoder. The method additionally includes providing, by an audio output component of the computing device, the enhanced version of the audio waveform.

49.

发明授权
Personalized entity repository 有权

公开(公告)号：US11716600B2

公开(公告)日：2023-08-01

申请号：US17397666

申请日：2021-08-09

Applicant: GOOGLE LLC

Inventor： Matthew Sharifi , Jorge Pereira , Dominik Roblek , Julian Odell , Cong Li , David Petrou

IPC: H04W4/02 , H04W4/60 , G06F16/248 , G06F16/9535 , G06F16/2457 , H04W4/029 , G06F16/907 , G06F16/587 , G06V20/62 , H04L67/50 , H04W4/18 , G06F16/23

CPC classification number: H04W4/60 , G06F16/23 , G06F16/235 , G06F16/2358 , G06F16/248 , G06F16/24578 , G06F16/587 , G06F16/907 , G06F16/9535 , G06V20/62 , H04L67/535 , H04W4/029 , H04W4/18

Abstract: Systems and methods are provided for a personalized entity repository. For example, a computing device comprises a personalized entity repository having fixed sets of entities from an entity repository stored at a server, a processor, and memory storing instructions that cause the computing device to identify fixed sets of entities that are relevant to a user based on context associated with the computing device, rank the fixed sets by relevancy, and update the personalized entity repository using selected sets determined based on the rank and on set usage parameters applicable to the user. In another example, a method includes generating fixed sets of entities from an entity repository, including location-based sets and topic-based sets, and providing a subset of the fixed sets to a client, the client requesting the subset based on the client's location and on items identified in content generated for display on the client.

50.

发明公开
COMPRESSING AUDIO WAVEFORMS USING NEURAL NETWORKS AND VECTOR QUANTIZERS 审中-公开

公开(公告)号：US20230186927A1

公开(公告)日：2023-06-15

申请号：US18106094

申请日：2023-02-06

Applicant: Google LLC

Inventor： Neil Zeghidour , Marco Tagliasacchi , Dominik Roblek

IPC: G10L19/038 , G06N3/045 , G06N3/08 , G10L25/30

CPC classification number: G10L19/038 , G06N3/045 , G06N3/08 , G10L25/30 , G10L2019/0002

Abstract: Methods, systems and apparatus, including computer programs encoded on computer storage media. One of the methods includes receiving an audio waveform that includes a respective audio sample for each of a plurality of time steps, processing the audio waveform using an encoder neural network to generate a plurality of feature vectors representing the audio waveform, generating a respective coded representation of each of the plurality of feature vectors using a plurality of vector quantizers that are each associated with a respective codebook of code vectors, wherein the respective coded representation of each feature vector identifies a plurality of code vectors, including a respective code vector from the codebook of each vector quantizer, that define a quantized representation of the feature vector, and generating a compressed representation of the audio waveform by compressing the respective coded representation of each of the plurality of feature vectors.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification