Patent search ap:("Google LLC") AND inv:"Shumeet Baluja" Page 1

1.

Invention application
Methods and Systems for Encoding Images (status: in force)

Publication (announcement) number: US20250078494A1

Publication (announcement) date: 2025-03-06

Application number: US18953894

Application date: 2024-11-20

Applicant: Google LLC

Inventor: Shumeet Baluja, Rahul Sukthankar

IPC: G06V10/94, G06V10/774, G06V10/82

Abstract: The present disclosure is directed to encoding images. In particular, one or more computing devices can receive data representing one or more machine learning (ML) models configured, at least in part, to encode images comprising objects of a particular type. The computing device(s) can receive data representing an image comprising one or more objects of the particular type. The computing device(s) can generate, based at least in part on the data representing the image and the data representing the ML model(s), data representing an encoded version of the image that alters at least a portion of the image comprising the object(s) such that when the encoded version of the image is decoded, the object(s) are unrecognizable as being of the particular type by one or more object-recognition ML models based at least in part upon which the ML model(s) configured to encode the images were trained.

2.

Granted invention patent
Predictive information retrieval (status: in force)

Publication (announcement) number: US11971897B2

Publication (announcement) date: 2024-04-30

Application number: US16356301

Application date: 2019-03-18

Applicant: Google LLC

Inventor: Shumeet Baluja, Henry Allan Rowley

IPC: G06F16/248, G06F3/0482, G06F3/04842, G06F16/951, G06F16/9535, G06F16/955

CPC classification number: G06F16/248, G06F3/0482, G06F3/04842, G06F16/951, G06F16/9535, G06F16/9558

Abstract: A computer-implemented method for generating results for a client-requested query involves receiving a query produced by a client communication device, generating a result for the query in response to reception of the query, determining one or more predictive follow-up requests before receiving an actual follow-up request from the client device, and initiating retrieval of information associated with the one or more predictive follow-up requests, and transmitting at least part of the result to the client device, and then transmitting to the client device at least part of the information associated with the one or more predictive follow-up requests.
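
The order of operations in this abstract (answer the query, predict follow-ups, start retrieving them, send the result, then send the prefetched information) can be illustrated with a minimal Python sketch. It is not the patented implementation: `answer_with_prefetch`, the toy index, and the prefix-based predictor are all hypothetical names and choices made for illustration.

```python
from typing import Callable, Dict, List


def answer_with_prefetch(
    query: str,
    search: Callable[[str], str],
    predict_follow_ups: Callable[[str, str], List[str]],
) -> List[Dict[str, str]]:
    """Return the messages sent to the client, in transmission order."""
    # Generate the result for the query the client actually sent.
    result = search(query)
    # Predict follow-up requests before any real follow-up arrives,
    # and retrieve the information associated with each of them.
    prefetched = {q: search(q) for q in predict_follow_ups(query, result)}
    # Transmit the result first, then the prefetched information.
    messages = [{"kind": "result", "query": query, "body": result}]
    for follow_up, body in prefetched.items():
        messages.append({"kind": "prefetch", "query": follow_up, "body": body})
    return messages


# Toy stand-ins for a search backend and a follow-up predictor (assumptions).
INDEX = {
    "weather": "Sunny",
    "weather tomorrow": "Rain",
    "weather weekend": "Cloudy",
}


def toy_search(q: str) -> str:
    return INDEX.get(q, "no result")


def toy_predict(q: str, result: str) -> List[str]:
    # Guess that the user will refine the query with one more word.
    return [key for key in INDEX if key.startswith(q + " ")]


messages = answer_with_prefetch("weather", toy_search, toy_predict)
for message in messages:
    print(message["kind"], message["query"], message["body"])
```

In a real system the prefetch would presumably run concurrently with transmission of the first result; the sketch keeps everything sequential so the ordering is easy to follow.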

3.

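Granted invention patent
Hiding information and images via deep learning (status: in force)

Publication (announcement) number: US11080809B2

Publication (announcement) date: 2021-08-03

Application number: US16614983

Application date: 2018-02-13

Applicant: Google LLC

Inventor: Shumeet Baluja

IPC: G06T1/00, G06N3/04, G06N3/08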

Abstract: The present disclosure provides systems and methods for hiding information using deep neural networks. In one example, a computer-implemented method is provided to train neural networks for hiding images, which includes inputting a package image and a cover image into an image hiding neural network and generating a carrier image as an output, the carrier image comprising the package image hidden within the cover image. The method includes inputting the carrier image into an image decoding neural network and generating a reconstruction of the package image as an output. The method includes simultaneously training the image decoding neural network based at least in part on a first loss function that describes a difference between the package image and the reconstruction of the package image and the image hiding neural network based at least in part on the first loss function and on a second loss function that describes a difference between the cover image and the carrier image.
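
How the two loss functions are divided between the two networks can be shown with a small Python sketch. The abstract only says that each loss "describes a difference"; the use of mean squared error, the weight `beta`, and the function names below are assumptions made for illustration.

```python
from typing import Dict, Sequence

Image = Sequence[float]  # a flattened image with pixel values in [0, 1]


def mse(a: Image, b: Image) -> float:
    """Mean squared difference between two equally sized flattened images."""
    if len(a) != len(b):
        raise ValueError("images must have the same number of pixels")
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)


def training_losses(
    package: Image,
    cover: Image,
    carrier: Image,
    reconstruction: Image,
    beta: float = 1.0,  # assumed weighting between the two losses
) -> Dict[str, float]:
    # First loss: difference between the package image and its reconstruction.
    first = mse(package, reconstruction)
    # Second loss: difference between the cover image and the carrier image.
    second = mse(cover, carrier)
    return {
        # The decoding network is trained on the first loss only.
        "decoding_network": first,
        # The hiding network is trained on both losses.
        "hiding_network": second + beta * first,
    }


losses = training_losses(
    package=[0.0, 1.0],
    cover=[0.5, 0.5],
    carrier=[0.5, 1.0],
    reconstruction=[0.0, 0.5],
)
print(losses)
```

The sketch only computes the loss values; in practice both networks would be updated by backpropagating these quantities in the same training step.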

4.

Invention application
METHODS, SYSTEMS, AND MEDIA FOR SEAMLESS AUDIO MELDING BETWEEN SONGS IN A PLAYLIST (status: in force)

Publication (announcement) number: US20210166731A1

Publication (announcement) date: 2021-06-03

Application number: US17009001

Application date: 2020-09-01

Applicant: Google LLC

Inventor: Michele Covell, Shumeet Baluja

IPC: G11B27/02, G11B27/10, G10L21/10

Abstract: In accordance with some embodiments of the disclosed subject matter, mechanisms for seamless audio melding between audio items in a playlist are provided. In some embodiments, a method for transitioning between audio items in playlists is provided, comprising: identifying a sequence of audio items in a playlist of audio items, wherein the sequence of audio items includes a first audio item and a second audio item that is to be played subsequent to the first audio item; and modifying an end portion of the first audio item and a beginning portion of the second audio item, where the end portion of the first audio item and the beginning portion of the second audio item are to be played concurrently to transition between the first audio item and the second audio item, wherein the end portion of the first audio item and the beginning portion of the second audio item have an overlap duration, and wherein modifying the end portion of the first audio item and the beginning portion of the second audio item comprises: generating a first spectrogram corresponding to the end portion of the first audio item and a second spectrogram corresponding to the beginning portion of the second audio item; identifying, for each frequency band in a series of frequency bands, a window over which the first spectrogram within the end portion of the first audio item and the second spectrogram within the beginning portion of the second audio item have a particular cross-correlation; modifying, for each frequency band in the series of frequency bands, the end portion of the first spectrogram and the beginning portion of the second spectrogram such that amplitudes of frequencies within the frequency band decrease within the first spectrogram over the end portion of the first spectrogram and that amplitudes of frequencies within the frequency band increase within the second spectrogram over the beginning portion of the second spectrogram; and generating a modified version of the first audio item that includes the modified end portion of the first audio item based on the modified end portion of the first spectrogram and generating a modified version of the second audio item that includes the modified beginning portion of the second audio item based on the modified beginning portion of the second spectrogram.
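
The per-band fade described in this abstract can be sketched in Python as follows. This is a simplified illustration under stated assumptions: spectrograms are plain `[band][frame]` lists of magnitudes over the overlap, the fade is linear, and the per-band windows are supplied by the caller rather than chosen by cross-correlation. The names `fade_gains` and `meld` are hypothetical.

```python
from typing import List, Tuple

Spectrogram = List[List[float]]  # [band][frame] magnitudes over the overlap


def fade_gains(n_frames: int, window: Tuple[int, int]) -> List[float]:
    """Fade-in gain per frame: 0 before the window, a linear ramp inside, 1 after."""
    start, end = window
    if not 0 <= start < end <= n_frames:
        raise ValueError("window must satisfy 0 <= start < end <= n_frames")
    gains = []
    for t in range(n_frames):
        if t < start:
            gains.append(0.0)
        elif t >= end:
            gains.append(1.0)
        else:
            gains.append((t - start + 1) / (end - start + 1))
    return gains


def meld(
    first_end: Spectrogram,
    second_start: Spectrogram,
    windows: List[Tuple[int, int]],
) -> Tuple[Spectrogram, Spectrogram]:
    """Fade the first item out and the second item in, one window per band."""
    faded_first, faded_second = [], []
    for band_a, band_b, window in zip(first_end, second_start, windows):
        fade_in = fade_gains(len(band_a), window)
        # Amplitudes in this band decrease in the first spectrogram...
        faded_first.append([a * (1.0 - g) for a, g in zip(band_a, fade_in)])
        # ...and increase in the second spectrogram.
        faded_second.append([b * g for b, g in zip(band_b, fade_in)])
    return faded_first, faded_second


first_end = [[1.0] * 4, [2.0] * 4]
second_start = [[1.0] * 4, [2.0] * 4]
# Each band gets its own window (assumed here; the abstract derives it
# from the cross-correlation of the two spectrograms in that band).
out_first, out_second = meld(first_end, second_start, [(0, 3), (1, 3)])
print(out_first[0], out_second[0])
```

Converting the modified spectrograms back into audio is outside the scope of this sketch.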

5.

Granted invention patent
Automatic language model update (status: in force)

Publication (announcement) number: US10410627B2

Publication (announcement) date: 2019-09-10

Application number: US15922154

Application date: 2018-03-15

Applicant: Google LLC

Inventor: Michael H. Cohen, Shumeet Baluja, Pedro J. Moreno Mengibar

IPC: G10L15/00, G10L17/00, G10L15/065, G10L15/187, G10L15/06, G10L15/26

Abstract: A method for generating a speech recognition model includes accessing a baseline speech recognition model, obtaining information related to recent language usage from search queries, and modifying the speech recognition model to revise probabilities of a portion of a sound occurrence based on the information. The portion of a sound may include a word. Also, a method for generating a speech recognition model includes receiving at a search engine from a remote device an audio recording and a transcript that substantially represents at least a portion of the audio recording, synchronizing the transcript with the audio recording, extracting one or more letters from the transcript and extracting the associated pronunciation of the one or more letters from the audio recording, and generating a dictionary entry in a pronunciation dictionary.
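
The first method (revising word probabilities from recent search queries) can be illustrated with a short Python sketch. The abstract does not say how the probabilities are revised; linear interpolation between baseline word probabilities and recent query word frequencies, the `weight` parameter, and the function name are assumptions made for illustration.

```python
from collections import Counter
from typing import Dict, Iterable


def update_word_probabilities(
    baseline: Dict[str, float],
    recent_queries: Iterable[str],
    weight: float = 0.5,  # assumed share given to recent usage
) -> Dict[str, float]:
    """Blend baseline word probabilities with word frequencies in recent queries."""
    counts = Counter(
        word for query in recent_queries for word in query.lower().split()
    )
    total = sum(counts.values())
    if total == 0:
        # No recent usage information: keep the baseline unchanged.
        return dict(baseline)
    vocabulary = set(baseline) | set(counts)
    return {
        word: (1.0 - weight) * baseline.get(word, 0.0)
        + weight * counts[word] / total
        for word in vocabulary
    }


baseline = {"cat": 0.5, "dog": 0.5}
updated = update_word_probabilities(baseline, ["dog food", "dog park"])
print(sorted(updated.items()))
```

Because both inputs are probability distributions over words, the interpolated values still sum to 1, and words that appear only in recent queries ("food", "park") gain non-zero probability.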

6.

Invention application
Predictive Information Retrieval (status: under examination, published)

Publication (announcement) number: US20190213186A1

Publication (announcement) date: 2019-07-11

Application number: US16356301

Application date: 2019-03-18

Applicant: Google LLC

Inventor: Shumeet Baluja, Henry Allan Rowley

IPC: G06F16/248, G06F16/955, G06F16/9535, G06F3/0482, G06F3/0484, G06F16/951

CPC classification number: G06F16/248, G06F3/0482, G06F3/04842, G06F16/951, G06F16/9535, G06F16/9558

Abstract: A computer-implemented method for generating results for a client-requested query involves receiving a query produced by a client communication device, generating a result for the query in response to reception of the query, determining one or more predictive follow-up requests before receiving an actual follow-up request from the client device, and initiating retrieval of information associated with the one or more predictive follow-up requests, and transmitting at least part of the result to the client device, and then transmitting to the client device at least part of the information associated with the one or more predictive follow-up requests.

7.

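Granted invention patent
Predictive information retrieval (status: in force)

Publication (announcement) number: US10275503B2

Publication (announcement) date: 2019-04-30

Application number: US15791584

Application date: 2017-10-24

Applicant: Google LLC

Inventor: Shumeet Baluja, Henry Allan Rowley

IPC: G06F17/30, G06F3/0482, G06F3/0484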

Abstract: A computer-implemented method for generating results for a client-requested query involves receiving a query produced by a client communication device, generating a result for the query in response to reception of the query, determining one or more predictive follow-up requests before receiving an actual follow-up request from the client device, and initiating retrieval of information associated with the one or more predictive follow-up requests, and transmitting at least part of the result to the client device, and then transmitting to the client device at least part of the information associated with the one or more predictive follow-up requests.

8.

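Invention publication
Machine-Learned Discretization Level Reduction (status: under examination, published)

Publication (announcement) number: US20230385613A1

Publication (announcement) date: 2023-11-30

Application number: US18249389

Application date: 2020-10-29

Applicant: Google LLC

Inventor: Shumeet Baluja

IPC: G06N3/048

CPC classification number: G06N3/048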

Abstract: A computer-implemented method for providing level-reduced tensor data having improved representation of information can include obtaining input tensor data, providing the input tensor data as input to a machine-learned discretization level reduction model configured to receive tensor data having a number of discretization levels and produce, in response to receiving the tensor data, level-reduced tensor data having a reduced number of discretization levels, and obtaining, from the machine-learned discretization level reduction model, the level-reduced tensor data. The machine-learned discretization level reduction model is trained using reconstructed input tensor data generated using an output of the machine-learned discretization level reduction model. The machine-learned discretization level reduction model can include one or more level reduction layers configured to receive input having a first number of discretization levels and to provide a layer output having a reduced number of discretization levels.
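
To make "reducing the number of discretization levels" concrete, the Python sketch below uses a fixed uniform quantizer. This is a deliberate simplification and not the machine-learned model of the abstract: the evenly spaced levels, the `[0, 1]` value range, and the names `reduce_levels` and `reconstruction_error` are assumptions. The error function only hints at the kind of reconstruction-based signal the abstract says is used during training.

```python
from typing import List, Sequence


def reduce_levels(values: Sequence[float], levels: int) -> List[float]:
    """Snap values in [0, 1] onto `levels` evenly spaced discretization levels."""
    if levels < 2:
        raise ValueError("need at least two discretization levels")
    step = 1.0 / (levels - 1)
    reduced = []
    for v in values:
        clipped = min(max(v, 0.0), 1.0)
        # Round to the index of the nearest level, then map back to a value.
        reduced.append(round(clipped / step) * step)
    return reduced


def reconstruction_error(original: Sequence[float], reduced: Sequence[float]) -> float:
    """Mean squared error between the input and its level-reduced version."""
    return sum((a - b) ** 2 for a, b in zip(original, reduced)) / len(original)


tensor = [0.0, 0.2, 0.4, 0.6, 1.0]
reduced = reduce_levels(tensor, levels=3)
print(reduced)
print(reconstruction_error(tensor, reduced))
```

A learned model would replace the fixed rounding rule with level reduction layers whose parameters are trained so that the input can be reconstructed well from the level-reduced output.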

9.

Granted invention patent
Methods, systems, and media for seamless audio melding between songs in a playlist (status: in force)

Publication (announcement) number: US11670338B2

Publication (announcement) date: 2023-06-06

Application number: US17542757

Application date: 2021-12-06

Applicant: Google LLC

Inventor: Michele Covell, Shumeet Baluja

IPC: G11B27/02, G11B27/10, G10L21/10

CPC classification number: G11B27/02, G10L21/10, G11B27/10

Abstract: In accordance with some embodiments of the disclosed subject matter, mechanisms for seamless audio melding between audio items in a playlist are provided. In some embodiments, a method for transitioning between audio items in playlists is provided, comprising: identifying a sequence of audio items in a playlist of audio items, wherein the sequence of audio items includes a first audio item and a second audio item that is to be played subsequent to the first audio item; and modifying an end portion of the first audio item and a beginning portion of the second audio item, where the end portion of the first audio item and the beginning portion of the second audio item are to be played concurrently to transition between the first audio item and the second audio item, wherein the end portion of the first audio item and the beginning portion of the second audio item have an overlap duration, and wherein modifying the end portion of the first audio item and the beginning portion of the second audio item comprises: generating a first spectrogram corresponding to the end portion of the first audio item and a second spectrogram corresponding to the beginning portion of the second audio item; identifying, for each frequency band in a series of frequency bands, a window over which the first spectrogram within the end portion of the first audio item and the second spectrogram within the beginning portion of the second audio item have a particular cross-correlation; modifying, for each frequency band in the series of frequency bands, the end portion of the first spectrogram and the beginning portion of the second spectrogram such that amplitudes of frequencies within the frequency band decrease within the first spectrogram over the end portion of the first spectrogram and that amplitudes of frequencies within the frequency band increase within the second spectrogram over the beginning portion of the second spectrogram; and generating a modified version of the first audio item that includes the modified end portion of the first audio item based on the modified end portion of the first spectrogram and generating a modified version of the second audio item that includes the modified beginning portion of the second audio item based on the modified beginning portion of the second spectrogram.

10.

Granted invention patent
Providing content in response to user actions (status: in force)

Publication (announcement) number: US11455299B1

Publication (announcement) date: 2022-09-27

Application number: US16293898

Application date: 2019-03-06

Applicant: Google LLC

Inventor: Michael Chu, Michele Covell, Joshua J. Sacks, Shumeet Baluja, Zhengrong Ji

IPC: G06F7/00, G06F17/30, G06F16/245

Abstract: Methods, systems and apparatus, including computer programs encoded on a computer storage medium, for selecting keywords for resources are disclosed. In one aspect, a search query is received associated with a first user. A determination is made that the first user is a follower of an entity feed that is provided by a first entity and that is provided through a social network. A content item is selected having distribution parameters specifying that the content item is to be provided to users that are followers of the entity feed and that submit the search query. The selected content item is provided for the first user.
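
The selection rule in this abstract (a content item is eligible only for users who both follow a given entity feed and submit a given search query) can be sketched in Python. The `ContentItem` fields, the exact-match comparison on a normalized query, and the first-match policy are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional, Set


@dataclass(frozen=True)
class ContentItem:
    name: str
    entity_feed: str  # distribution parameter: feed the user must follow
    query: str        # distribution parameter: query the user must submit


def select_content(
    followed_feeds: Set[str],
    search_query: str,
    items: List[ContentItem],
) -> Optional[ContentItem]:
    """Return the first item whose distribution parameters match the user."""
    normalized = search_query.strip().lower()
    for item in items:
        # Both conditions must hold: the user follows the feed
        # and the submitted query matches the targeted query.
        if item.entity_feed in followed_feeds and item.query == normalized:
            return item
    return None


items = [
    ContentItem("promo-a", entity_feed="acme", query="shoes"),
    ContentItem("promo-b", entity_feed="globex", query="shoes"),
]
chosen = select_content({"globex"}, " Shoes ", items)
missing = select_content({"initech"}, "shoes", items)
print(chosen, missing)
```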
