Methods and Systems for Encoding Images

    Publication No.: US20250078494A1

    Publication date: 2025-03-06

    Application No.: US18953894

    Filing date: 2024-11-20

    Applicant: Google LLC

    Abstract: The present disclosure is directed to encoding images. In particular, one or more computing devices can receive data representing one or more machine learning (ML) models configured, at least in part, to encode images comprising objects of a particular type. The computing device(s) can receive data representing an image comprising one or more objects of the particular type. The computing device(s) can generate, based at least in part on the data representing the image and the data representing the ML model(s), data representing an encoded version of the image that alters at least a portion of the image comprising the object(s) such that when the encoded version of the image is decoded, the object(s) are unrecognizable as being of the particular type by one or more object-recognition ML models based at least in part upon which the ML model(s) configured to encode the images were trained.

    Hiding information and images via deep learning

    Publication No.: US11080809B2

    Publication date: 2021-08-03

    Application No.: US16614983

    Filing date: 2018-02-13

    Applicant: Google LLC

    Inventor: Shumeet Baluja

    Abstract: The present disclosure provides systems and methods for hiding information using deep neural networks. In one example, a computer-implemented method is provided to train neural networks for hiding images, which includes inputting a package image and a cover image into an image hiding neural network and generating a carrier image as an output, the carrier image comprising the package image hidden within the cover image. The method includes inputting the carrier image into an image decoding neural network and generating a reconstruction of the package image as an output. The method includes simultaneously training the image decoding neural network based at least in part on a first loss function that describes a difference between the package image and the reconstruction of the package image and the image hiding neural network based at least in part on the first loss function and on a second loss function that describes a difference between the cover image and the carrier image.
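    The two loss functions described in this abstract can be illustrated with a minimal sketch. It assumes images flattened to lists of floats, mean squared error as the difference measure, and a hypothetical weight `beta` on the reconstruction term; none of these choices is stated in the abstract, and the networks themselves are not modeled here.

```python
from typing import Sequence, Tuple


def mse(a: Sequence[float], b: Sequence[float]) -> float:
    """Mean squared error between two equally sized pixel sequences."""
    if len(a) != len(b):
        raise ValueError("inputs must have the same length")
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)


def training_losses(
    cover: Sequence[float],
    package: Sequence[float],
    carrier: Sequence[float],
    reconstruction: Sequence[float],
    beta: float = 1.0,  # hypothetical weighting, not taken from the abstract
) -> Tuple[float, float]:
    """Return (decoder_loss, hider_loss) for one training example.

    first_loss:  difference between the package image and its reconstruction.
    second_loss: difference between the cover image and the carrier image.
    """
    first_loss = mse(package, reconstruction)
    second_loss = mse(cover, carrier)
    decoder_loss = first_loss                      # decoding network: first loss only
    hider_loss = second_loss + beta * first_loss   # hiding network: both losses
    return decoder_loss, hider_loss


decoder_loss, hider_loss = training_losses(
    cover=[0.0, 1.0],
    package=[1.0, 1.0],
    carrier=[0.0, 0.5],
    reconstruction=[1.0, 0.0],
)
```

    In a real training loop both values would be backpropagated in the same step, which is one way to read "simultaneously training" in the abstract.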

    METHODS, SYSTEMS, AND MEDIA FOR SEAMLESS AUDIO MELDING BETWEEN SONGS IN A PLAYLIST

    Publication No.: US20210166731A1

    Publication date: 2021-06-03

    Application No.: US17009001

    Filing date: 2020-09-01

    Applicant: Google LLC

    Abstract: In accordance with some embodiments of the disclosed subject matter, mechanisms for seamless audio melding between audio items in a playlist are provided. In some embodiments, a method for transitioning between audio items in playlists is provided, comprising: identifying a sequence of audio items in a playlist of audio items, wherein the sequence of audio items includes a first audio item and a second audio item that is to be played subsequent to the first audio item; and modifying an end portion of the first audio item and a beginning portion of the second audio item, where the end portion of the first audio item and the beginning portion of the second audio item are to be played concurrently to transition between the first audio item and the second audio item, wherein the end portion of the first audio item and the beginning portion of the second audio item have an overlap duration, and wherein modifying the end portion of the first audio item and the beginning portion of the second audio item comprises: generating a first spectrogram corresponding to the end portion of the first audio item and a second spectrogram corresponding to the beginning portion of the second audio item; identifying, for each frequency band in a series of frequency bands, a window over which the first spectrogram within the end portion of the first audio item and the second spectrogram within the beginning portion of the second audio item have a particular cross-correlation; modifying, for each frequency band in the series of frequency bands, the end portion of the first spectrogram and the beginning portion of the second spectrogram such that amplitudes of frequencies within the frequency band decrease within the first spectrogram over the end portion of the first spectrogram and that amplitudes of frequencies within the frequency band increase within the second spectrogram over the beginning portion of the second spectrogram; and generating a modified version of the first audio item that includes the modified end portion of the first audio item based on the modified end portion of the first spectrogram and generating a modified version of the second audio item that includes the modified beginning portion of the second audio item based on the modified beginning portion of the second spectrogram.
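    The per-band amplitude modification in this abstract can be sketched for a single frequency band. The sketch assumes the crossfade window has already been chosen (the cross-correlation search is not shown), that each band is a list of amplitudes over the overlapping frames, and that the fade is linear; the function name and the linear ramp are illustrative assumptions, not details from the abstract.

```python
from typing import List, Sequence, Tuple


def crossfade_band(
    first_end: Sequence[float],
    second_start: Sequence[float],
    window: Tuple[int, int],
) -> Tuple[List[float], List[float]]:
    """Fade one frequency band out of the first item and into the second.

    first_end / second_start hold the band's amplitudes over the overlapping
    frames. window = (start, stop) is the frame range for the fade, assumed
    to have been selected beforehand for this band.
    """
    start, stop = window
    n = len(first_end)
    if len(second_start) != n:
        raise ValueError("both bands must cover the same overlap frames")
    if not (0 <= start < stop <= n):
        raise ValueError("window must lie inside the overlap")

    faded_out: List[float] = []
    faded_in: List[float] = []
    for t in range(n):
        if t < start:
            gain_in = 0.0            # before the window: only the first item
        elif t >= stop:
            gain_in = 1.0            # after the window: only the second item
        else:
            gain_in = (t - start) / (stop - start)  # linear ramp (assumption)
        faded_out.append(first_end[t] * (1.0 - gain_in))
        faded_in.append(second_start[t] * gain_in)
    return faded_out, faded_in


out_band, in_band = crossfade_band([1.0, 1.0, 1.0, 1.0], [2.0, 2.0, 2.0, 2.0], (1, 3))
```

    Running this independently for each band, with a different window per band, gives the band-wise transition the abstract describes; converting the modified spectrograms back to audio is a separate step not covered here.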

    Automatic language model update

    Granted invention patent

    Publication No.: US10410627B2

    Publication date: 2019-09-10

    Application No.: US15922154

    Filing date: 2018-03-15

    Applicant: Google LLC

    Abstract: A method for generating a speech recognition model includes accessing a baseline speech recognition model, obtaining information related to recent language usage from search queries, and modifying the speech recognition model to revise probabilities of a portion of a sound occurrence based on the information. The portion of a sound may include a word. Also, a method for generating a speech recognition model includes receiving at a search engine from a remote device an audio recording and a transcript that substantially represents at least a portion of the audio recording, synchronizing the transcript with the audio recording, extracting one or more letters from the transcript and extracting the associated pronunciation of the one or more letters from the audio recording, and generating a dictionary entry in a pronunciation dictionary.

    Predictive information retrieval

    Granted invention patent

    Publication No.: US10275503B2

    Publication date: 2019-04-30

    Application No.: US15791584

    Filing date: 2017-10-24

    Applicant: Google LLC

    Abstract: A computer-implemented method for generating results for a client-requested query involves receiving a query produced by a client communication device, generating a result for the query in response to reception of the query, determining one or more predictive follow-up requests before receiving an actual follow-up request from the client device, and initiating retrieval of information associated with the one or more predictive follow-up requests, and transmitting at least part of the result to the client device, and then transmitting to the client device at least part of the information associated with the one or more predictive follow-up requests.
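    The ordering of steps in this abstract (answer the query, then push information for predicted follow-ups before the client asks) can be sketched as follows. The `fetch` and `predict_follow_ups` callables and the message tuples are hypothetical stand-ins; the abstract does not say how predictions are made or how messages are framed.

```python
from typing import Callable, Iterable, List, Tuple

Message = Tuple[str, str, str]  # (kind, query, payload); illustrative format


def respond(
    query: str,
    fetch: Callable[[str], str],
    predict_follow_ups: Callable[[str], Iterable[str]],
) -> List[Message]:
    """Build the ordered messages sent to the client for one query.

    The result for the actual query comes first. Information for each
    predicted follow-up request is retrieved without waiting for the client
    to ask, and is transmitted afterwards.
    """
    messages: List[Message] = [("result", query, fetch(query))]
    for follow_up in predict_follow_ups(query):
        messages.append(("prefetch", follow_up, fetch(follow_up)))
    return messages


# Toy data source and predictor, for illustration only.
pages = {"weather": "sunny", "weather tomorrow": "rain"}
messages = respond(
    "weather",
    fetch=lambda q: pages.get(q, ""),
    predict_follow_ups=lambda q: [q + " tomorrow"],
)
```

    A client that stores the "prefetch" messages can answer the follow-up locally if the user actually issues it.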

    Machine-Learned Discretization Level Reduction

    Publication No.: US20230385613A1

    Publication date: 2023-11-30

    Application No.: US18249389

    Filing date: 2020-10-29

    Applicant: Google LLC

    Inventor: Shumeet Baluja

    CPC classification number: G06N3/048

    Abstract: A computer-implemented method for providing level-reduced tensor data having improved representation of information can include obtaining input tensor data, providing the input tensor data as input to a machine-learned discretization level reduction model configured to receive tensor data having a number of discretization levels and produce, in response to receiving the tensor data, level-reduced tensor data having a reduced number of discretization levels, and obtaining, from the machine-learned discretization level reduction model, the level-reduced tensor data. The machine-learned discretization level reduction model is trained using reconstructed input tensor data generated using an output of the machine-learned discretization level reduction model. The machine-learned discretization level reduction model can include one or more level reduction layers configured to receive input having a first number of discretization levels and to provide a layer output having a reduced number of discretization levels.
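    To make "discretization levels" concrete, the sketch below reduces integer data from one number of levels to a smaller number and then reconstructs an approximation of the input. It uses a fixed uniform quantizer as a stand-in: the abstract describes a machine-learned model, which this is not, and the function names are hypothetical.

```python
from typing import List, Sequence


def reduce_levels(values: Sequence[int], in_levels: int, out_levels: int) -> List[int]:
    """Map integers in [0, in_levels) to integers in [0, out_levels).

    Uniform binning is an assumption for illustration; a learned model
    would choose the mapping from data instead.
    """
    if not 0 < out_levels <= in_levels:
        raise ValueError("out_levels must be in (0, in_levels]")
    return [v * out_levels // in_levels for v in values]


def reconstruct(levels: Sequence[int], in_levels: int, out_levels: int) -> List[int]:
    """Map each reduced level back to the centre of its original bin."""
    return [(2 * k + 1) * in_levels // (2 * out_levels) for k in levels]


def reconstruction_error(original: Sequence[int], rebuilt: Sequence[int]) -> float:
    """Mean absolute difference between the input and its reconstruction."""
    return sum(abs(a - b) for a, b in zip(original, rebuilt)) / len(original)


pixels = [0, 128, 255]
reduced = reduce_levels(pixels, in_levels=256, out_levels=4)
rebuilt = reconstruct(reduced, in_levels=256, out_levels=4)
error = reconstruction_error(pixels, rebuilt)
```

    The reconstruction error computed at the end is the kind of signal the abstract says is used for training, since the model is trained using reconstructed input generated from its own output.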

    Methods, systems, and media for seamless audio melding between songs in a playlist

    Publication No.: US11670338B2

    Publication date: 2023-06-06

    Application No.: US17542757

    Filing date: 2021-12-06

    Applicant: Google LLC

    CPC classification number: G11B27/02 G10L21/10 G11B27/10

    Abstract: In accordance with some embodiments of the disclosed subject matter, mechanisms for seamless audio melding between audio items in a playlist are provided. In some embodiments, a method for transitioning between audio items in playlists is provided, comprising: identifying a sequence of audio items in a playlist of audio items, wherein the sequence of audio items includes a first audio item and a second audio item that is to be played subsequent to the first audio item; and modifying an end portion of the first audio item and a beginning portion of the second audio item, where the end portion of the first audio item and the beginning portion of the second audio item are to be played concurrently to transition between the first audio item and the second audio item, wherein the end portion of the first audio item and the beginning portion of the second audio item have an overlap duration, and wherein modifying the end portion of the first audio item and the beginning portion of the second audio item comprises: generating a first spectrogram corresponding to the end portion of the first audio item and a second spectrogram corresponding to the beginning portion of the second audio item; identifying, for each frequency band in a series of frequency bands, a window over which the first spectrogram within the end portion of the first audio item and the second spectrogram within the beginning portion of the second audio item have a particular cross-correlation; modifying, for each frequency band in the series of frequency bands, the end portion of the first spectrogram and the beginning portion of the second spectrogram such that amplitudes of frequencies within the frequency band decrease within the first spectrogram over the end portion of the first spectrogram and that amplitudes of frequencies within the frequency band increase within the second spectrogram over the beginning portion of the second spectrogram; and generating a modified version of the first audio item that includes the modified end portion of the first audio item based on the modified end portion of the first spectrogram and generating a modified version of the second audio item that includes the modified beginning portion of the second audio item based on the modified beginning portion of the second spectrogram.

    Providing content in response to user actions

    Publication No.: US11455299B1

    Publication date: 2022-09-27

    Application No.: US16293898

    Filing date: 2019-03-06

    Applicant: Google LLC

    Abstract: Methods, systems and apparatus, including computer programs encoded on a computer storage medium for selecting keywords for resources are disclosed. In one aspect, a search query is received associated with a first user. A determination is made that the first user is a follower of an entity feed that is provided by a first entity and that is provided through a social network. A content item is selected having distribution parameters specifying that the content item is to be provided to users that are followers of the entity feed and that submit the search query. The selected content item is provided for the first user.
