Patent search ap:("Google LLC") AND inv:"Ryan M. Rifkin" Page 2

11.

发明申请
TIME-SYNCHRONIZED, MULTIZONE MEDIA STREAMING 有权

公开(公告)号：US20220408141A1

公开(公告)日：2022-12-22

申请号：US17891284

申请日：2022-08-19

Applicant: Google LLC

Inventor： Kenneth J. MacKay , Byungchul Kim , Tavis A. Maclellan , Richard F. Lyon , Chet N. Gnegy , Pascal T. Getreuer , Chien-Jung Kung , Tomer Shekel , Ryan M. Rifkin

IPC: H04N21/43 , H04N21/436 , H04N21/442 , H04N21/64

Abstract: In a general aspect, a system for media playback can include a first media playback device configured to receive a media stream from a media casting device over a data network, the first media playback device being a member of the media playback group and a second media playback device configured to receive the media stream, the second media playback device being a member of the media playback group. The first media playback device and the second media playback device can be collectively configured to designate one of the first media playback device and the second media playback device as a leader playback device of the media playback group. The playback device not designated as the leader playback device can be designated as a follower playback device of the media playback group. The first media playback device and the second media playback device can be further collectively configured to determine a clock offset between the leader playback device and the follower playback device. The leader playback device can be configured to receive a broadcast of the media stream over the data network; play the media stream; and provide the media stream to the follower playback device. The follower playback device can be configured to play the media stream in synchronization with the leader playback device based on the clock offset.

12.

发明授权
Time-synchronized, multizone media streaming 有权

公开(公告)号：US11463762B2

公开(公告)日：2022-10-04

申请号：US17360264

申请日：2021-06-28

Applicant: Google LLC

Inventor： Kenneth J. MacKay , Byungchul Kim , Tavis A. Maclellan , Richard F. Lyon , Chet N. Gnegy , Pascal T. Getreuer , Chien-Jung Kung , Tomer Shekel , Ryan M. Rifkin

IPC: H04N21/43 , H04N21/436 , H04N21/442 , H04N21/64

Abstract: In a general aspect, a system for media playback can include a first media playback device configured to receive a media stream from a media casting device over a data network, the first media playback device being a member of the media playback group and a second media playback device configured to receive the media stream, the second media playback device being a member of the media playback group. The first media playback device and the second media playback device can be collectively configured to designate one of the first media playback device and the second media playback device as a leader playback device of the media playback group. The playback device not designated as the leader playback device can be designated as a follower playback device of the media playback group. The first media playback device and the second media playback device can be further collectively configured to determine a clock offset between the leader playback device and the follower playback device. The leader playback device can be configured to receive a broadcast of the media stream over the data network; play the media stream; and provide the media stream to the follower playback device. The follower playback device can be configured to play the media stream in synchronization with the leader playback device based on the clock offset.

13.

发明申请
END-TO-END TEXT-TO-SPEECH CONVERSION 有权

公开(公告)号：US20210366463A1

公开(公告)日：2021-11-25

申请号：US17391799

申请日：2021-08-02

Applicant: Google LLC

Inventor： Samuel Bengio , Yuxuan Wang , Zongheng Yang , Zhifeng Chen , Yonghui Wu , Ioannis Agiomyrgiannakis , Ron J. Weiss , Navdeep Jaitly , Ryan M. Rifkin , Robert Andrew James Clark , Quoc V. Le , Russell J. Ryan , Ying Xiao

IPC: G10L13/08 , G06N3/08 , G10L25/18 , G10L25/30 , G10L13/04 , G06N3/04 , G10L15/16

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating speech from text. One of the systems includes one or more computers and one or more storage devices storing instructions that when executed by one or more computers cause the one or more computers to implement: a sequence-to-sequence recurrent neural network configured to: receive a sequence of characters in a particular natural language, and process the sequence of characters to generate a spectrogram of a verbal utterance of the sequence of characters in the particular natural language; and a subsystem configured to: receive the sequence of characters in the particular natural language, and provide the sequence of characters as input to the sequence-to-sequence recurrent neural network to obtain as output the spectrogram of the verbal utterance of the sequence of characters in the particular natural language.

14.

发明申请
END-TO-END TEXT-TO-SPEECH CONVERSION 审中-公开

公开(公告)号：US20200098350A1

公开(公告)日：2020-03-26

申请号：US16696101

申请日：2019-11-26

Applicant: Google LLC

Inventor： Samuel Bengio , Yuxuan Wang , Zongheng Yang , Zhifeng Chen , Yonghui Wu , Ioannis Agiomyrgiannakis , Ron J. Weiss , Navdeep Jaitly , Ryan M. Rifkin , Robert Andrew James Clark , Quoc V. Le , Russell J. Ryan , Ying Xiao

IPC: G10L13/08 , G10L15/16 , G06N3/08 , G06N3/04 , G10L13/04 , G10L25/30 , G10L25/18

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating speech from text. One of the systems includes one or more computers and one or more storage devices storing instructions that when executed by one or more computers cause the one or more computers to implement: a sequence-to-sequence recurrent neural network configured to: receive a sequence of characters in a particular natural language, and process the sequence of characters to generate a spectrogram of a verbal utterance of the sequence of characters in the particular natural language; and a subsystem configured to: receive the sequence of characters in the particular natural language, and provide the sequence of characters as input to the sequence-to-sequence recurrent neural network to obtain as output the spectrogram of the verbal utterance of the sequence of characters in the particular natural language.

15.

发明申请
END-TO-END TEXT-TO-SPEECH CONVERSION 审中-公开

公开(公告)号：US20190311708A1

公开(公告)日：2019-10-10

申请号：US16447862

申请日：2019-06-20

Applicant: Google LLC

Inventor： Samy Bengio , Yuxuan Wang , Zongheng Yang , Zhifeng Chen , Yonghui Wu , Ioannis Agiomyrgiannakis , Ron J. Weiss , Navdeep Jaitly , Ryan M. Rifkin , Robert Andrew James Clark , Quoc V. Le , Russell J. Ryan , Ying Xiao

IPC: G10L13/08 , G10L25/18 , G10L25/30 , G06N3/08

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating speech from text. One of the systems includes one or more computers and one or more storage devices storing instructions that when executed by one or more computers cause the one or more computers to implement: a sequence-to-sequence recurrent neural network configured to: receive a sequence of characters in a particular natural language, and process the sequence of characters to generate a spectrogram of a verbal utterance of the sequence of characters in the particular natural language; and a subsystem configured to: receive the sequence of characters in the particular natural language, and provide the sequence of characters as input to the sequence-to-sequence recurrent neural network to obtain as output the spectrogram of the verbal utterance of the sequence of characters in the particular natural language.

16.

发明授权
End-to-end text-to-speech conversion 有权

公开(公告)号：US12190860B2

公开(公告)日：2025-01-07

申请号：US18516069

申请日：2023-11-21

Applicant: Google LLC

Inventor： Samuel Bengio , Yuxuan Wang , Zongheng Yang , Zhifeng Chen , Yonghui Wu , Ioannis Agiomyrgiannakis , Ron J. Weiss , Navdeep Jaitly , Ryan M. Rifkin , Robert Andrew James Clark , Quoc V. Le , Russell J. Ryan , Ying Xiao

IPC: G10L13/06 , G06N3/045 , G06N3/08 , G06N3/084 , G10L13/04 , G10L13/08 , G10L15/16 , G10L25/18 , G10L25/30

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating speech from text. One of the systems includes one or more computers and one or more storage devices storing instructions that when executed by one or more computers cause the one or more computers to implement: a sequence-to-sequence recurrent neural network configured to: receive a sequence of characters in a particular natural language, and process the sequence of characters to generate a spectrogram of a verbal utterance of the sequence of characters in the particular natural language; and a subsystem configured to: receive the sequence of characters in the particular natural language, and provide the sequence of characters as input to the sequence-to-sequence recurrent neural network to obtain as output the spectrogram of the verbal utterance of the sequence of characters in the particular natural language.

17.

发明授权
Synthesizing speech from text using neural networks 有权

公开(公告)号：US12148444B2

公开(公告)日：2024-11-19

申请号：US17222736

申请日：2021-04-05

Applicant: Google LLC

Inventor： Yonghui Wu , Jonathan Shen , Ruoming Pang , Ron J. Weiss , Michael Schuster , Navdeep Jaitly , Zongheng Yang , Zhifeng Chen , Yu Zhang , Yuxuan Wang , Russell John Wyatt Skerry-Ryan , Ryan M. Rifkin , Ioannis Agiomyrgiannakis

IPC: G10L13/047 , G06N3/045 , G06N3/08 , G06N5/046 , G06N7/01 , G10L13/08 , G10L25/18 , G10L25/30

Abstract: Methods, systems, and computer program products for generating, from an input character sequence, an output sequence of audio data representing the input character sequence. The output sequence of audio data includes a respective audio output sample for each of a number of time steps. One example method includes, for each of the time steps: generating a mel-frequency spectrogram for the time step by processing a representation of a respective portion of the input character sequence using a decoder neural network; generating a probability distribution over a plurality of possible audio output samples for the time step by processing the mel-frequency spectrogram for the time step using a vocoder neural network; and selecting the audio output sample for the time step from the possible audio output samples in accordance with the probability distribution.

18.

发明公开
END-TO-END TEXT-TO-SPEECH CONVERSION 审中-公开

公开(公告)号：US20240127791A1

公开(公告)日：2024-04-18

申请号：US18516069

申请日：2023-11-21

Applicant: Google LLC

Inventor： Samuel Bengio , Yuxuan Wang , Zongheng Yang , Zhifeng Chen , Yonghui Wu , Ioannis Agiomyrgiannakis , Ron J. Weiss , Navdeep Jaitly , Ryan M. Rifkin , Robert Andrew James Clark , Quoc V. Le , Russell J. Ryan , Ying Xiao

IPC: G10L13/08 , G06N3/045 , G06N3/08 , G06N3/084 , G10L13/04 , G10L15/16 , G10L25/18 , G10L25/30

CPC classification number: G10L13/08 , G06N3/045 , G06N3/08 , G06N3/084 , G10L13/04 , G10L15/16 , G10L25/18 , G10L25/30

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating speech from text. One of the systems includes one or more computers and one or more storage devices storing instructions that when executed by one or more computers cause the one or more computers to implement: a sequence-to-sequence recurrent neural network configured to: receive a sequence of characters in a particular natural language, and process the sequence of characters to generate a spectrogram of a verbal utterance of the sequence of characters in the particular natural language; and a subsystem configured to: receive the sequence of characters in the particular natural language, and provide the sequence of characters as input to the sequence-to-sequence recurrent neural network to obtain as output the spectrogram of the verbal utterance of the sequence of characters in the particular natural language.

19.

发明授权
Time-synchronized, multizone media streaming 有权

公开(公告)号：US11871067B2

公开(公告)日：2024-01-09

申请号：US17891284

申请日：2022-08-19

Applicant: Google LLC

Inventor： Kenneth J. Mackay , Byungchul Kim , Tavis A. Maclellan , Richard F. Lyon , Chet N. Gnegy , Pascal T. Getreuer , Chien-Jung Kung , Tomer Shekel , Ryan M. Rifkin

IPC: H04N21/43 , H04N21/436 , H04N21/442 , H04N21/64

CPC classification number: H04N21/4305 , H04N21/4302 , H04N21/43076 , H04N21/43615 , H04N21/44227 , H04N21/64

Abstract: In a general aspect, a system for media playback can include a first media playback device configured to receive a media stream from a media casting device over a data network, the first media playback device being a member of the media playback group and a second media playback device configured to receive the media stream, the second media playback device being a member of the media playback group. The first media playback device and the second media playback device can be collectively configured to designate one of the first media playback device and the second media playback device as a leader playback device of the media playback group. The playback device not designated as the leader playback device can be designated as a follower playback device of the media playback group. The first media playback device and the second media playback device can be further collectively configured to determine a clock offset between the leader playback device and the follower playback device. The leader playback device can be configured to receive a broadcast of the media stream over the data network; play the media stream; and provide the media stream to the follower playback device. The follower playback device can be configured to play the media stream in synchronization with the leader playback device based on the clock offset.

20.

发明授权
Generating a playlist 有权

公开(公告)号：US11461388B2

公开(公告)日：2022-10-04

申请号：US16105717

申请日：2018-08-20

Applicant: Google LLC

Inventor： Geremy A. Heitz, III , Adam Berenzweig , Jason E. Weston , Ron J. Weiss , Sally A. Goldman , Thomas Walters , Samy Bengio , Douglas Eck , Jay M. Ponte , Ryan M. Rifkin

IPC: G06F16/00 , G06F16/638 , G06F16/683

Abstract: Generating a playlist may include designating a seed track in an audio library; identifying audio tracks in the audio library having constructs that are within a range of a corresponding construct of the seed track, where the constructs for the audio tracks are derived from frequency representations of the audio tracks, and the corresponding construct for the seed track is derived from a frequency representation of the seed track; and generating the playlist using at least some of the audio tracks that were identified.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification