-
公开(公告)号:US20220408141A1
公开(公告)日:2022-12-22
申请号:US17891284
申请日:2022-08-19
Applicant: Google LLC
Inventor: Kenneth J. MacKay , Byungchul Kim , Tavis A. Maclellan , Richard F. Lyon , Chet N. Gnegy , Pascal T. Getreuer , Chien-Jung Kung , Tomer Shekel , Ryan M. Rifkin
IPC: H04N21/43 , H04N21/436 , H04N21/442 , H04N21/64
Abstract: In a general aspect, a system for media playback can include a first media playback device configured to receive a media stream from a media casting device over a data network, the first media playback device being a member of the media playback group and a second media playback device configured to receive the media stream, the second media playback device being a member of the media playback group. The first media playback device and the second media playback device can be collectively configured to designate one of the first media playback device and the second media playback device as a leader playback device of the media playback group. The playback device not designated as the leader playback device can be designated as a follower playback device of the media playback group. The first media playback device and the second media playback device can be further collectively configured to determine a clock offset between the leader playback device and the follower playback device. The leader playback device can be configured to receive a broadcast of the media stream over the data network; play the media stream; and provide the media stream to the follower playback device. The follower playback device can be configured to play the media stream in synchronization with the leader playback device based on the clock offset.
-
公开(公告)号:US11463762B2
公开(公告)日:2022-10-04
申请号:US17360264
申请日:2021-06-28
Applicant: Google LLC
Inventor: Kenneth J. MacKay , Byungchul Kim , Tavis A. Maclellan , Richard F. Lyon , Chet N. Gnegy , Pascal T. Getreuer , Chien-Jung Kung , Tomer Shekel , Ryan M. Rifkin
IPC: H04N21/43 , H04N21/436 , H04N21/442 , H04N21/64
Abstract: In a general aspect, a system for media playback can include a first media playback device configured to receive a media stream from a media casting device over a data network, the first media playback device being a member of the media playback group and a second media playback device configured to receive the media stream, the second media playback device being a member of the media playback group. The first media playback device and the second media playback device can be collectively configured to designate one of the first media playback device and the second media playback device as a leader playback device of the media playback group. The playback device not designated as the leader playback device can be designated as a follower playback device of the media playback group. The first media playback device and the second media playback device can be further collectively configured to determine a clock offset between the leader playback device and the follower playback device. The leader playback device can be configured to receive a broadcast of the media stream over the data network; play the media stream; and provide the media stream to the follower playback device. The follower playback device can be configured to play the media stream in synchronization with the leader playback device based on the clock offset.
-
公开(公告)号:US20210366463A1
公开(公告)日:2021-11-25
申请号:US17391799
申请日:2021-08-02
Applicant: Google LLC
Inventor: Samuel Bengio , Yuxuan Wang , Zongheng Yang , Zhifeng Chen , Yonghui Wu , Ioannis Agiomyrgiannakis , Ron J. Weiss , Navdeep Jaitly , Ryan M. Rifkin , Robert Andrew James Clark , Quoc V. Le , Russell J. Ryan , Ying Xiao
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating speech from text. One of the systems includes one or more computers and one or more storage devices storing instructions that when executed by one or more computers cause the one or more computers to implement: a sequence-to-sequence recurrent neural network configured to: receive a sequence of characters in a particular natural language, and process the sequence of characters to generate a spectrogram of a verbal utterance of the sequence of characters in the particular natural language; and a subsystem configured to: receive the sequence of characters in the particular natural language, and provide the sequence of characters as input to the sequence-to-sequence recurrent neural network to obtain as output the spectrogram of the verbal utterance of the sequence of characters in the particular natural language.
-
公开(公告)号:US20200098350A1
公开(公告)日:2020-03-26
申请号:US16696101
申请日:2019-11-26
Applicant: Google LLC
Inventor: Samuel Bengio , Yuxuan Wang , Zongheng Yang , Zhifeng Chen , Yonghui Wu , Ioannis Agiomyrgiannakis , Ron J. Weiss , Navdeep Jaitly , Ryan M. Rifkin , Robert Andrew James Clark , Quoc V. Le , Russell J. Ryan , Ying Xiao
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating speech from text. One of the systems includes one or more computers and one or more storage devices storing instructions that when executed by one or more computers cause the one or more computers to implement: a sequence-to-sequence recurrent neural network configured to: receive a sequence of characters in a particular natural language, and process the sequence of characters to generate a spectrogram of a verbal utterance of the sequence of characters in the particular natural language; and a subsystem configured to: receive the sequence of characters in the particular natural language, and provide the sequence of characters as input to the sequence-to-sequence recurrent neural network to obtain as output the spectrogram of the verbal utterance of the sequence of characters in the particular natural language.
-
公开(公告)号:US20190311708A1
公开(公告)日:2019-10-10
申请号:US16447862
申请日:2019-06-20
Applicant: Google LLC
Inventor: Samy Bengio , Yuxuan Wang , Zongheng Yang , Zhifeng Chen , Yonghui Wu , Ioannis Agiomyrgiannakis , Ron J. Weiss , Navdeep Jaitly , Ryan M. Rifkin , Robert Andrew James Clark , Quoc V. Le , Russell J. Ryan , Ying Xiao
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating speech from text. One of the systems includes one or more computers and one or more storage devices storing instructions that when executed by one or more computers cause the one or more computers to implement: a sequence-to-sequence recurrent neural network configured to: receive a sequence of characters in a particular natural language, and process the sequence of characters to generate a spectrogram of a verbal utterance of the sequence of characters in the particular natural language; and a subsystem configured to: receive the sequence of characters in the particular natural language, and provide the sequence of characters as input to the sequence-to-sequence recurrent neural network to obtain as output the spectrogram of the verbal utterance of the sequence of characters in the particular natural language.
-
公开(公告)号:US12190860B2
公开(公告)日:2025-01-07
申请号:US18516069
申请日:2023-11-21
Applicant: Google LLC
Inventor: Samuel Bengio , Yuxuan Wang , Zongheng Yang , Zhifeng Chen , Yonghui Wu , Ioannis Agiomyrgiannakis , Ron J. Weiss , Navdeep Jaitly , Ryan M. Rifkin , Robert Andrew James Clark , Quoc V. Le , Russell J. Ryan , Ying Xiao
IPC: G10L13/06 , G06N3/045 , G06N3/08 , G06N3/084 , G10L13/04 , G10L13/08 , G10L15/16 , G10L25/18 , G10L25/30
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating speech from text. One of the systems includes one or more computers and one or more storage devices storing instructions that when executed by one or more computers cause the one or more computers to implement: a sequence-to-sequence recurrent neural network configured to: receive a sequence of characters in a particular natural language, and process the sequence of characters to generate a spectrogram of a verbal utterance of the sequence of characters in the particular natural language; and a subsystem configured to: receive the sequence of characters in the particular natural language, and provide the sequence of characters as input to the sequence-to-sequence recurrent neural network to obtain as output the spectrogram of the verbal utterance of the sequence of characters in the particular natural language.
-
公开(公告)号:US12148444B2
公开(公告)日:2024-11-19
申请号:US17222736
申请日:2021-04-05
Applicant: Google LLC
Inventor: Yonghui Wu , Jonathan Shen , Ruoming Pang , Ron J. Weiss , Michael Schuster , Navdeep Jaitly , Zongheng Yang , Zhifeng Chen , Yu Zhang , Yuxuan Wang , Russell John Wyatt Skerry-Ryan , Ryan M. Rifkin , Ioannis Agiomyrgiannakis
Abstract: Methods, systems, and computer program products for generating, from an input character sequence, an output sequence of audio data representing the input character sequence. The output sequence of audio data includes a respective audio output sample for each of a number of time steps. One example method includes, for each of the time steps: generating a mel-frequency spectrogram for the time step by processing a representation of a respective portion of the input character sequence using a decoder neural network; generating a probability distribution over a plurality of possible audio output samples for the time step by processing the mel-frequency spectrogram for the time step using a vocoder neural network; and selecting the audio output sample for the time step from the possible audio output samples in accordance with the probability distribution.
-
公开(公告)号:US20240127791A1
公开(公告)日:2024-04-18
申请号:US18516069
申请日:2023-11-21
Applicant: Google LLC
Inventor: Samuel Bengio , Yuxuan Wang , Zongheng Yang , Zhifeng Chen , Yonghui Wu , Ioannis Agiomyrgiannakis , Ron J. Weiss , Navdeep Jaitly , Ryan M. Rifkin , Robert Andrew James Clark , Quoc V. Le , Russell J. Ryan , Ying Xiao
CPC classification number: G10L13/08 , G06N3/045 , G06N3/08 , G06N3/084 , G10L13/04 , G10L15/16 , G10L25/18 , G10L25/30
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating speech from text. One of the systems includes one or more computers and one or more storage devices storing instructions that when executed by one or more computers cause the one or more computers to implement: a sequence-to-sequence recurrent neural network configured to: receive a sequence of characters in a particular natural language, and process the sequence of characters to generate a spectrogram of a verbal utterance of the sequence of characters in the particular natural language; and a subsystem configured to: receive the sequence of characters in the particular natural language, and provide the sequence of characters as input to the sequence-to-sequence recurrent neural network to obtain as output the spectrogram of the verbal utterance of the sequence of characters in the particular natural language.
-
公开(公告)号:US11871067B2
公开(公告)日:2024-01-09
申请号:US17891284
申请日:2022-08-19
Applicant: Google LLC
Inventor: Kenneth J. Mackay , Byungchul Kim , Tavis A. Maclellan , Richard F. Lyon , Chet N. Gnegy , Pascal T. Getreuer , Chien-Jung Kung , Tomer Shekel , Ryan M. Rifkin
IPC: H04N21/43 , H04N21/436 , H04N21/442 , H04N21/64
CPC classification number: H04N21/4305 , H04N21/4302 , H04N21/43076 , H04N21/43615 , H04N21/44227 , H04N21/64
Abstract: In a general aspect, a system for media playback can include a first media playback device configured to receive a media stream from a media casting device over a data network, the first media playback device being a member of the media playback group and a second media playback device configured to receive the media stream, the second media playback device being a member of the media playback group. The first media playback device and the second media playback device can be collectively configured to designate one of the first media playback device and the second media playback device as a leader playback device of the media playback group. The playback device not designated as the leader playback device can be designated as a follower playback device of the media playback group. The first media playback device and the second media playback device can be further collectively configured to determine a clock offset between the leader playback device and the follower playback device. The leader playback device can be configured to receive a broadcast of the media stream over the data network; play the media stream; and provide the media stream to the follower playback device. The follower playback device can be configured to play the media stream in synchronization with the leader playback device based on the clock offset.
-
公开(公告)号:US11461388B2
公开(公告)日:2022-10-04
申请号:US16105717
申请日:2018-08-20
Applicant: Google LLC
Inventor: Geremy A. Heitz, III , Adam Berenzweig , Jason E. Weston , Ron J. Weiss , Sally A. Goldman , Thomas Walters , Samy Bengio , Douglas Eck , Jay M. Ponte , Ryan M. Rifkin
IPC: G06F16/00 , G06F16/638 , G06F16/683
Abstract: Generating a playlist may include designating a seed track in an audio library; identifying audio tracks in the audio library having constructs that are within a range of a corresponding construct of the seed track, where the constructs for the audio tracks are derived from frequency representations of the audio tracks, and the corresponding construct for the seed track is derived from a frequency representation of the seed track; and generating the playlist using at least some of the audio tracks that were identified.
-
-
-
-
-
-
-
-
-