Patent search ap:("Google LLC") AND inv:"Ioannis Agiomyrgiannakis" Page 1

1.

发明公开
DEVICES AND METHODS FOR A SPEECH-BASED USER INTERFACE 审中-公开

公开(公告)号：US20240029706A1

公开(公告)日：2024-01-25

申请号：US18479785

申请日：2023-10-02

Applicant: Google LLC

Inventor： Ioannis Agiomyrgiannakis , Fergus James Henderson

IPC: G10L13/033 , G06F3/16 , G10L13/10

CPC classification number: G10L13/033 , G06F3/167 , G10L13/10 , G10L2021/0135

Abstract: A device may identify a plurality of sources for outputs that the device is configured to provide. The plurality of sources may include at least one of a particular application in the device, an operating system of the device, a particular area within a display of the device, or a particular graphical user interface object. The device may also assign a set of distinct voices to respective sources of the plurality of sources. The device may also receive a request for speech output. The device may also select a particular source that is associated with the requested speech output. The device may also generate speech having particular voice characteristics of a particular voice assigned to the particular source.

2.

发明授权
End-to-end text-to-speech conversion 有权

公开(公告)号：US11862142B2

公开(公告)日：2024-01-02

申请号：US17391799

申请日：2021-08-02

Applicant: Google LLC

Inventor： Samuel Bengio , Yuxuan Wang , Zongheng Yang , Zhifeng Chen , Yonghui Wu , Ioannis Agiomyrgiannakis , Ron J. Weiss , Navdeep Jaitly , Ryan M. Rifkin , Robert Andrew James Clark , Quoc V. Le , Russell J. Ryan , Ying Xiao

IPC: G10L13/06 , G10L13/08 , G06N3/08 , G10L25/18 , G10L25/30 , G10L13/04 , G06N3/084 , G10L15/16 , G06N3/045

CPC classification number: G10L13/08 , G06N3/045 , G06N3/08 , G06N3/084 , G10L13/04 , G10L15/16 , G10L25/18 , G10L25/30

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating speech from text. One of the systems includes one or more computers and one or more storage devices storing instructions that when executed by one or more computers cause the one or more computers to implement: a sequence-to-sequence recurrent neural network configured to: receive a sequence of characters in a particular natural language, and process the sequence of characters to generate a spectrogram of a verbal utterance of the sequence of characters in the particular natural language; and a subsystem configured to: receive the sequence of characters in the particular natural language, and provide the sequence of characters as input to the sequence-to-sequence recurrent neural network to obtain as output the spectrogram of the verbal utterance of the sequence of characters in the particular natural language.

3.

发明授权
Devices and methods for a speech-based user interface 有权

公开(公告)号：US11798526B2

公开(公告)日：2023-10-24

申请号：US17653005

申请日：2022-03-01

Applicant: Google LLC

Inventor： Ioannis Agiomyrgiannakis , Fergus James Henderson

IPC: G10L13/10 , G10L13/033 , G06F3/16 , G10L21/013

CPC classification number: G10L13/033 , G06F3/167 , G10L13/10 , G10L2021/0135

Abstract: A device may identify a plurality of sources for outputs that the device is configured to provide. The plurality of sources may include at least one of a particular application in the device, an operating system of the device, a particular area within a display of the device, or a particular graphical user interface object. The device may also assign a set of distinct voices to respective sources of the plurality of sources. The device may also receive a request for speech output. The device may also select a particular source that is associated with the requested speech output. The device may also generate speech having particular voice characteristics of a particular voice assigned to the particular source.

4.

发明申请
Devices and Methods for a Speech-Based User Interface 有权

公开(公告)号：US20220270586A1

公开(公告)日：2022-08-25

申请号：US17653005

申请日：2022-03-01

Applicant: Google LLC

Inventor： Ioannis Agiomyrgiannakis , Fergus James Henderson

IPC: G10L13/033 , G06F3/16 , G10L13/10

Abstract: A device may identify a plurality of sources for outputs that the device is configured to provide. The plurality of sources may include at least one of a particular application in the device, an operating system of the device, a particular area within a display of the device, or a particular graphical user interface object. The device may also assign a set of distinct voices to respective sources of the plurality of sources. The device may also receive a request for speech output. The device may also select a particular source that is associated with the requested speech output. The device may also generate speech having particular voice characteristics of a particular voice assigned to the particular source.

5.

发明授权
Devices and methods for a speech-based user interface 有权

公开(公告)号：US10720146B2

公开(公告)日：2020-07-21

申请号：US15874051

申请日：2018-01-18

Applicant: Google LLC

Inventor： Ioannis Agiomyrgiannakis , Fergus James Henderson

IPC: G10L13/033 , G06F3/16 , G10L13/10 , G10L21/013

Abstract: A device may identify a plurality of sources for outputs that the device is configured to provide. The plurality of sources may include at least one of a particular application in the device, an operating system of the device, a particular area within a display of the device, or a particular graphical user interface object. The device may also assign a set of distinct voices to respective sources of the plurality of sources. The device may also receive a request for speech output. The device may also select a particular source that is associated with the requested speech output. The device may also generate speech having particular voice characteristics of a particular voice assigned to the particular source.

6.

发明申请
SPEECH SYNTHESIS UNIT SELECTION 审中-公开

公开(公告)号：US20180268807A1

公开(公告)日：2018-09-20

申请号：US15824122

申请日：2017-11-28

Applicant: Google LLC

Inventor： Ioannis Agiomyrgiannakis

IPC: G10L13/07 , G10L13/047 , G10L13/08

CPC classification number: G10L13/07 , G10L13/047 , G10L13/06 , G10L13/08

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for selecting units for speech synthesis. One of the methods includes determining a sequence of text units that each represent a respective portion of text for speech synthesis; and determining multiple paths of speech units that each represent the sequence of text units by selecting a first speech unit that includes speech synthesis data representing a first text unit; selecting multiple second speech units including speech synthesis data representing a second text unit based on (i) a join cost to concatenate the second speech unit with a first speech unit and (ii) a target cost indicating a degree that the second speech unit corresponds to the second text unit; and defining paths from the selected first speech unit to each of the multiple second speech units to include in the multiple paths of speech units.

7.

发明申请
Devices and Methods for a Speech-Based User Interface 审中-公开

公开(公告)号：US20180144737A1

公开(公告)日：2018-05-24

申请号：US15874051

申请日：2018-01-18

Applicant: Google LLC

Inventor： Ioannis Agiomyrgiannakis , Fergus James Henderson

IPC: G10L13/033 , G06F3/16 , G10L21/013

CPC classification number: G10L13/033 , G06F3/167 , G10L13/10 , G10L2021/0135

Abstract: A device may identify a plurality of sources for outputs that the device is configured to provide. The plurality of sources may include at least one of a particular application in the device, an operating system of the device, a particular area within a display of the device, or a particular graphical user interface object. The device may also assign a set of distinct voices to respective sources of the plurality of sources. The device may also receive a request for speech output. The device may also select a particular source that is associated with the requested speech output. The device may also generate speech having particular voice characteristics of a particular voice assigned to the particular source.

8.

发明申请
END-TO-END TEXT-TO-SPEECH CONVERSION 有权

公开(公告)号：US20210366463A1

公开(公告)日：2021-11-25

申请号：US17391799

申请日：2021-08-02

Applicant: Google LLC

Inventor： Samuel Bengio , Yuxuan Wang , Zongheng Yang , Zhifeng Chen , Yonghui Wu , Ioannis Agiomyrgiannakis , Ron J. Weiss , Navdeep Jaitly , Ryan M. Rifkin , Robert Andrew James Clark , Quoc V. Le , Russell J. Ryan , Ying Xiao

IPC: G10L13/08 , G06N3/08 , G10L25/18 , G10L25/30 , G10L13/04 , G06N3/04 , G10L15/16

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating speech from text. One of the systems includes one or more computers and one or more storage devices storing instructions that when executed by one or more computers cause the one or more computers to implement: a sequence-to-sequence recurrent neural network configured to: receive a sequence of characters in a particular natural language, and process the sequence of characters to generate a spectrogram of a verbal utterance of the sequence of characters in the particular natural language; and a subsystem configured to: receive the sequence of characters in the particular natural language, and provide the sequence of characters as input to the sequence-to-sequence recurrent neural network to obtain as output the spectrogram of the verbal utterance of the sequence of characters in the particular natural language.

9.

发明申请
END-TO-END TEXT-TO-SPEECH CONVERSION 审中-公开

公开(公告)号：US20200098350A1

公开(公告)日：2020-03-26

申请号：US16696101

申请日：2019-11-26

Applicant: Google LLC

Inventor： Samuel Bengio , Yuxuan Wang , Zongheng Yang , Zhifeng Chen , Yonghui Wu , Ioannis Agiomyrgiannakis , Ron J. Weiss , Navdeep Jaitly , Ryan M. Rifkin , Robert Andrew James Clark , Quoc V. Le , Russell J. Ryan , Ying Xiao

IPC: G10L13/08 , G10L15/16 , G06N3/08 , G06N3/04 , G10L13/04 , G10L25/30 , G10L25/18

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating speech from text. One of the systems includes one or more computers and one or more storage devices storing instructions that when executed by one or more computers cause the one or more computers to implement: a sequence-to-sequence recurrent neural network configured to: receive a sequence of characters in a particular natural language, and process the sequence of characters to generate a spectrogram of a verbal utterance of the sequence of characters in the particular natural language; and a subsystem configured to: receive the sequence of characters in the particular natural language, and provide the sequence of characters as input to the sequence-to-sequence recurrent neural network to obtain as output the spectrogram of the verbal utterance of the sequence of characters in the particular natural language.

10.

发明申请
END-TO-END TEXT-TO-SPEECH CONVERSION 审中-公开

公开(公告)号：US20190311708A1

公开(公告)日：2019-10-10

申请号：US16447862

申请日：2019-06-20

Applicant: Google LLC

Inventor： Samy Bengio , Yuxuan Wang , Zongheng Yang , Zhifeng Chen , Yonghui Wu , Ioannis Agiomyrgiannakis , Ron J. Weiss , Navdeep Jaitly , Ryan M. Rifkin , Robert Andrew James Clark , Quoc V. Le , Russell J. Ryan , Ying Xiao

IPC: G10L13/08 , G10L25/18 , G10L25/30 , G06N3/08

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating speech from text. One of the systems includes one or more computers and one or more storage devices storing instructions that when executed by one or more computers cause the one or more computers to implement: a sequence-to-sequence recurrent neural network configured to: receive a sequence of characters in a particular natural language, and process the sequence of characters to generate a spectrogram of a verbal utterance of the sequence of characters in the particular natural language; and a subsystem configured to: receive the sequence of characters in the particular natural language, and provide the sequence of characters as input to the sequence-to-sequence recurrent neural network to obtain as output the spectrogram of the verbal utterance of the sequence of characters in the particular natural language.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification