Devices and methods for use of phase information in speech synthesis systems

    Publication No.: US09865247B2

    Publication date: 2018-01-09

    Application No.: US14631583

    Filing date: 2015-02-25

    Applicant: Google LLC

    CPC classification number: G10L13/02 G10L13/08 G10L25/75

    Abstract: A device may receive a speech signal. The device may determine acoustic feature parameters for the speech signal. The acoustic feature parameters may include phase data. The device may determine circular space representations for the phase data based on an alignment of the phase data with given axes of the circular space representations. The device may map the phase data to linguistic features based on the circular space representations. The linguistic features may be associated with linguistic content that includes phonemic content or text content. The device may provide a synthetic audio pronunciation of the linguistic content based on the mapping.
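The abstract does not say how the circular space representations are built. A minimal sketch, assuming each phase value is projected onto the cosine and sine axes of a unit circle (the function names below are hypothetical and not taken from the patent), shows why such a representation can help: phases just on either side of the wrap-around point end up close together.

```python
import math

def phase_to_circular(phases):
    """Map each phase angle (radians) to a point (cos, sin) on the unit circle.

    Assumption: the 'given axes' are the cosine and sine axes.
    """
    return [(math.cos(p), math.sin(p)) for p in phases]

def circular_to_phase(points):
    """Recover phase angles in [-pi, pi] from unit-circle points."""
    return [math.atan2(y, x) for x, y in points]

def circular_distance(a, b):
    """Euclidean (chord) distance between two unit-circle points."""
    return math.hypot(a[0] - b[0], a[1] - b[1])

# Two phases that are numerically far apart but nearly identical angles.
near_pi = math.pi - 0.01
near_minus_pi = -math.pi + 0.01
pts = phase_to_circular([near_pi, near_minus_pi])

raw_gap = abs(near_pi - near_minus_pi)          # large: almost 2*pi
circ_gap = circular_distance(pts[0], pts[1])    # small: the points nearly coincide
```

In this sketch a model trained on the `(cos, sin)` pairs would not see an artificial jump at the wrap-around point, and `circular_to_phase` recovers the angle when a waveform has to be generated.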

    Devices and methods for a speech-based user interface

    Publication No.: US12154543B2

    Publication date: 2024-11-26

    Application No.: US18479785

    Filing date: 2023-10-02

    Applicant: Google LLC

    Abstract: A device may identify a plurality of sources for outputs that the device is configured to provide. The plurality of sources may include at least one of a particular application in the device, an operating system of the device, a particular area within a display of the device, or a particular graphical user interface object. The device may also assign a set of distinct voices to respective sources of the plurality of sources. The device may also receive a request for speech output. The device may also select a particular source that is associated with the requested speech output. The device may also generate speech having particular voice characteristics of a particular voice assigned to the particular source.
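The flow in this abstract (identify sources, assign distinct voices, look up the voice for a speech request) can be sketched as follows. This is an illustration under stated assumptions, not the patented implementation: the `Voice` fields, the `VoiceRegistry` class, and the source identifiers are all hypothetical, and the "generated speech" is reduced to a dictionary of the parameters a synthesizer would receive.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Voice:
    # Hypothetical voice characteristics; the abstract does not list any.
    name: str
    pitch_hz: float
    rate: float

class VoiceRegistry:
    """Assigns each output source its own voice from a fixed pool."""

    def __init__(self, voices):
        self._available = list(voices)
        self._assigned = {}

    def assign(self, source_id):
        # Reuse the existing assignment so a source keeps a stable voice.
        if source_id in self._assigned:
            return self._assigned[source_id]
        if not self._available:
            raise RuntimeError("no distinct voice left to assign")
        voice = self._available.pop(0)
        self._assigned[source_id] = voice
        return voice

    def voice_for(self, source_id):
        return self._assigned[source_id]

def speak(registry, source_id, text):
    """Return the synthesis request for text coming from source_id."""
    voice = registry.voice_for(source_id)
    return {"text": text, "voice": voice.name,
            "pitch_hz": voice.pitch_hz, "rate": voice.rate}

registry = VoiceRegistry([Voice("alto", 190.0, 1.0), Voice("bass", 95.0, 0.9)])
registry.assign("operating_system")
registry.assign("mail_app")
request = speak(registry, "mail_app", "You have one new message.")
```

Keeping the assignment in one registry means a listener can tell from the voice alone whether a prompt came from the operating system or from an application.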

    Devices and methods for a speech-based user interface

    Publication No.: US11282496B2

    Publication date: 2022-03-22

    Application No.: US16900839

    Filing date: 2020-06-12

    Applicant: Google LLC

    Abstract: A device may identify a plurality of sources for outputs that the device is configured to provide. The plurality of sources may include at least one of a particular application in the device, an operating system of the device, a particular area within a display of the device, or a particular graphical user interface object. The device may also assign a set of distinct voices to respective sources of the plurality of sources. The device may also receive a request for speech output. The device may also select a particular source that is associated with the requested speech output. The device may also generate speech having particular voice characteristics of a particular voice assigned to the particular source.

    Speech synthesis unit selection
    Granted invention patent

    Publication No.: US10923103B2

    Publication date: 2021-02-16

    Application No.: US15824122

    Filing date: 2017-11-28

    Applicant: Google LLC

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for selecting units for speech synthesis. One of the methods includes determining a sequence of text units that each represent a respective portion of text for speech synthesis; and determining multiple paths of speech units that each represent the sequence of text units by selecting a first speech unit that includes speech synthesis data representing a first text unit; selecting multiple second speech units including speech synthesis data representing a second text unit based on (i) a join cost to concatenate the second speech unit with a first speech unit and (ii) a target cost indicating a degree that the second speech unit corresponds to the second text unit; and defining paths from the selected first speech unit to each of the multiple second speech units to include in the multiple paths of speech units.
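The path-building step described in this abstract resembles a beam search over a lattice of candidate units. The sketch below is one possible reading, not the claimed method: the `Unit` fields, the two cost functions, and the `beam_width` parameter are assumptions made for illustration. It picks a first unit, then scores every candidate for the next text unit by join cost plus target cost and keeps several of the cheapest extensions as separate paths.

```python
import heapq
from collections import namedtuple

# Hypothetical speech unit: an id, the phone it realizes, and a pitch value.
Unit = namedtuple("Unit", ["name", "phone", "pitch"])

def target_cost(unit, text_unit):
    """Assumed target cost: 0 if the unit realizes the text unit, else 1."""
    return 0.0 if unit.phone == text_unit else 1.0

def join_cost(prev, unit):
    """Assumed join cost: pitch mismatch at the concatenation point."""
    return abs(prev.pitch - unit.pitch) / 100.0

def expand_paths(text_units, inventory, beam_width=2):
    """Return up to beam_width (cost, path) pairs covering text_units."""
    first = min(inventory[text_units[0]],
                key=lambda u: target_cost(u, text_units[0]))
    paths = [(target_cost(first, text_units[0]), [first])]
    for text_unit in text_units[1:]:
        extended = []
        for cost, path in paths:
            for unit in inventory[text_unit]:
                step = join_cost(path[-1], unit) + target_cost(unit, text_unit)
                extended.append((cost + step, path + [unit]))
        # Keep several cheapest paths rather than only the single best one.
        paths = heapq.nsmallest(beam_width, extended, key=lambda p: p[0])
    return paths

inventory = {
    "h": [Unit("h1", "h", 110), Unit("h2", "h", 150)],
    "i": [Unit("i1", "i", 112), Unit("i2", "i", 140), Unit("i3", "i", 200)],
}
paths = expand_paths(["h", "i"], inventory)
best_names = [u.name for u in paths[0][1]]
```

Keeping more than one path means a unit that looks slightly worse now can still win later if it joins more smoothly with a following unit.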
