-
Publication No.: US10192552B2
Publication Date: 2019-01-29
Application No.: US15266932
Filing Date: 2016-09-15
Applicant: Apple Inc.
Inventor: Tuomo J. Raitio, Melvyn J. Hunt, Hywel B. Richards, Madhusudan Chinthakunta
IPC: G10L15/22, G10L13/033, G10L25/18, G10L25/24
Abstract: Systems and processes for detecting and/or providing a whispered speech response are provided. In one example process, speech input is received from a user, and it is determined, based on the speech input, that a whispered speech response is to be provided. Upon determining that a whispered speech response is to be provided, the whispered speech response is generated and provided to the user.
-
Publication No.: US09934775B2
Publication Date: 2018-04-03
Application No.: US15266930
Filing Date: 2016-09-15
Applicant: Apple Inc.
Inventor: Tuomo J. Raitio, Kishore Sunkeswari Prahallad, Alistair D. Conkie, Ladan Golipour, David A. Winarsky
IPC: G10L13/10, G10L13/033, G10L13/06
CPC classification number: G10L13/10, G10L13/0335, G10L13/06, G10L13/07
Abstract: Systems and processes for performing unit-selection text-to-speech synthesis are provided. In an example process, text to be converted to speech is received. The text is represented as a sequence of target units. A plurality of candidate speech segments corresponding to the sequence of target units are selected. Predicted statistical parameters of acoustic features associated with the sequence of target units are determined. The predicted statistical parameters of acoustic features are used to determine target costs and concatenation costs associated with the plurality of candidate speech segments. Based on a combined cost determined from the target costs and concatenation costs, a subset of candidate speech segments is selected from the plurality of candidate speech segments. Speech corresponding to the received text is generated using the subset of candidate speech segments.
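Illustrative sketch (not taken from the patent): the selection step in the abstract above can be pictured as a shortest-path search over candidate segments, where each candidate carries a target cost and each adjacent pair carries a concatenation cost. The Python below is a minimal sketch under stated assumptions. The `Candidate` class, the single scalar feature per segment boundary, the variance-normalised squared-error costs, the weights, and the use of a Viterbi-style dynamic program are all hypothetical choices made for illustration; the abstract does not specify any of them.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Candidate:
    """Hypothetical candidate speech segment with one scalar acoustic feature."""
    label: str
    start_feat: float  # feature value at the segment start (assumed, e.g. pitch)
    end_feat: float    # feature value at the segment end


def target_cost(cand: Candidate, mean: float, var: float) -> float:
    # Assumed form: squared distance of the segment midpoint feature from the
    # predicted mean, normalised by the predicted variance.
    mid = 0.5 * (cand.start_feat + cand.end_feat)
    return (mid - mean) ** 2 / var


def concat_cost(prev: Candidate, nxt: Candidate, var: float) -> float:
    # Assumed form: squared feature mismatch at the join, normalised by the
    # predicted variance of the following target unit.
    return (prev.end_feat - nxt.start_feat) ** 2 / var


def select_units(
    candidates: List[List[Candidate]],
    predicted: List[Tuple[float, float]],  # (mean, variance) per target unit
    w_target: float = 1.0,
    w_concat: float = 1.0,
) -> List[Candidate]:
    """Return one candidate per target unit minimising the combined cost.

    Assumes every target unit has at least one candidate.
    """
    n = len(candidates)
    best = [[0.0] * len(c) for c in candidates]  # best cumulative cost so far
    back = [[-1] * len(c) for c in candidates]   # back-pointers for the path

    for j, cand in enumerate(candidates[0]):
        best[0][j] = w_target * target_cost(cand, *predicted[0])

    for i in range(1, n):
        mean, var = predicted[i]
        for j, cand in enumerate(candidates[i]):
            costs = [
                best[i - 1][k] + w_concat * concat_cost(prev, cand, var)
                for k, prev in enumerate(candidates[i - 1])
            ]
            k_best = min(range(len(costs)), key=costs.__getitem__)
            best[i][j] = costs[k_best] + w_target * target_cost(cand, mean, var)
            back[i][j] = k_best

    # Trace back from the cheapest final candidate.
    j = min(range(len(best[-1])), key=best[-1].__getitem__)
    path = []
    for i in range(n - 1, -1, -1):
        path.append(candidates[i][j])
        j = back[i][j]
    return path[::-1]
```

Calling `select_units` with one candidate list per target unit and one `(mean, variance)` pair per unit returns the chosen segment for each unit in order; concatenating the corresponding audio would then be a separate step that this sketch does not cover.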
-