-
Publication No.: US20230134970A1
Publication Date: 2023-05-04
Application No.: US17977360
Filing Date: 2022-10-31
Applicant: Apple Inc.
Inventors: Ramya RASIPURAM, William BECKMAN, Ladan GOLIPOUR, David A. WINARSKY, Cheng-Chieh YEH, Weicheng ZHANG
IPC Classification: G10L13/10, G06F40/30, G06F40/284, G10L13/033
Abstract: Systems and processes for generating audio books from text are provided. An example process includes, at an electronic device having one or more processors and memory: receiving a text including at least a first subset and a second subset, wherein at least a portion of the first subset overlaps with at least a portion of the second subset; determining, based on the text, a prosody for a speech output, wherein the prosody is representative of a genre; determining a semantic meaning of the text; and generating, based on the prosody and the semantic meaning, the speech output of the text.
-
Publication No.: US20180330729A1
Publication Date: 2018-11-15
Application No.: US15673574
Filing Date: 2017-08-10
Applicant: Apple Inc.
CPC Classification: G10L15/22, G10L15/16, G10L15/1815, G10L15/26, G10L15/30, G10L2013/083
Abstract: Systems and processes for operating an intelligent automated assistant to perform text-to-speech conversion are provided. An example method includes, at an electronic device having one or more processors, receiving a text corpus comprising unstructured natural language text. The method further includes generating a sequence of normalized text based on the received text corpus; and generating a pronunciation sequence representing the sequence of the normalized text. The method further includes causing an audio output to be provided to the user based on the pronunciation sequence. At least one of the sequence of normalized text and the pronunciation sequence is generated based on a data-driven learning network.
-