专利检索 ap:("Apple Inc.") AND inv:"Ramya RASIPURAM" 第 1 页

1.

发明申请
GENERATING GENRE APPROPRIATE VOICES FOR AUDIO BOOKS 有权

公开(公告)号：US20230134970A1

公开(公告)日：2023-05-04

申请号：US17977360

申请日：2022-10-31

申请人： Apple Inc.

发明人： Ramya RASIPURAM , William BECKMAN , Ladan GOLIPOUR , David A. WINARSKY , Cheng-Chieh YEH , Weicheng ZHANG

IPC分类号： G10L13/10 , G06F40/30 , G06F40/284 , G10L13/033

摘要： Systems and processes for generating audio books from text are provided. An example process includes, at an electronic device having one or more processors and memory: receiving a text including at least a first subset and a second subset, wherein at least a portion of the first subset overlaps with at least a portion of the second subset; determining, based on the text, a prosody for a speech output, wherein the prosody is representative of a genre; determining a semantic meaning of the text; and generating, based on the prosody and the semantic meaning, the speech output of the text.

2.

发明申请
TEXT NORMALIZATION BASED ON A DATA-DRIVEN LEARNING NETWORK 审中-公开

公开(公告)号：US20180330729A1

公开(公告)日：2018-11-15

申请号：US15673574

申请日：2017-08-10

申请人： Apple Inc.

发明人： Ladan GOLIPOUR , Matthias NEERACHER , Ramya RASIPURAM

IPC分类号： G10L15/22 , G10L15/30

CPC分类号： G10L15/22 , G10L15/16 , G10L15/1815 , G10L15/26 , G10L15/30 , G10L2013/083

摘要： Systems and processes for operating an intelligent automated assistant to perform text-to-speech conversion are provided. An example method includes, at an electronic device having one or more processors, receiving a text corpus comprising unstructured natural language text. The method further includes generating a sequence of normalized text based on the received text corpus; and generating a pronunciation sequence representing the sequence of the normalized text. The method further includes causing an audio output to be provided to the user based on the pronunciation sequence. At least one of the sequence of normalized text and the pronunciation sequence is generated based on a data-driven learning network.