-
公开(公告)号:US12249313B2
公开(公告)日:2025-03-11
申请号:US17914010
申请日:2020-10-27
Applicant: GOOGLE LLC
Inventor: Michael Hassid , Sapir Caduri , Nadav Bar , Danielle Cohen , Benny Schlesinger , Michelle Tadmor Ramanovich
Abstract: A method and system is disclosed for speech synthesis of streaming text. At a text-to-speech (“ITS) system, a real-time streaming text string having a starting point and an ending point may be received, and a first sub-string comprising a first portion of the text string received from an initial point to a first trigger point may be accumulated. The initial point is no earlier than the starting point and is prior to the first trigger point, and the first trigger point is no further than the ending point. A punctuation model of the ITS system may be applied to the first sub-string to generate a pre-processed first sub-string comprising the first sub-string with added grammatical punctuation as determined by the punctuation model. TTS synthesis processing may be applied to at least the pre-processed first sub-string to generate first synthesized speech, and audio play out of the first synthesized speech produced.