-
公开(公告)号:US11011184B2
公开(公告)日:2021-05-18
申请号:US16685187
申请日:2019-11-15
Applicant: GOOGLE LLC
Inventor: Sourish Chaudhuri , Nebojsa Ciric , Khiem Pham
Abstract: The technology disclosed herein may determine timing windows for speech captions of an audio stream. In one example, the technology may involve accessing audio data comprising a plurality of segments; determining, by a processing device, that one or more of the plurality of segments comprise speech sounds; identifying a time duration for the speech sounds; and providing a user interface element corresponding to the time duration for the speech sounds, wherein the user interface element indicates an estimate of a beginning and ending of the speech sounds and is configured to receive caption text associated with the speech sounds of the audio data.
-
公开(公告)号:US20200090678A1
公开(公告)日:2020-03-19
申请号:US16685187
申请日:2019-11-15
Applicant: GOOGLE LLC
Inventor: Sourish Chaudhuri , Nebojsa Ciric , Khiem Pham
Abstract: The technology disclosed herein may determine timing windows for speech captions of an audio stream. In one example, the technology may involve accessing audio data comprising a plurality of segments; determining, by a processing device, that one or more of the plurality of segments comprise speech sounds; identifying a time duration for the speech sounds; and providing a user interface element corresponding to the time duration for the speech sounds, wherein the user interface element indicates an estimate of a beginning and ending of the speech sounds and is configured to receive caption text associated with the speech sounds of the audio data.
-