-
公开(公告)号:US20230267926A1
公开(公告)日:2023-08-24
申请号:US17676170
申请日:2022-02-20
Applicant: GOOGLE LLC
Inventor: Dirk Padfield , Noah Murad , Edward Lo , Bryan Huh
IPC: G10L15/187 , G06F40/166 , G10L15/22 , G10L15/02 , G10L15/06
CPC classification number: G10L15/187 , G06F40/166 , G10L15/22 , G10L15/02 , G10L15/063 , G10L2015/025
Abstract: An automated speech recognition (ASR) transcript of at least a portion of a media content is obtained from an ASR tool. Suggested words are received for corrected words of the ASR transcript of the media content. Features are obtained using at least the suggested words or the corrected words. The features include features relating to sound similarities between the suggested words and the corrected words. The features are input into a machine learning (ML) model to obtain a determination regarding a validity of the suggested words. Responsive to the suggested words constituting a valid suggestion, the suggested words are incorporated into the ASR transcript. At least a portion of the ASR transcript is transmitted to a user device in conjunction with at least a portion of the media content.
-
公开(公告)号:US12254874B2
公开(公告)日:2025-03-18
申请号:US17676170
申请日:2022-02-20
Applicant: GOOGLE LLC
Inventor: Dirk Padfield , Noah Murad , Edward Lo , Bryan Huh
IPC: G10L15/187 , G06F40/166 , G10L15/02 , G10L15/06 , G10L15/22
Abstract: An automated speech recognition (ASR) transcript of at least a portion of a media content is obtained from an ASR tool. Suggested words are received for corrected words of the ASR transcript of the media content. Features are obtained using at least the suggested words or the corrected words. The features include features relating to sound similarities between the suggested words and the corrected words. The features are input into a machine learning (ML) model to obtain a determination regarding a validity of the suggested words. Responsive to the suggested words constituting a valid suggestion, the suggested words are incorporated into the ASR transcript. At least a portion of the ASR transcript is transmitted to a user device in conjunction with at least a portion of the media content.
-