-
公开(公告)号:US12190871B1
公开(公告)日:2025-01-07
申请号:US17468415
申请日:2021-09-07
Applicant: Amazon Technologies, Inc.
Inventor: Christian Garcia Siagian , Charles Effinger , Nicholas Ren-Jie Capel , Jobel Kyle Petallana Vecino , Gordon Zheng , Kymry Michael Burwell , Stephen Andrew Low
IPC: G10L15/04 , G06Q30/0241 , G10L15/16 , G10L15/18
Abstract: Techniques and methods are disclosed for detecting long-form audio content in one or more audio files. A computing system receives first audio data corresponding to a first version of an audio file and second audio data corresponding to a second version of the audio file. The computing system generates a first transcript of the first audio data and a second transcript of the second audio data. The computing system compares the first audio data and the second audio data and the first transcript and the second transcript to identify advertisement portions and content portions of the audio data. Using a semantic model based on a machine learning (ML) transformer, the computing system can determine advertisement segments within the advertisement portions, the advertisement segments corresponding to separate advertisements. Information corresponding to the duration and location of the advertisement segments is stored in a data store of the computing system.