Invention Grant
- Patent Title: Segmenting and classifying video content using sounds
- Application No.: US16711797
- Application Date: 2019-12-12
- Publication No.: US11342003B1
- Publication Date: 2022-05-24
- Inventor: Christian Garcia Siagian, Christian Ciabattoni, David Niu, Lawrence Kyuil Chang, Gordon Zheng, Ritesh Pase, Shiva Krishnamurthy, Ramakanth Mudumba
- Applicant: Amazon Technologies, Inc.
- Applicant Address: US WA Seattle
- Assignee: Amazon Technologies, Inc.
- Current Assignee: Amazon Technologies, Inc.
- Current Assignee Address: US WA Seattle
- Agency: Thomas | Horstemeyer, LLP
- Main IPC: G11B27/30
- IPC: G11B27/30; G10L25/57; G10L15/08; G10L15/26; G06V20/40; G06V40/16

Abstract:
Disclosed are various embodiments for segmenting and classifying video content using sounds. In one embodiment, a plurality of segments of a video content item are generated by analyzing audio accompanying the video content item. A subset of the plurality of segments that correspond to music segments is selected based at least in part on an audio characteristic of the subset of the plurality of segments. Individual segments of the subset of the plurality of segments are processed to determine whether a classification applies to the individual segments. A list of segments of the video content item to which the classification applies is generated.
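
The four steps in the abstract (segment by audio, select music segments, classify each one, list the matches) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the patented method: the per-frame "music score", the jump-based boundary rule, the 0.5 threshold, and every function name here are hypothetical, and the classifier is a caller-supplied predicate because the abstract does not say which classification is applied.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence, Tuple


@dataclass
class Segment:
    start: float        # segment start, in seconds
    end: float          # segment end, in seconds
    music_score: float  # hypothetical audio characteristic in [0, 1]


def segment_by_audio(frame_scores: Sequence[float],
                     frame_sec: float = 1.0,
                     jump: float = 0.3) -> List[Segment]:
    """Split the audio wherever the per-frame score changes by more than `jump`.

    Assumption: one score per fixed-length audio frame is already available.
    """
    segments: List[Segment] = []
    if not frame_scores:
        return segments
    start = 0
    for i in range(1, len(frame_scores) + 1):
        at_end = i == len(frame_scores)
        if at_end or abs(frame_scores[i] - frame_scores[i - 1]) > jump:
            chunk = frame_scores[start:i]
            mean_score = sum(chunk) / len(chunk)
            segments.append(Segment(start * frame_sec, i * frame_sec, mean_score))
            start = i
    return segments


def select_music(segments: List[Segment], threshold: float = 0.5) -> List[Segment]:
    """Keep the subset of segments whose audio characteristic suggests music."""
    return [s for s in segments if s.music_score >= threshold]


def list_classified(segments: List[Segment],
                    applies: Callable[[Segment], bool]) -> List[Tuple[float, float]]:
    """Return (start, end) for each segment to which the classification applies."""
    return [(s.start, s.end) for s in segments if applies(s)]


# Made-up per-second scores for a 10-second clip.
scores = [0.1, 0.1, 0.9, 0.8, 0.9, 0.2, 0.9, 0.9, 0.9, 0.9]
all_segments = segment_by_audio(scores)
music_segments = select_music(all_segments)
# Hypothetical classification: music segments lasting at least 4 seconds.
matches = list_classified(music_segments, lambda s: s.end - s.start >= 4.0)
print(matches)
```

In a real system the predicate passed to `list_classified` would wrap whatever per-segment model decides the classification; the example keeps it as a plain function so the stages stay independent of one another.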