Invention Grant
- Patent Title: Speech detection using image classification
- Application No.: US17805822
- Application Date: 2022-06-07
- Publication No.: US12073837B2
- Publication Date: 2024-08-27
- Inventor: Stephen Gregory Dame, Les Eugene Atlas
- Applicant: The Boeing Company, University of Washington
- Applicant Address: US IL Chicago
- Assignee: The Boeing Company, University of Washington
- Current Assignee: The Boeing Company, University of Washington
- Current Assignee Address: US VA Arlington; US WA Seattle
- Agency: Alleman Hall & Tuttle LLP
- Main IPC: G10L15/24
- IPC: G10L15/24; G06V10/82; G10L15/05

Abstract:
Speech detection can be achieved by identifying a speech segment within an audio segment using image classification. An audio segment of radio communications is obtained. An audio sub-segment within the audio segment is extracted. A sampled histogram is generated of a plurality of sampled values across a sampled time window of the audio sub-segment. A two-dimensional image is generated that represents a two-dimensional mapping of the sampled histogram along a first dimension and a predefined histogram along a second dimension that is orthogonal to the first dimension. The two-dimensional image is provided to an image classifier previously trained using the predefined histogram. An output is received from the image classifier based on the two-dimensional image. The output indicates whether the audio sub-segment contains speech.
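
The preprocessing steps named in the abstract (sub-segment extraction, sampled histogram, two-dimensional mapping) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the patent: the function names, the bin count, the Gaussian-shaped predefined histogram, and the choice of an outer product as the two-dimensional mapping. The trained image classifier is not included.

```python
import math

def sub_segments(audio, window, hop):
    """Yield fixed-length sub-segments of an audio sample sequence."""
    for start in range(0, len(audio) - window + 1, hop):
        yield audio[start:start + window]

def sampled_histogram(samples, bins=16, lo=-1.0, hi=1.0):
    """Normalized histogram of sample values that fall within [lo, hi]."""
    counts = [0] * bins
    width = (hi - lo) / bins
    for s in samples:
        if lo <= s <= hi:
            idx = min(int((s - lo) / width), bins - 1)
            counts[idx] += 1
    total = sum(counts)
    return [c / total for c in counts] if total else [0.0] * bins

def gaussian_histogram(bins=16, sigma=0.25, lo=-1.0, hi=1.0):
    """Hypothetical predefined histogram: a normalized Gaussian shape."""
    width = (hi - lo) / bins
    centers = [lo + (i + 0.5) * width for i in range(bins)]
    weights = [math.exp(-0.5 * (c / sigma) ** 2) for c in centers]
    total = sum(weights)
    return [w / total for w in weights]

def histogram_image(sampled, predefined):
    """Assumed 2D mapping: outer product of the two histograms.

    Rows follow the sampled histogram, columns the predefined one.
    """
    return [[s * p for p in predefined] for s in sampled]

# Synthetic stand-in for radio audio: one second of a 440 Hz tone at 8 kHz.
audio = [0.5 * math.sin(2 * math.pi * 440 * n / 8000) for n in range(8000)]
reference = gaussian_histogram()
images = [
    histogram_image(sampled_histogram(seg), reference)
    for seg in sub_segments(audio, window=400, hop=200)
]
```

Each entry of `images` is a 16 x 16 grid that could then be passed to an image classifier trained with the same predefined histogram, which would report whether the corresponding sub-segment contains speech.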
Public/Granted literature
- US20220406310A1 SPEECH DETECTION USING IMAGE CLASSIFICATION (Public/Granted day: 2022-12-22)