Speech detection using image classification

Invention Grant

US12073837B2 Speech detection using image classification 有权

Please log in to see more content

Patent Title: Speech detection using image classification
Application No.: US17805822

Application Date: 2022-06-07
Publication No.: US12073837B2

Publication Date: 2024-08-27
Inventor: Stephen Gregory Dame , Les Eugene Atlas
Applicant: The Boeing Company , University of Washington
Applicant Address: US IL Chicago
Assignee: The Boeing Company,University of Washington
Current Assignee: The Boeing Company,University of Washington
Current Assignee Address: US VA Arlington; US WA Seattle
Agency: Alleman Hall & Tuttle LLP
Main IPC: G10L15/24
IPC: G10L15/24 ; G06V10/82 ; G10L15/05

Speech detection using image classification

Abstract:

Speech detection can be achieved by identifying a speech segment within an audio segment using image classification. An audio segment of radio communications is obtained. An audio sub-segment within the audio segment is extracted. A sampled histogram is generated of a plurality of sampled values across a sampled time window of the audio sub-segment. A two-dimensional image is generated that represents a two-dimensional mapping of the sampled histogram along a first dimension and a predefined histogram along a second dimension that is orthogonal to the first dimension. The two-dimensional image is provided to an image classifier previously trained using the predefined histogram. An output is received from the image classifier based on the two-dimensional image. The output indicates whether the audio sub-segment contains speech.

Public/Granted literature

US20220406310A1 SPEECH DETECTION USING IMAGE CLASSIFICATION Public/Granted day:2022-12-22

Information query

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L15/00	语音识别（G10L17/00优先）
G10L15/24	.利用非声学特征的语音识别