专利检索 ap:("Google LLC") AND inv:"Omer Ahmed Siddig Osman" 第 1 页

1.

发明公开
Machine Learning Based Enhancement of Audio for a Voice Call 审中-公开

公开(公告)号：US20240153514A1

公开(公告)日：2024-05-09

申请号：US18548949

申请日：2021-03-05

申请人： Google LLC

发明人： Omer Ahmed Siddig Osman , Dominik Roblek , Yunpeng Li , Marco Tagliasacchi , Oleg Rybakov , Victor Ungureanu , Eric Giguere

IPC分类号： G10L19/06 , G10L19/16 , G10L25/30 , G10L25/69

CPC分类号： G10L19/06 , G10L19/167 , G10L25/30 , G10L25/69

摘要： Apparatus and methods related to enhancement of audio content are provided. An example method includes receiving, by a computing device and via a communications network interface, a compressed audio data frame, wherein the compressed audio data frame is received after transmission over a communications network, The method further includes decompressing the compressed audio data frame to extract an audio waveform. The method also includes predicting, by applying a neural network to the audio waveform, an enhanced version of the audio waveform, wherein the neural network has been trained on (i) a ground truth sample comprising unencoded audio waveforms prior to compression by an audio encoder, and (ii) a training dataset comprising decoded audio waveforms after compression of the unencoded audio waveforms by the audio encoder. The method additionally includes providing, by an audio output component of the computing device, the enhanced version of the audio waveform.

搜索结果

国家/区域

专利有效性

申请日

公布(公告)日

申请人

申请人所在国/区域

发明人

IPC

IPC部

IPC大类

IPC小类

IPC大组

IPC小组

外观分类