-
Publication No.: EP3963901A1
Publication date: 2022-03-09
Application No.: EP20719803.7
Filing date: 2020-03-17
-
Publication No.: EP4345816A2
Publication date: 2024-04-03
Application No.: EP24156964.9
Filing date: 2020-03-19
Inventors: YOSHIOKA, Takuya; STOLCKE, Andreas; CHEN, Zhuo; DIMITRIADIS, Dimitrios Basile; ZENG, Nanshan; QIN, Lijuan; HINTHORN, William Isaac; HUANG, Xuedong
IPC classification: G10L15/14
Abstract: A computer-implemented method processes audio streams recorded during a meeting by a plurality of distributed devices. Operations include performing speech recognition on each audio stream by a corresponding speech recognition system to generate utterance-level posterior probabilities as hypotheses for each audio stream, aligning the hypotheses and formatting them as word confusion networks with associated word-level posterior probabilities, performing speaker recognition on each audio stream by a speaker identification algorithm that generates a stream of speaker-attributed word hypotheses, formatting speaker hypotheses with associated speaker label posterior probabilities and speaker-attributed hypotheses for each audio stream as a speaker confusion network, aligning the word and speaker confusion networks from all audio streams to each other to merge the posterior probabilities and align word and speaker labels, and creating a best speaker-attributed word transcript by selecting the sequence of word and speaker labels with the highest posterior probabilities.
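The merging step in this abstract can be illustrated with a minimal sketch. It is not the patented implementation: it assumes the word confusion networks from the streams have already been aligned slot by slot, that each slot is a plain dictionary of word posteriors, and that streams are combined by simple averaging. The names `merge_confusion_networks`, `best_path`, and the `<eps>` skip token are hypothetical.

```python
from collections import defaultdict


def merge_confusion_networks(networks):
    """Average word posteriors slot by slot.

    Assumption: every network has the same number of slots and the
    slots are already aligned across streams.
    """
    n_slots = len(networks[0])
    merged = []
    for i in range(n_slots):
        totals = defaultdict(float)
        for net in networks:
            for word, p in net[i].items():
                totals[word] += p / len(networks)
        merged.append(dict(totals))
    return merged


def best_path(merged, epsilon="<eps>"):
    """Pick the highest-posterior word in each slot, dropping skips."""
    words = []
    for slot in merged:
        word = max(slot, key=slot.get)
        if word != epsilon:
            words.append(word)
    return words


# Two hypothetical streams recorded by two devices in the same meeting.
stream_a = [{"hello": 0.6, "yellow": 0.4}, {"world": 0.9, "<eps>": 0.1}]
stream_b = [{"hello": 0.7, "hollow": 0.3}, {"word": 0.6, "world": 0.4}]

merged = merge_confusion_networks([stream_a, stream_b])
transcript = best_path(merged)
print(transcript)
```

Here the second stream prefers "word" on its own, but the merged posterior favors "world", which shows how combining streams can correct a single device's error.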
-
Publication No.: EP3785144A1
Publication date: 2021-03-03
Application No.: EP19719014.3
Filing date: 2019-04-06
Inventors: CHEN, Ling; SHI, Yu; CHEN, Yining; ZENG, Nanshan; LI, Dong
IPC classification: G06F17/27
-
Publication No.: EP4345816A3
Publication date: 2024-05-29
Application No.: EP24156964.9
Filing date: 2020-03-19
Inventors: YOSHIOKA, Takuya; STOLCKE, Andreas; CHEN, Zhuo; DIMITRIADIS, Dimitrios Basile; ZENG, Nanshan; QIN, Lijuan; HINTHORN, William Isaac; HUANG, Xuedong
IPC classification: H04L12/18 , H04L65/403 , G10L15/26 , G10L19/018 , H04M3/56
CPC classification: G10L15/26 , H04L12/1831 , H04M3/568 , H04L12/1822 , G10L19/018 , H04L65/403 , H04L51/10 , H04M2201/41 (2013.01) , H04M2201/40 (2013.01) , H04M3/42221
Abstract: A computer-implemented method processes audio streams recorded during a meeting by a plurality of distributed devices. Operations include performing speech recognition on each audio stream by a corresponding speech recognition system to generate utterance-level posterior probabilities as hypotheses for each audio stream, aligning the hypotheses and formatting them as word confusion networks with associated word-level posterior probabilities, performing speaker recognition on each audio stream by a speaker identification algorithm that generates a stream of speaker-attributed word hypotheses, formatting speaker hypotheses with associated speaker label posterior probabilities and speaker-attributed hypotheses for each audio stream as a speaker confusion network, aligning the word and speaker confusion networks from all audio streams to each other to merge the posterior probabilities and align word and speaker labels, and creating a best speaker-attributed word transcript by selecting the sequence of word and speaker labels with the highest posterior probabilities.
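The final selection step, choosing word and speaker labels with the highest merged posteriors, might look like the following sketch. It rests on assumptions that go beyond the abstract: the word and speaker confusion networks are already aligned position by position, per-stream posteriors are merged by averaging, and the word and speaker labels are chosen independently at each position. All function names and the sample data are hypothetical.

```python
def merge_posteriors(dists):
    """Average several label -> probability dicts (one per stream)."""
    labels = set()
    for d in dists:
        labels.update(d)
    return {
        label: sum(d.get(label, 0.0) for d in dists) / len(dists)
        for label in labels
    }


def argmax(dist):
    """Return the label with the highest probability."""
    return max(dist, key=dist.get)


def speaker_attributed_transcript(slots):
    """Build (speaker, word) pairs from aligned positions.

    Each slot is (word_dists, speaker_dists), where each element is a
    list holding one posterior dict per audio stream.
    """
    result = []
    for word_dists, speaker_dists in slots:
        word = argmax(merge_posteriors(word_dists))
        speaker = argmax(merge_posteriors(speaker_dists))
        result.append((speaker, word))
    return result


# Hypothetical posteriors from two streams at two aligned positions.
slots = [
    ([{"good": 0.8, "could": 0.2}, {"good": 0.6, "would": 0.4}],
     [{"spk1": 0.7, "spk2": 0.3}, {"spk1": 0.4, "spk2": 0.6}]),
    ([{"morning": 0.9, "warning": 0.1}, {"morning": 0.5, "warning": 0.5}],
     [{"spk1": 0.2, "spk2": 0.8}, {"spk1": 0.45, "spk2": 0.55}]),
]

transcript = speaker_attributed_transcript(slots)
print(transcript)
```

In the first position the two streams disagree about the speaker, and the averaged posterior settles the label; a joint word-speaker model would be a natural alternative to the independent selection assumed here.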
-
Publication No.: EP4281967A1
Publication date: 2023-11-29
Application No.: EP21848357.6
Filing date: 2021-12-17
Inventors: ZHU, Chenguang; ZENG, Nanshan
-
Publication No.: EP4281966A1
Publication date: 2023-11-29
Application No.: EP21844441.2
Filing date: 2021-12-09
Inventors: ZHU, Chenguang; ZENG, Nanshan
IPC classification: G10L15/06 , G10L15/18 , G10L13/02 , G06F40/30 , G10L13/033
-
Publication No.: EP3963579A1
Publication date: 2022-03-09
Application No.: EP20719805.2
Filing date: 2020-03-18
Inventors: YOSHIOKA, Takuya; STOLCKE, Andreas; CHEN, Zhuo; DIMITRIADIS, Dimitrios Basile; ZENG, Nanshan; QIN, Lijuan; HINTHORN, William Isaac; HUANG, Xuedong
IPC classification: G10L21/0272
-
Publication No.: EP3963576A1
Publication date: 2022-03-09
Application No.: EP20719824.3
Filing date: 2020-03-19
-
Publication No.: EP3963575A1
Publication date: 2022-03-09
Application No.: EP20719808.6
Filing date: 2020-03-18
-
Publication No.: EP3963574A1
Publication date: 2022-03-09
Application No.: EP20718467.2
Filing date: 2020-03-17
-