-
公开(公告)号:US20190122103A1
公开(公告)日:2019-04-25
申请号:US15792051
申请日:2017-10-24
Applicant: International Business Machines Corporation
Inventor: Peng Gao , Xiu Li Li , Yong Qin , Shi Lei Zhang , Xiaolu Zhang , Xin Zhang , Shi Wan Zhao
Abstract: Techniques facilitating attention based sequential image processing are provided. A system can comprise a memory that stores computer executable components and a processor that executes the computer executable components stored in the memory. The computer executable components can comprise an initialization component that can perform self-attention based training on a model that comprises context information associated with a sequence of images. Images of the sequence of images can be selected during the self-attention based training. The computer executable components can also comprise a localization component that can extract local information from the images selected during the self-attention based training based on the context information. In addition, the computer executable components can also comprise an integration component that can update the model based on an end-to-end integrated attention training framework comprising the context information and the local information.
-
公开(公告)号:US20170039440A1
公开(公告)日:2017-02-09
申请号:US14821258
申请日:2015-08-07
Applicant: International Business Machines Corporation
Inventor: Min Li , Wen Liu , Yong Qin , Zhong Su , Shi Lei Zhang , Shiwan Zhao
IPC: G06K9/00 , G10L15/25 , G10L25/57 , G10L13/027 , G10L15/04
CPC classification number: G06K9/00906 , G06K9/00281 , G06K9/00315 , G10L13/027 , G10L15/04 , G10L15/25 , G10L17/22 , G10L25/57
Abstract: In an approach for visual liveness detection, a video-audio signal related to a speaker speaking a text is obtained. The video-audio signal is split into a video signal which records images of the speaker and an audio signal which records a speech spoken by the speaker. Then a first sequence indicating visual mouth openness is obtained from the video signal, and a second sequence indicating acoustic mouth openness is obtained based on the text and the audio signal. Synchrony between the first and second sequences is measured, and the liveness of the speaker is determined based on the synchrony.
Abstract translation: 在视觉活动检测的方法中,获得与说话者说话的扬声器相关的视频 - 音频信号。 视频 - 音频信号被分割成记录扬声器的图像的视频信号和记录扬声器所说出的语音的音频信号。 然后,从视频信号获得指示视觉开放性的第一序列,并且基于文本和音频信号获得指示声音开口性的第二序列。 测量第一和第二序列之间的同步,并且基于同步来确定说话者的活力。
-
公开(公告)号:US20160292267A1
公开(公告)日:2016-10-06
申请号:US15185316
申请日:2016-06-17
Applicant: International Business Machines Corporation
Inventor: Feng Jin , Qin Jin , Wen Liu , Yong Qin , Xu Dong Tu , Shi Lei Zhang
CPC classification number: G06F16/683 , G06F16/686 , G06N20/00
Abstract: A pattern based audio searching method includes labeling a plurality of source audio data based on patterns to obtain audio label sequences of the source audio data; obtaining, with a processing device, an audio label sequence of target audio data; determining matching degree between the target audio data and the source audio data according to a predetermined matching rule based on the audio label sequence of the target audio data and the audio label sequences of the source audio data; and outputting source audio data having matching degree higher than a predetermined matching threshold as a search result.
-
34.
公开(公告)号:US09183367B2
公开(公告)日:2015-11-10
申请号:US14291059
申请日:2014-05-30
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventor: Sheng Hu Bao , Min Li , Yong Qin , Zhong Su , Liu Wen , Shi Lei Zhang
Abstract: Voice based biometric authentication method, apparatus (system), and computer program product. Provided is voice verification solution with a high accuracy rate that can prevent cheating via recording. The method includes: transmitting to the user a question prompt requiring the user to speak out a voice segment and an answer to a dynamic question, the voice segment having a corresponding text dependent speaker verification model enrolled before the authentication; segmenting, in response to receiving the voice answer, the voice segment part and the dynamic question answer part out from the voice answer; and verifying boundary smoothness between the voice segment and the answer to the dynamic question within the voice answer. With this method, whether a voice answer relates to cheating via recording is determined according to the degree of smoothness at a detected boundary. The apparatus and computer program product carry out the steps of the above-mentioned method.
Abstract translation: 基于语音的生物识别方法,装置(系统)和计算机程序产品。 提供了具有高准确率的语音验证解决方案,可以通过录制来防止作弊。 该方法包括:向用户发送要求用户说出语音段和对动态问题的答案的问题提示,该语音段具有在认证之前注册的对应的文本相关说明者验证模型; 响应于接收到语音回答,语音段部分和动态问题回答部分从语音答案中分割; 并在语音答案中验证语音段与动态问题的答案之间的边界平滑度。 使用该方法,根据检测到的边界处的平滑度来确定语音应答与记录作弊有关。 该装置和计算机程序产品执行上述方法的步骤。
-
公开(公告)号:US20150170044A1
公开(公告)日:2015-06-18
申请号:US14105874
申请日:2013-12-13
Applicant: International Business Machines Corporation
Inventor: Feng Jin , Qin Jin , Wen Liu , Yong Qin , Xu Dong Tu , Shi Lei Zhang
CPC classification number: G06F17/30743 , G06F17/30752 , G06N99/005
Abstract: A pattern based audio searching method includes labeling a plurality of source audio data based on patterns to obtain audio label sequences of the source audio data; obtaining, with a processing device, an audio label sequence of target audio data; determining matching degree between the target audio data and the source audio data according to a predetermined matching rule based on the audio label sequence of the target audio data and the audio label sequences of the source audio data; and outputting source audio data having matching degree higher than a predetermined matching threshold as a search result.
Abstract translation: 基于图案的音频搜索方法包括基于模式来标记多个源音频数据以获得源音频数据的音频标签序列; 利用处理设备获得目标音频数据的音频标签序列; 基于目标音频数据的音频标签序列和源音频数据的音频标签序列,根据预定匹配规则确定目标音频数据和源音频数据之间的匹配度; 并输出具有高于预定匹配阈值的匹配度的源音频数据作为搜索结果。
-
公开(公告)号:US11055330B2
公开(公告)日:2021-07-06
申请号:US16199923
申请日:2018-11-26
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
IPC: G06F16/00 , G06F16/332 , G06N3/04 , G06F16/951 , G06F16/33 , G06F16/9038
Abstract: A computer-implemented method for utilizing external knowledge and memory networks in a question-answering system includes receiving, from a search engine of a question-answering system, one or more search results based on a search query associated with a question submitted via a user interface associated with a computing device, analyzing the one or more search results to generate search evidence as a source of external knowledge for generating an answer to the question, the search evidence including one or more titles and one or more corresponding text snippets, encoding the search evidence and the search query to generate vectors stored in a memory network, obtaining a final vector representation based on the encoding, and decoding the final vector representation to obtain the answer to the question.
-
公开(公告)号:US10789298B2
公开(公告)日:2020-09-29
申请号:US15352842
申请日:2016-11-16
Applicant: International Business Machines Corporation
IPC: G06F16/00 , G06F16/9032 , G06F16/36 , G06F16/33
Abstract: Techniques are provided for generating recommended query terms that are specialized to a topic of desired information based on a query associated with a user. In one example, a computer-implemented method comprising selecting, by a system operatively coupled to a processor, a coarse cluster of corpus terms having a defined relatedness to a query associated with a user from a plurality of coarse clusters of corpus terms; and determining, by the system, a plurality of candidate terms from search results associated with the query. The computer-implemented method can also comprise determining, by the system, at least one recommended query term based on refined clusters of the coarse cluster, the candidate terms, and the query; and displaying, by the system, the at least one recommended query term on a display device associated with the query.
-
公开(公告)号:US20200218744A1
公开(公告)日:2020-07-09
申请号:US16241000
申请日:2019-01-07
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventor: Ke Wang , Pei Ni Liu , Wen Sun , Jing Min Xu , Songfang Huang , Yong Qin
Abstract: Methods and systems for processing records include extracting feature vectors from words in an unstructured portion of a record. The feature vectors are weighted based similarity to a topic vector from a structured portion of the record associated with the unstructured portion. The weighted feature vectors are classified using a machine learning model to determine respective probability vectors that assign a probability to each of a set of possible relations for each feature vector. Relations between entities are determined within the record based on the probability vectors. An action is performed responsive to the determined relations.
-
公开(公告)号:US20200210621A1
公开(公告)日:2020-07-02
申请号:US16238216
申请日:2019-01-02
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventor: Sui Jun Tong , Wen Sun , Yi Qin Yu , Eryu Xia , Yong Qin
Abstract: A system for decentralized privacy-preserving clinical data evaluation includes a plurality of sites of a decentralized private network, a memory device for storing program code, and at least one processor device operatively coupled to the memory device and configured to execute program code stored on the memory device to, for each of the local datasets, evaluate the local dataset using each of the local models to obtain one or more features related to a degree of outlierness, determine at least one outlier dataset based on the one or more features, and implement one or more actions based on the determination.
-
公开(公告)号:US20200160183A1
公开(公告)日:2020-05-21
申请号:US16773456
申请日:2020-01-27
Applicant: International Business Machines Corporation
Inventor: Peng Gao , Xiu Li Li , Yong Qin , Shi Lei Zhang , Xiaolu Zhang , Xin Zhang , Shi Wan Zhao
Abstract: Techniques facilitating attention based sequential image processing are provided. A system can comprise a memory that stores computer executable components and a processor that executes the computer executable components stored in the memory. The computer executable components can comprise an initialization component that can perform self-attention based training on a model that comprises context information associated with a sequence of images. Images of the sequence of images can be selected during the self-attention based training. The computer executable components can also comprise a localization component that can extract local information from the images selected during the self-attention based training based on the context information. In addition, the computer executable components can also comprise an integration component that can update the model based on an end-to-end integrated attention training framework comprising the context information and the local information.
-
-
-
-
-
-
-
-
-