System and method for dynamic facial features for speaker recognition
    1.
    发明授权
    System and method for dynamic facial features for speaker recognition 有权
    用于说话者识别的动态面部特征的系统和方法

    公开(公告)号:US09218815B2

    公开(公告)日:2015-12-22

    申请号:US14551907

    申请日:2014-11-24

    Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for performing speaker verification. A system configured to practice the method receives a request to verify a speaker, generates a text challenge that is unique to the request, and, in response to the request, prompts the speaker to utter the text challenge. Then the system records a dynamic image feature of the speaker as the speaker utters the text challenge, and performs speaker verification based on the dynamic image feature and the text challenge. Recording the dynamic image feature of the speaker can include recording video of the speaker while speaking the text challenge. The dynamic feature can include a movement pattern of head, lips, mouth, eyes, and/or eyebrows of the speaker. The dynamic image feature can relate to phonetic content of the speaker speaking the challenge, speech prosody, and the speaker's facial expression responding to content of the challenge.

    Abstract translation: 本文公开了用于执行说话者验证的系统,方法和非暂时的计算机可读存储介质。 被配置为实施该方法的系统接收到验证说话者的请求,产生对该请求是唯一的文本挑战,并且响应该请求提示说话者发出文本挑战。 然后当扬声器发出文本挑战时,系统记录扬声器的动态图像特征,并且基于动态图像特征和文本挑战来执行说话者验证。 录制扬声器的动态图像功能可以包括在说出文本挑战时录制扬声器的视频。 动态特征可以包括扬声器的头部,嘴唇,嘴巴,眼睛和/或眉毛的运动模式。 动态图像特征可以涉及讲话者讲话的语音内容,语音韵律以及响应于挑战内容的说话者的面部表情。

    System and method for combining frame and segment level processing, via temporal pooling, for phonetic classification
    2.
    发明授权
    System and method for combining frame and segment level processing, via temporal pooling, for phonetic classification 有权
    用于组合帧和段级处理的系统和方法,通过时间池进行语音分类

    公开(公告)号:US09208778B2

    公开(公告)日:2015-12-08

    申请号:US14537400

    申请日:2014-11-10

    CPC classification number: G10L15/02 G10L15/08 G10L15/16

    Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for combining frame and segment level processing, via temporal pooling, for phonetic classification. A frame processor unit receives an input and extracts the time-dependent features from the input. A plurality of pooling interface units generates a plurality of feature vectors based on pooling the time-dependent features and selecting a plurality of time-dependent features according to a plurality of selection strategies. Next, a plurality of segmental classification units generates scores for the feature vectors. Each segmental classification unit (SCU) can be dedicated to a specific pooling interface unit (PIU) to form a PIU-SCU combination. Multiple PIU-SCU combinations can be further combined to form an ensemble of combinations, and the ensemble can be diversified by varying the pooling operations used by the PIU-SCU combinations. Based on the scores, the plurality of segmental classification units selects a class label and returns a result.

    Abstract translation: 本文公开了用于通过时间池来组合帧和段级处理用于语音分类的系统,方法和非暂时的计算机可读存储介质。 帧处理器单元接收输入并从输入中提取与时间相关的特征。 多个池化接口单元基于集合时间依赖特征并根据多个选择策略选择多个时间相关特征来生成多个特征向量。 接下来,多个分段分类单元生成特征向量的得分。 每个分段分类单元(SCU)可专用于特定的汇聚接口单元(PIU)以形成PIU-SCU组合。 可以进一步组合多个PIU-SCU组合以形成组合的集合,并且可以通过改变PIU-SCU组合使用的合并操作来使集合多样化。 基于分数,多个分段分类单元选择分类标签并返回结果。

    System and method for combining frame and segment level processing, via temporal pooling, for phonetic classification

    公开(公告)号:US09728183B2

    公开(公告)日:2017-08-08

    申请号:US14936772

    申请日:2015-11-10

    CPC classification number: G10L15/02 G10L15/08 G10L15/16

    Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for combining frame and segment level processing, via temporal pooling, for phonetic classification. A frame processor unit receives an input and extracts the time-dependent features from the input. A plurality of pooling interface units generates a plurality of feature vectors based on pooling the time-dependent features and selecting a plurality of time-dependent features according to a plurality of selection strategies. Next, a plurality of segmental classification units generates scores for the feature vectors. Each segmental classification unit (SCU) can be dedicated to a specific pooling interface unit (PIU) to form a PIU-SCU combination. Multiple PIU-SCU combinations can be further combined to form an ensemble of combinations, and the ensemble can be diversified by varying the pooling operations used by the PIU-SCU combinations. Based on the scores, the plurality of segmental classification units selects a class label and returns a result.

    Systems and Methods for Rule-Based Anomaly Detection on IP Network Flow
    7.
    发明申请
    Systems and Methods for Rule-Based Anomaly Detection on IP Network Flow 有权
    基于规则的IP网络异常检测系统与方法

    公开(公告)号:US20160105462A1

    公开(公告)日:2016-04-14

    申请号:US14969591

    申请日:2015-12-15

    Abstract: A system to detect anomalies in internet protocol (IP) flows uses a set of machine-learning (ML) rules that can be applied in real time at the IP flow level. A communication network has a large number of routers equipped with flow monitoring capability. A flow collector collects flow data from the routers throughout the communication network and provides them to a flow classifier. At the same time, a limited number of locations in the network monitor data packets and generate alerts based on packet data properties. The packet alerts and the flow data are provided to a machine learning system that detects correlations between the packet-based alerts and the flow data to thereby generate a series of flow-level alerts. These rules are provided to the flow time classifier. Over time, the new packet alerts and flow data are used to provide updated rules generated by the machine learning system.

    Abstract translation: 检测互联网协议(IP)流中的异常的系统使用一组机器学习(ML)规则,可以在IP流级别实时应用。 通信网络具有大量具有流量监控功能的路由器。 集流器在通信网络中收集来自路由器的流数据,并将其提供给流分类器。 同时,网络中有限数量的位置监视数据包,并根据数据包数据属性生成警报。 分组警报和流数据被提供给机器学习系统,其检测基于分组的警报和流数据之间的相关性,从而生成一系列流级别警报。 这些规则提供给流时间分类器。 随着时间的推移,新的数据包警报和流数据用于提供机器学习系统生成的更新规则。

    SYSTEM AND METHOD FOR DYNAMIC FACIAL FEATURES FOR SPEAKER RECOGNITION
    8.
    发明申请
    SYSTEM AND METHOD FOR DYNAMIC FACIAL FEATURES FOR SPEAKER RECOGNITION 有权
    用于声音识别的动态特征的系统和方法

    公开(公告)号:US20150081302A1

    公开(公告)日:2015-03-19

    申请号:US14551907

    申请日:2014-11-24

    Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for performing speaker verification. A system configured to practice the method receives a request to verify a speaker, generates a text challenge that is unique to the request, and, in response to the request, prompts the speaker to utter the text challenge. Then the system records a dynamic image feature of the speaker as the speaker utters the text challenge, and performs speaker verification based on the dynamic image feature and the text challenge. Recording the dynamic image feature of the speaker can include recording video of the speaker while speaking the text challenge. The dynamic feature can include a movement pattern of head, lips, mouth, eyes, and/or eyebrows of the speaker. The dynamic image feature can relate to phonetic content of the speaker speaking the challenge, speech prosody, and the speaker's facial expression responding to content of the challenge.

    Abstract translation: 本文公开了用于执行说话者验证的系统,方法和非暂时的计算机可读存储介质。 被配置为实施该方法的系统接收到验证说话者的请求,产生对该请求是唯一的文本挑战,并且响应该请求提示说话者发出文本挑战。 然后当扬声器发出文本挑战时,系统记录扬声器的动态图像特征,并且基于动态图像特征和文本挑战来执行说话者验证。 录制扬声器的动态图像功能可以包括在说出文本挑战时录制扬声器的视频。 动态特征可以包括扬声器的头部,嘴唇,嘴巴,眼睛和/或眉毛的运动模式。 动态图像特征可以涉及讲话者讲话的语音内容,语音韵律以及响应于挑战内容的说话者的面部表情。

Patent Agency Ranking