Efficient string search
    1.
    发明授权
    Efficient string search 有权
    高效的字符串搜索

    公开(公告)号:US08086441B1

    公开(公告)日:2011-12-27

    申请号:US11881556

    申请日:2007-07-27

    IPC分类号: G06F17/28

    摘要: Some embodiments of an efficient string search have been presented. In one embodiment, a string of bytes representing content written in a non-delimited language is received, wherein the content has been classified into a predetermined category. In a single pass through the string of bytes, a set of N-grams is searched for simultaneously. Statistical information on occurrences of the N-grams, if any, in the string of bytes is collected. In some embodiments, a model is generated based on the statistical information, where the model is usable by a content filter to classify content.

    摘要翻译: 已经提出了有效的字符串搜索的一些实施例。 在一个实施例中,接收表示以非分隔语言编写的内容的字节串,其中内容已被分类为预定类别。 在通过字符串的单次传递中,同时搜索一组N-gram。 收集字节串中N-gram出现的统计信息(如果有的话)。 在一些实施例中,基于统计信息生成模型,其中模型可由内容过滤器用于对内容进行分类。

    Training procedure for N-gram-based statistical content classification
    2.
    发明授权
    Training procedure for N-gram-based statistical content classification 有权
    基于N-gram的统计内容分类的训练程序

    公开(公告)号:US07792846B1

    公开(公告)日:2010-09-07

    申请号:US11881770

    申请日:2007-07-27

    IPC分类号: G06F7/00 G06F17/30

    CPC分类号: G06F17/30705

    摘要: A training procedure for N-gram based statistical document classification has been disclosed. In one embodiment, a set of N-grams is selected out of a second set of N-grams, each of the N-grams having a sequence of N bytes, where N is an integer. Then a statistical content classification model is generated based on occurrences of the N-grams, if any, in a set of training documents and a set of validation documents. The statistical content classification model is provided to content filters to classify content.

    摘要翻译: 已经公开了基于N-gram的统计文件分类的训练程序。 在一个实施例中,从第二组N-gram中选出一组N克,每个N克具有N个字节的序列,其中N是整数。 然后,根据一组训练文件和一组验证文件中的N-gram的出现(如果有的话)生成统计内容分类模型。 统计内容分类模型提供给内容过滤器以对内容进行分类。

    On-the-fly pattern recognition with configurable bounds
    3.
    发明授权
    On-the-fly pattern recognition with configurable bounds 有权
    具有可配置边界的动态模式识别

    公开(公告)号:US08370374B1

    公开(公告)日:2013-02-05

    申请号:US13196480

    申请日:2011-08-02

    IPC分类号: G06F7/00 G06F17/30

    摘要: Some embodiments of on-the-fly pattern recognition with configurable bounds have been presented. In one embodiment, a pattern matching engine is configured based on user input, which may include values of one or more user configurable bounds on searching. Then the configured pattern matching engine is used to search for a set of features in an incoming string. A set of scores is updated based on the presence of any of the features in the string while searching for the features. Each score may indicate a likelihood of the content of the string being in a category. The search is terminated if the end of the string is reached or if the user configurable bounds are met. After terminating the search, the scores are output.

    摘要翻译: 已经提出了具有可配置界限的动态模式识别的一些实施例。 在一个实施例中,模式匹配引擎被配置为基于用户输入,其可以包括搜索上的一个或多个用户可配置边界的值。 然后,配置的模式匹配引擎用于搜索传入字符串中的一组要素。 基于在搜索特征时字符串中的任何特征的存在来更新一组分数。 每个分数可以指示字符串的内容在类别中的可能性。 如果达到字符串的结尾或满足用户可配置的界限,则搜索终止。 结束搜索后,输出得分。

    Efficient string search
    4.
    发明授权
    Efficient string search 有权
    高效的字符串搜索

    公开(公告)号:US08577669B1

    公开(公告)日:2013-11-05

    申请号:US13335743

    申请日:2011-12-22

    IPC分类号: G06F17/28

    摘要: Some embodiments of an efficient string search have been presented. In one embodiment, a string of bytes representing content written in a non-delimited language is received, wherein the content has been classified into a predetermined category. In a single pass through the string of bytes, a set of N-grams is searched for simultaneously. Statistical information on occurrences of the N-grams, if any, in the string of bytes is collected. In some embodiments, a model is generated based on the statistical information, where the model is usable by a content filter to classify content.

    摘要翻译: 已经提出了有效的字符串搜索的一些实施例。 在一个实施例中,接收表示以非分隔语言编写的内容的字节串,其中内容已被分类为预定类别。 在通过字符串的单次传递中,同时搜索一组N-gram。 收集字节串中N-gram出现的统计信息(如果有的话)。 在一些实施例中,基于统计信息生成模型,其中模型可由内容过滤器用于对内容进行分类。

    Training procedure for N-gram-based statistical content classification
    5.
    发明授权
    Training procedure for N-gram-based statistical content classification 有权
    基于N-gram的统计内容分类的训练程序

    公开(公告)号:US07917522B1

    公开(公告)日:2011-03-29

    申请号:US12822439

    申请日:2010-06-24

    IPC分类号: G06F7/00 G06F17/30

    CPC分类号: G06F17/30705

    摘要: A training procedure for N-gram based statistical document classification has been disclosed. In one embodiment, a set of N-grams is selected out of a second set of N-grams, each of the N-grams having a sequence of N bytes, where N is an integer. Then a statistical content classification model is generated based on occurrences of the N-grams, if any, in a set of training documents and a set of validation documents. The statistical content classification model is provided to content filters to classify content.

    摘要翻译: 已经公开了基于N-gram的统计文件分类的训练程序。 在一个实施例中,从第二组N-gram中选出一组N克,每个N克具有N个字节的序列,其中N是整数。 然后,根据一组训练文件和一组验证文件中的N-gram的出现(如果有的话)生成统计内容分类模型。 统计内容分类模型提供给内容过滤器以对内容进行分类。

    On-the-fly pattern recognition with configurable bounds
    7.
    发明授权
    On-the-fly pattern recognition with configurable bounds 有权
    具有可配置边界的动态模式识别

    公开(公告)号:US07996415B1

    公开(公告)日:2011-08-09

    申请号:US12846102

    申请日:2010-07-29

    IPC分类号: G06F7/00 G06F17/30

    摘要: Some embodiments of on-the-fly pattern recognition with configurable bounds have been presented. In one embodiment, a pattern matching engine is configured based on user input, which may include values of one or more user configurable bounds on searching. Then the configured pattern matching engine is used to search for a set of features in an incoming string. A set of scores is updated based on the presence of any of the features in the string while searching for the features. Each score may indicate a likelihood of the content of the string being in a category. The search is terminated if the end of the string is reached or if the user configurable bounds are met. After terminating the search, the scores are output.

    摘要翻译: 已经提出了具有可配置界限的动态模式识别的一些实施例。 在一个实施例中,模式匹配引擎被配置为基于用户输入,其可以包括搜索上的一个或多个用户可配置边界的值。 然后,配置的模式匹配引擎用于搜索传入字符串中的一组要素。 基于在搜索特征时字符串中的任何特征的存在来更新一组分数。 每个分数可以指示字符串的内容在类别中的可能性。 如果达到字符串的结尾或满足用户可配置的界限,则搜索终止。 结束搜索后,输出得分。

    On-the-fly pattern recognition with configurable bounds
    8.
    发明授权
    On-the-fly pattern recognition with configurable bounds 有权
    具有可配置边界的动态模式识别

    公开(公告)号:US07792850B1

    公开(公告)日:2010-09-07

    申请号:US11881530

    申请日:2007-07-27

    IPC分类号: G06F7/00 G06F17/30

    摘要: Some embodiments of on-the-fly pattern recognition with configurable bounds have been presented. In one embodiment, a pattern matching engine is configured based on user input, which may include values of one or more user configurable bounds on searching. Then the configured pattern matching engine is used to search for a set of features in an incoming string. A set of scores is updated based on the presence of any of the features in the string while searching for the features. Each score may indicate a likelihood of the content of the string being in a category. The search is terminated if the end of the string is reached or if the user configurable bounds are met. After terminating the search, the scores are output.

    摘要翻译: 已经提出了具有可配置界限的动态模式识别的一些实施例。 在一个实施例中,模式匹配引擎被配置为基于用户输入,其可以包括搜索上的一个或多个用户可配置边界的值。 然后,配置的模式匹配引擎用于搜索传入字符串中的一组要素。 基于在搜索特征时字符串中的任何特征的存在来更新一组分数。 每个分数可以指示字符串的内容在类别中的可能性。 如果达到字符串的结尾或满足用户可配置的界限,则搜索终止。 结束搜索后,输出得分。

    Method and apparatus for multimedia content filtering
    9.
    发明授权
    Method and apparatus for multimedia content filtering 有权
    多媒体内容过滤的方法和装置

    公开(公告)号:US09275047B1

    公开(公告)日:2016-03-01

    申请号:US11236280

    申请日:2005-09-26

    IPC分类号: G06F17/30

    摘要: Method and apparatus for multimedia content filtering are described herein. In one embodiment, an example of a network access device, in response to multimedia content transmitted from a source over a first network and destined to a destination over a second network, opens the multimedia content within the network access device interfacing the first and second networks. A content rating operation is performed on the opened multimedia content to determine whether the multimedia content should be transmitted to the destination over the second network. Other methods and apparatuses are also described.

    摘要翻译: 本文描述了用于多媒体内容过滤的方法和装置。 在一个实施例中,网络接入设备的示例响应于通过第一网络从源发送并且通过第二网络发往目的地的多媒体内容,打开在与第一和第二网络接口的网络接入设备内的多媒体内容 。 对打开的多媒体内容执行内容评级操作,以确定是否应通过第二网络将多媒体内容发送到目的地。 还描述了其它方法和装置。

    Net-based email filtering
    10.
    发明授权
    Net-based email filtering 有权
    基于网络的电子邮件过滤

    公开(公告)号:US08671447B2

    公开(公告)日:2014-03-11

    申请号:US13155819

    申请日:2011-06-08

    IPC分类号: G06F15/16

    摘要: A local gateway device receives email across the internet from a sender of the email and forwards it across the internet to an email filtering system. The email filtering system analyzes the email to determine whether it is spam, phishing or contains a virus and sends it back to the local gateway device along with the filtered determination. The local gateway device forwards the received email and the filtered determination to a local junk store which handles the email appropriately. For example, if the email has been determined to be spam, phishing or containing a virus, the junk store can quarantine the email and if the email has been determined to be non-spun and/or not phishing and/or not containing a virus, the junk store can forward the email to a local mail server for delivery.

    摘要翻译: 本地网关设备通过互联网从电子邮件的发送者接收电子邮件,并将其通过互联网转发到电子邮件过滤系统。 电子邮件过滤系统分析电子邮件,以确定是垃圾邮件,网络钓鱼还是包含病毒,并将其与过滤后的确定一起发送回本地网关设备。 本地网关设备将接收到的电子邮件和过滤的确定转发给适当处理电子邮件的本地垃圾商店。 例如,如果电子邮件被确定为垃圾邮件,网络钓鱼或包含病毒,垃圾商店可以隔离电子邮件,并且如果电子邮件已被确定为不转动和/或不进行网络钓鱼和/或不包含病毒 垃圾商店可以将邮件转发到本地邮件服务器进行传送。