Classification using a cascade approach
    52.
    发明授权
    Classification using a cascade approach 失效
    使用级联方法分类

    公开(公告)号:US07693806B2

    公开(公告)日:2010-04-06

    申请号:US11766434

    申请日:2007-06-21

    IPC分类号: G06F15/18 G06N3/08

    摘要: A system and method that facilitates and effectuates optimizing a classifier for greater performance in a specific region of classification that is of interest, such as a low false positive rate or a low false negative rate. A two-stage classification model can be trained and employed, where the first stage classification is optimized over the entire classification region and the second stage classifier is optimized for the specific region of interest. During training the entire set of training data is employed by a first stage classifier. Only data that is classified by the first stage classifier or by cross validation to fall within a region of interest is used to train the second stage classifier. During classification, data that is classified within the region of interest by the first classification is given the first stage classifier's classification value, otherwise the classification value for the instance of data from the second stage classifier is used.

    摘要翻译: 促进并实现分类器在特定感兴趣区域中的更高性能的系统和方法,例如低假阳性率或低假阴性率。 可以训练和采用两阶段分类模型,其中对整个分类区域优化第一阶段分类,并针对特定的兴趣区域优化第二阶段分类器。 在训练期间,整套训练数据由第一阶段分类器采用。 仅使用由第一阶段分类器分类的数据或通过交叉验证落入感兴趣区域内的数据来训练第二阶段分类器。 在分类期间,通过第一分类对分类在感兴趣区域内的数据给予第一阶段分类器的分类值,否则使用来自第二阶段分类器的数据实例的分类值。

    Origination/destination features and lists for spam prevention
    53.
    发明授权
    Origination/destination features and lists for spam prevention 有权
    起始/目的地功能和垃圾邮件防范列表

    公开(公告)号:US07665131B2

    公开(公告)日:2010-02-16

    申请号:US11621363

    申请日:2007-01-09

    IPC分类号: H04L29/06

    CPC分类号: H04L51/12 G06Q10/107

    摘要: The present invention involves a system and method that facilitate extracting data from messages for spam filtering. The extracted data can be in the form of features, which can be employed in connection with machine learning systems to build improved filters. Data associated with origination information as well as other information embedded in the body of the message that allows a recipient of the message to contact and/or respond to the sender of the message can be extracted as features. The features, or a subset thereof, can be normalized and/or deobfuscated prior to being employed as features of the machine learning systems. The (deobfuscated) features can be employed to populate a plurality of feature lists that facilitate spam detection and prevention. Exemplary features include an email address, an IP address, a URL, an embedded image pointing to a URL, and/or portions thereof.

    摘要翻译: 本发明涉及一种便于从垃圾邮件过滤的消息中提取数据的系统和方法。 提取的数据可以是特征的形式,其可以与机器学习系统结合使用以构建改进的过滤器。 可以提取与发起信息相关联的数据以及嵌入在消息正文中的允许消息的接收者联系和/或响应消息的发送者的其他信息作为特征。 特征或其子集可以在被用作机器学习系统的特征之前被归一化和/或去混淆。 可以使用(去模糊化)功能来填充便于垃圾邮件检测和预防的多个特征列表。 示例性特征包括电子邮件地址,IP地址,URL,指向URL的嵌入图像和/或其部分。

    Automatically displaying keywords and other supplemental information
    54.
    发明授权
    Automatically displaying keywords and other supplemental information 有权
    自动显示关键字和其他补充信息

    公开(公告)号:US07664740B2

    公开(公告)日:2010-02-16

    申请号:US11426509

    申请日:2006-06-26

    IPC分类号: G06F7/00

    摘要: Various embodiments can utilize information that is displayed for a user to automatically generate a list of keywords and use that list as a means to display supplemental information that is relevant to the keywords. In at least some embodiments, the displayed information is analyzed using an extraction algorithm to identify words or, more generally, character strings of interest. If these words or character strings of interest are determined to constitute relevant search terms or “keywords”, then a special user interface portion can be used to display this supplemental information along with the information that is already displayed for the user. This supplemental information can include the search terms themselves, ads that pertain to the search terms, and/or search results that have been ascertained from a web search engine.

    摘要翻译: 各种实施例可以利用为用户显示的信息来自动生成关键字列表,并使用该列表作为显示与关键字相关的补充信息的手段。 在至少一些实施例中,使用提取算法来分析所显示的信息以识别字词,或者更一般地,识别感兴趣的字符串。 如果确定这些关键词或字符串构成相关搜索词或“关键字”,则可以使用特殊用户界面部分来显示该补充信息以及已经为用户显示的信息。 该补充信息可以包括搜索词本身,与搜索词相关的广告,和/或已从网页搜索引擎确定的搜索结果。

    Phishing detection, prevention, and notification
    55.
    发明授权
    Phishing detection, prevention, and notification 有权
    网路钓鱼检测,预防和通知

    公开(公告)号:US07634810B2

    公开(公告)日:2009-12-15

    申请号:US11129222

    申请日:2005-05-13

    IPC分类号: H04L29/06 G06F21/00

    摘要: Phishing detection, prevention, and notification is described. In an embodiment, a messaging application facilitates communication via a messaging user interface, and receives a communication, such as an email message, from a domain. A phishing detection module detects a phishing attack in the communication by determining that the domain is similar to a known phishing domain, or by detecting suspicious network properties of the domain. In another embodiment, a Web browsing application receives content, such as data for a Web page, from a network-based resource, such as a Web site or domain. The Web browsing application initiates a display of the content, and a phishing detection module detects a phishing attack in the content by determining that a domain of the network-based resource is similar to a known phishing domain, or that an address of the network-based resource from which the content is received has suspicious network properties.

    摘要翻译: 描述网络钓鱼检测,预防和通知。 在一个实施例中,消息收发应用促进通过消息收发用户界面的通信,并从域接收诸如电子邮件消息之类的通信。 钓鱼检测模块通过确定域与已知的网络钓鱼域相似,或通过检测域的可疑网络属性来检测通信中的网络钓鱼攻击。 在另一个实施例中,Web浏览应用程序从基于网络的资源(诸如网站或域)接收诸如网页的数据的内容。 Web浏览应用程序启动内容的显示,并且网络钓鱼检测模块通过确定基于网络的资源的域类似于已知的网络钓鱼域来检测内容中的网络钓鱼攻击,或者网络 - 收到内容的基于资源的资源具有可疑的网络属性。

    PROVIDING A TASK DESCRIPTION NAME SPACE MAP FOR THE INFORMATION WORKER
    56.
    发明申请
    PROVIDING A TASK DESCRIPTION NAME SPACE MAP FOR THE INFORMATION WORKER 有权
    为信息工作者提供任务说明名称空间地图

    公开(公告)号:US20090254336A1

    公开(公告)日:2009-10-08

    申请号:US12098189

    申请日:2008-04-04

    IPC分类号: G06F17/27

    CPC分类号: G06F9/451

    摘要: Providing for generation of a task oriented data structure that can correlate natural language descriptions of computer related tasks to application level commands and functions is described herein. By way of example, a system can include an activity translation component that can receive a natural language description of an application level task. Furthermore, the system can include a language modeling component that can generate the data structure based on an association between the description of the task and at least one application level command utilized in executing the computer related task. Once generated, the data structure can be utilized to automate computer related tasks by input of a human centric description of those tasks. According to further embodiments, machine learning can be employed to train classifiers and heuristic models to optimize task/description relationships and/or tailor such relationships to the needs of particular users.

    摘要翻译: 本文描述了生成可将计算机相关任务的自然语言描述与应用级命令和功能相关联的面向任务的数据结构。 作为示例,系统可以包括能够接收应用级任务的自然语言描述的活动翻译组件。 此外,系统可以包括语言建模组件,该组件可以基于任务描述与在执行计算机相关任务中使用的至少一个应用程序级别命令之间的关联来生成数据结构。 一旦生成,数据结构可以用于通过输入对这些任务的以人为中心的描述来自动化计算机相关任务。 根据另外的实施例,可以采用机器学习来训练分类器和启发式模型以优化任务/描述关系和/或根据特定用户的需要定制这样的关系。

    Out-of-vocabulary word determination and user interface for text input via reduced keypad keys
    57.
    发明授权
    Out-of-vocabulary word determination and user interface for text input via reduced keypad keys 有权
    通过减少的键盘键进行文字输入的词汇词定义和用户界面

    公开(公告)号:US07385591B2

    公开(公告)日:2008-06-10

    申请号:US09823585

    申请日:2001-03-31

    申请人: Joshua T. Goodman

    发明人: Joshua T. Goodman

    IPC分类号: H03K17/94 G06F3/02

    CPC分类号: G06F3/0237

    摘要: Out-of-vocabulary (OOV) word determination corresponding to a key sequence entered by the user on a (typically numeric) keypad, and a user interface for the user to select one of the words, are disclosed. A word-determining logic determines letter sequences corresponding to the entered key sequence, and presents the sequences within the user interface in which the user can select one of the letter sequences as the intended word, or select the first letter of the intended word. When letters are selected, the word-determining logic determines new letter sequences, consistent with the key sequence and the selected letters, and presents the new letter sequences. The user again selects one of the letter sequences as the intended word, or selects the second letter of the intended word. This process is repeated until the user has selected the intended word.

    摘要翻译: 公开了与用户在(通常是数字)键盘上输入的键序列对应的词汇词义(OOV)字,以及用户用户选择其中一个词的用户界面。 字确定逻辑确定与输入的键序列相对应的字母序列,并且在用户界面内呈现用户可以选择其中一个字母序列作为预期词的序列,或者选择预期词的第一个字母。 当选择字母时,字确定逻辑确定与键序列和所选字母一致的新字母序列,并呈现新字母序列。 用户再次选择一个字母序列作为预期的单词,或者选择预期单词的第二个字母。 重复该过程,直到用户选择了预期的单词。

    ARCHITECTURE FOR USER- AND CONTEXT- SPECIFIC PREFETCHING AND CACHING OF INFORMATION ON PORTABLE DEVICES
    58.
    发明申请
    ARCHITECTURE FOR USER- AND CONTEXT- SPECIFIC PREFETCHING AND CACHING OF INFORMATION ON PORTABLE DEVICES 有权
    用户和上下文的特定提示和便携式设备信息的架构

    公开(公告)号:US20080005695A1

    公开(公告)日:2008-01-03

    申请号:US11427755

    申请日:2006-06-29

    IPC分类号: G06F3/048 H04Q7/20

    摘要: Content management architecture for a portable wireless device. Caching and fetching techniques are provided to improve content handling for portable devices such as cellular telephones and portable computers. A search component automatically performs searches as a background process, and potentially desired content is received and cached by a content storing component to be available in the future when and if needed, mitigating latency associated with slow download speeds, refresh rates, and other system and/or network impediments. Content from background search results can be trickled into the device as part of the background process so as not to burden system resources for other processes. As part of memory management, aged and/or low priority or low interest content can be selectively removed or archived to increase available cache or memory space, as well as to maintain relevant content within the device. A presentation component facilitates presentation of the pre-stored content.

    摘要翻译: 便携式无线设备的内容管理架构。 提供缓存和提取技术以改进便携式设备(例如蜂窝电话和便携式计算机)的内容处理。 搜索组件自动执行搜索作为后台进程,并且可能期望的内容被内容存储组件接收和缓存,以便将来在需要时可用,减轻与慢下载速度,刷新率和其他系统相关联的延迟,以及 /或网络障碍。 来自后台搜索结果的内容可以作为后台进程的一部分进入设备,以免对其他进程造成系统资源的负担。 作为内存管理的一部分,老化和/或低优先级或低兴趣内容可以被选择性地删除或归档以增加可用的高速缓存或存储器空间,以及维护设备内的相关内容。 演示组件便于显示预存的内容。