Machine learning contextual approach to word determination for text input via reduced keypad keys

    Publication No.: US07103534B2

    Publication date: 2006-09-05

    Application No.: US09823619

    Filing date: 2001-03-31

    Applicant: Joshua T. Goodman

    Inventor: Joshua T. Goodman

    IPC classification: G06F17/20 G06F17/21

    CPC classification: G06F3/0237 G06F17/276

    Abstract: Determination of a word input on a reduced keypad, such as a numeric keypad, by entering a key sequence ambiguously corresponding to the word, by taking into account the context of the word via a machine learning approach, is disclosed. Either the left context, the right context, or the double-sided context of the number sequence can be used to determine the intended word. The machine learning approach can use a statistical language model, such as an n-gram language model. The compression of a language model for use with small devices, such as mobile phones and other types of small devices, is also disclosed.
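
    Illustrative sketch (not taken from the patent): the abstract names left-context disambiguation with an n-gram model, and the Python below shows one minimal way that idea could look. It assumes a bigram model with add-one smoothing trained on a tiny whitespace-tokenized corpus of letters only. The names `BigramDisambiguator`, `word_to_keys`, and `decode` are hypothetical, and the patent's actual model, smoothing, and compression scheme are not reproduced here.

```python
from collections import defaultdict

# Standard telephone keypad letter groups.
KEYPAD = {
    "2": "abc", "3": "def", "4": "ghi", "5": "jkl",
    "6": "mno", "7": "pqrs", "8": "tuv", "9": "wxyz",
}
LETTER_TO_KEY = {ch: key for key, letters in KEYPAD.items() for ch in letters}


def word_to_keys(word):
    """Map a word (letters a-z only) to the digit sequence that types it."""
    return "".join(LETTER_TO_KEY[ch] for ch in word.lower())


class BigramDisambiguator:
    """Hypothetical helper: picks the word for a key sequence using left context."""

    def __init__(self, sentences):
        self.unigrams = defaultdict(int)   # word -> count
        self.bigrams = defaultdict(int)    # (previous word, word) -> count
        self.contexts = defaultdict(int)   # previous word -> count as a context
        self.by_keys = defaultdict(set)    # digit sequence -> candidate words
        for sentence in sentences:
            prev = "<s>"  # sentence-start marker
            for word in sentence.lower().split():
                self.unigrams[word] += 1
                self.bigrams[(prev, word)] += 1
                self.contexts[prev] += 1
                self.by_keys[word_to_keys(word)].add(word)
                prev = word

    def prob(self, prev, word):
        """Add-one smoothed estimate of P(word | prev)."""
        vocab = len(self.unigrams)
        seen = self.bigrams.get((prev, word), 0)
        return (seen + 1) / (self.contexts.get(prev, 0) + vocab)

    def decode(self, keys, prev="<s>"):
        """Return the most probable known word for `keys`, or None if unknown."""
        candidates = self.by_keys.get(keys)
        if not candidates:
            return None
        # Ties fall back to unigram count, then to alphabetical order.
        return max(
            candidates,
            key=lambda w: (self.prob(prev, w), self.unigrams.get(w, 0), w),
        )
```

    For example, "home" and "good" both map to the key sequence 4663. A model trained on the two sentences "i will call home" and "he is a good man" resolves 4663 to "home" after "call" and to "good" after "a".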

    Executive reporting
    63.
    Granted patent (in force)

    Publication No.: US08239227B2

    Publication date: 2012-08-07

    Application No.: US11874151

    Filing date: 2007-10-17

    IPC classification: G06Q40/00

    Abstract: Generating an executive report of business or personal activity is described herein. By way of example, such an executive report can identify a change and its related cause with respect to a prior report. As a particular example, an inference engine can receive an activity report and reference prior reports to identify the change and related cause. A set of results containing such information can be provided to a synthesis component that can include and highlight such information in the executive report. In addition, further sources of data can be referenced in order to customize the report for a particular individual, organization, culture, or the like. As described, aspects of the subject innovation can provide an executive report highlighting important aspects of data and tailoring those aspects to interests of one or more users.

    Spam filtration utilizing sender activity data
    64.
    Granted patent (in force)

    Publication No.: US08224905B2

    Publication date: 2012-07-17

    Application No.: US11567632

    Filing date: 2006-12-06

    IPC classification: G06F15/16

    CPC classification: G06Q10/107

    Abstract: Spam is identified by computing sender reputation derived from historical activity data across counts for various categories. A spam filter or machine learning system can be trained utilizing pre-categorized data in conjunction with activity data associated with a sender aggregated across at least one time period. This sender activity filter can be employed alone or in combination with other filters to facilitate classification of messages as spam or non-spam.
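
    Illustrative sketch (not taken from the patent): the abstract describes aggregating per-sender activity counts over a time period and deriving a reputation from them. The Python below shows one simple way such an aggregation could look. The event tuple layout, the category names "spam" and "ham", and the smoothing constants are all assumptions made for illustration; the patent's actual categories and trained classifier are not reproduced.

```python
from collections import Counter, defaultdict


def aggregate_activity(events, start, end):
    """Count events per sender and category within the window [start, end).

    `events` is an iterable of (sender, timestamp, category) tuples;
    this layout is an assumption for the sketch.
    """
    counts = defaultdict(Counter)
    for sender, timestamp, category in events:
        if start <= timestamp < end:
            counts[sender][category] += 1
    return counts


def spam_score(sender_counts, prior_spam=1.0, prior_total=2.0):
    """Smoothed fraction of a sender's labeled messages that were spam.

    The priors pull senders with little history toward 0.5, so one
    message is not enough to condemn or clear a sender.
    """
    spam = sender_counts["spam"]
    total = spam + sender_counts["ham"]
    return (spam + prior_spam) / (total + prior_total)
```

    A score like this could be used on its own with a threshold, or passed as one feature among others to a trained filter, which matches the abstract's "alone or in combination with other filters".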

    Exponential priors for maximum entropy models
    65.
    Granted patent (in force)

    Publication No.: US07483813B2

    Publication date: 2009-01-27

    Application No.: US11550908

    Filing date: 2006-10-19

    Applicant: Joshua T. Goodman

    Inventor: Joshua T. Goodman

    IPC classification: G06F15/00 G06F11/30

    CPC classification: G06K9/6217 G06N99/005

    Abstract: The subject invention provides for systems and methods that facilitate optimizing one or more sets of training data by utilizing an Exponential distribution as the prior on one or more parameters in connection with a maximum entropy (maxent) model to mitigate overfitting. Maxent is also known as logistic regression. More specifically, the systems and methods can facilitate optimizing probabilities that are assigned to the training data for later use in machine learning processes, for example. In practice, training data can be assigned their respective weights and then a probability distribution can be assigned to those weights.
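
    Illustrative sketch (not taken from the patent): an Exponential prior with rate alpha on a non-negative weight adds a penalty of alpha times the weight to the negative log-likelihood, so the weight is both shrunk and constrained to stay at or above zero. The Python below applies that idea to a toy binary logistic regression trained by projected gradient ascent. The function name, learning rate, step count, and the restriction to a binary model without a bias term are assumptions for illustration, not the patent's training procedure.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def train_maxent_exp_prior(data, n_features, alpha=0.5, lr=0.1, steps=2000):
    """Fit non-negative weights maximizing log-likelihood - alpha * sum(w).

    `data` is a list of (feature_vector, label) pairs with labels 0 or 1.
    """
    w = [0.0] * n_features
    for _ in range(steps):
        grad = [0.0] * n_features
        for x, y in data:
            p = sigmoid(sum(wj * xj for wj, xj in zip(w, x)))
            for j, xj in enumerate(x):
                # Observed minus expected feature count.
                grad[j] += (y - p) * xj
        # The prior contributes a constant -alpha to each gradient;
        # clipping at zero enforces the support of the Exponential prior.
        w = [max(0.0, wj + lr * (gj - alpha)) for wj, gj in zip(w, grad)]
    return w
```

    In this toy setting, a weight that ends up positive sits where the observed feature count exceeds the expected count by exactly alpha, and a feature whose evidence cannot overcome alpha keeps a weight of exactly zero.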

    EXTENSIBLE EMAIL
    66.
    Patent application (under examination, published)

    Publication No.: US20080022097A1

    Publication date: 2008-01-24

    Application No.: US11424379

    Filing date: 2006-06-15

    IPC classification: H04L9/00

    CPC classification: G06Q10/109 G06Q10/107

    Abstract: A computer-implemented method and system for obtaining data is provided. In the method, to obtain data pertaining to another party, a request for an authentication key is made. Upon receiving the requested authentication key in an email, the method and system automatically send the authentication key as part of an HTTP, HTTPS or SMTP request for data. Then, in response to the request for data containing the authentication key, the requested data is received.

    Targeted advertising in brick-and-mortar establishments
    67.
    Granted patent (in force)

    Publication No.: US08725567B2

    Publication date: 2014-05-13

    Application No.: US11427761

    Filing date: 2006-06-29

    IPC classification: G06Q30/00

    Abstract: Architecture for presenting advertisements in real time in retail establishments. A sensor component includes sensors for collecting information about a customer or group of customers as they move through the store. The sensors can include capability for image processing, audio processing, light sensing, velocity sensing, direction sensing, proximity sensing, face recognition, pose recognition, transaction recognition, and biometric sensing, for example. A customer component analyzes the information and generates a profile about the customer. Advertisements are selected for presentation that target the customers as they walk in proximity of a presentation system of the store. An advertisement component facilitates dynamic presentation of a targeted advertisement to the individual as a function of the profile. The customer component can infer information during analysis using machine learning and reasoning.

    Web document keyword and phrase extraction
    68.
    Granted patent (in force)

    Publication No.: US08135728B2

    Publication date: 2012-03-13

    Application No.: US11619230

    Filing date: 2007-01-03

    IPC classification: G06F7/00 G06F17/30 G06F13/14

    Abstract: Extraction analysis techniques biased, in part, by query frequency information from a query log file and/or search engine cache are employed along with machine learning processes to determine candidate keywords and/or phrases of web documents. Web-oriented features associated with the candidate keywords and/or phrases are also utilized to analyze the web documents. A keyword and/or phrase extraction mechanism can be utilized to score keywords and/or phrases in a web document and estimate a likelihood that the keywords and/or phrases are relevant, for example, in an advertising system and the like.
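
    Illustrative sketch (not taken from the patent): the abstract combines query-log frequency with web-oriented features to estimate how likely a candidate phrase is to be relevant. The Python below shows one minimal way to score candidates with a logistic function over three features. The feature set, the `weights` dictionary, and the function names are assumptions; in a real system the weights would be learned from labeled data rather than set by hand.

```python
import math
import re
from collections import Counter


def candidate_phrases(text, max_len=2):
    """Yield every word n-gram of length 1..max_len as a candidate phrase."""
    words = re.findall(r"[a-z0-9]+", text.lower())
    for n in range(1, max_len + 1):
        for i in range(len(words) - n + 1):
            yield " ".join(words[i:i + n])


def score_phrases(title, body, query_counts, weights):
    """Return {phrase: estimated relevance in (0, 1)} for one document.

    `query_counts` maps a phrase to its frequency in a query log;
    `weights` holds a "bias" plus one weight per feature name.
    """
    title_phrases = set(candidate_phrases(title))
    doc_counts = Counter(candidate_phrases(title)) + Counter(candidate_phrases(body))
    scores = {}
    for phrase, tf in doc_counts.items():
        features = {
            "tf": math.log1p(tf),                                  # in-document frequency
            "in_title": 1.0 if phrase in title_phrases else 0.0,   # web-oriented feature
            "query_freq": math.log1p(query_counts.get(phrase, 0)), # query-log bias
        }
        z = weights["bias"] + sum(weights[k] * v for k, v in features.items())
        scores[phrase] = 1.0 / (1.0 + math.exp(-z))
    return scores
```

    The query-frequency feature is what lets a phrase that users actually search for outrank a phrase that is merely frequent on the page.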
