System and method for approximating probabilities using a decision tree
    1.
    发明授权
    System and method for approximating probabilities using a decision tree 有权
    使用决策树近似概率的系统和方法

    公开(公告)号:US06718315B1

    公开(公告)日:2004-04-06

    申请号:US09740067

    申请日:2000-12-18

    IPC分类号: G06F1518

    CPC分类号: G06N99/005

    摘要: Disclosed is a system for approximating conditional probabilities using an annotated decision tree where predictor values that did not exist in training data for the system are tracked, stored, and referenced to determine if statistical aggregation should be invoked. Further disclosed is a system for storing statistics for deriving a non-leaf probability corresponding to predictor values, and a system for aggregating such statistics to approximate conditional probabilities.

    摘要翻译: 公开了一种使用注释决策树近似条件概率的系统,其中跟踪,存储和引用系统的训练数据中不存在的预测值,以确定是否应调用统计聚合。 进一步披露的是用于存储用于导出与预测值相对应的非叶概率的统计的系统,以及用于将这种统计量聚合以近似条件概率的系统。

    Trees of classifiers for detecting email spam
    2.
    发明授权
    Trees of classifiers for detecting email spam 有权
    用于检测电子邮件垃圾邮件的分类树

    公开(公告)号:US07930353B2

    公开(公告)日:2011-04-19

    申请号:US11193691

    申请日:2005-07-29

    IPC分类号: G06F15/16

    CPC分类号: H04L51/12

    摘要: Decision trees populated with classifier models are leveraged to provide enhanced spam detection utilizing separate email classifiers for each feature of an email. This provides a higher probability of spam detection through tailoring of each classifier model to facilitate in more accurately determining spam on a feature-by-feature basis. Classifiers can be constructed based on linear models such as, for example, logistic-regression models and/or support vector machines (SVM) and the like. The classifiers can also be constructed based on decision trees. “Compound features” based on internal and/or external nodes of a decision tree can be utilized to provide linear classifier models as well. Smoothing of the spam detection results can be achieved by utilizing classifier models from other nodes within the decision tree if training data is sparse. This forms a base model for branches of a decision tree that may not have received substantial training data.

    摘要翻译: 利用分类器模型填充的决策树利用电子邮件的每个功能使用单独的电子邮件分类器来提供增强的垃圾邮件检测。 这通过定制每个分类器模型提供了更高的垃圾邮件检测的概率,以便于在逐个特征的基础上更准确地确定垃圾邮件。 分类器可以基于诸如逻辑回归模型和/或支持向量机(SVM)等线性模型来构建。 分类器也可以基于决策树构建。 基于决策树的内部和/或外部节点的“复合特征”也可以用于提供线性分类器模型。 垃圾邮件检测结果的平滑可以通过使用来自决策树内的其他节点的分类器模型来实现,如果训练数据是稀疏的。 这形成了可能没有接收到大量训练数据的决策树的分支的基本模型。

    Systems and methods for new time series model probabilistic ARMA
    3.
    发明授权
    Systems and methods for new time series model probabilistic ARMA 有权
    新时间序列模型概率ARMA的系统和方法

    公开(公告)号:US07580813B2

    公开(公告)日:2009-08-25

    申请号:US10463145

    申请日:2003-06-17

    IPC分类号: G06F17/50 G05B23/02

    CPC分类号: G06F17/18

    摘要: The present invention utilizes a cross-prediction scheme to predict values of discrete and continuous time observation data, wherein conditional variance of each continuous time tube variable is fixed to a small positive value. By allowing cross-predictions in an ARMA based model, values of continuous and discrete observations in a time series are accurately predicted. The present invention accomplishes this by extending an ARMA model such that a first time series “tube” is utilized to facilitate or “cross-predict” values in a second time series tube to form an “ARMAxp” model. In general, in the ARMAxp model, the distribution of each continuous variable is a decision graph having splits only on discrete variables and having linear regressions with continuous regressors at all leaves, and the distribution of each discrete variable is a decision graph having splits only on discrete variables and having additional distributions at all leaves.

    摘要翻译: 本发明利用交叉预测方案来预测离散和连续时间观测数据的值,其中每个连续时间管变量的条件方差固定为小的正值。 通过在基于ARMA的模型中允许交叉预测,可以准确预测时间序列中连续和离散观测值。 本发明通过扩展ARMA模型来实现这一目的,使得第一时间序列“管”用于促进或“交叉预测”第二时间序列管中的值以形成“ARMAxp”模型。 一般来说,在ARMAxp模型中,每个连续变量的分布是仅在离散变量上分裂并具有在所有叶上具有连续回归的线性回归的决策图,并且每个离散变量的分布是仅分解为 离散变量,并在所有叶子上具有额外的分布。

    SOCIAL REWARDS FOR ONLINE GAME PLAYING
    5.
    发明申请
    SOCIAL REWARDS FOR ONLINE GAME PLAYING 有权
    在线游戏玩的社会奖励

    公开(公告)号:US20080153595A1

    公开(公告)日:2008-06-26

    申请号:US11614588

    申请日:2006-12-21

    IPC分类号: A63F9/24

    摘要: Useful information is acquired from a community of individuals by way of a game that rewards participants with social information about other participants. Points can be awarded to participants simply for participation and/or as a function of game performance. Such points can subsequently be exchanged to reveal information about game partners or other community members. Among other things, such a reward system can motivate individuals to perform tasks that might not otherwise be compelling and/or enjoyable.

    摘要翻译: 有用的信息是通过游戏方式从个人社区获取的,该游戏会奖励参与者有关其他参与者的社交信息。 点数可以仅授予参与者参与和/或作为游戏演出的功能。 随后可以交换这些点以揭示关于游戏伙伴或其他社区成员的信息。 除此之外,这种奖励制度可以激励个人执行可能无法强制和/或愉快的任务。

    USER INTERACTION-BIASED ADVERTISING
    6.
    发明申请
    USER INTERACTION-BIASED ADVERTISING 审中-公开
    用户互动偏好广告

    公开(公告)号:US20080114639A1

    公开(公告)日:2008-05-15

    申请号:US11559992

    申请日:2006-11-15

    IPC分类号: G06Q30/00 G06F17/40

    摘要: On-line and/or off-line advertisement interactions are tracked for individual users. This information can then be utilized to adjust display parameters for an advertisement. Tracking can be accomplished via a client-side tracking mechanism and/or a server side tracking mechanism. The advertisement interactions allow advertisers to adjust their advertising campaigns to better target their advertisements. The tracked interactions can include, but are not limited to selections (clicking, etc.) and/or conversions (purchases) and the like. Some instances include a display component that can employ the user-specific interaction information to automatically adjust, for example, location, frequency, and/or to whom an advertisement is displayed. The interaction information can also be utilized for revenue generation by charging advertisers for the information and/or for adjusting their advertising campaigns and the like. Instances can be utilized with on-line and/or off-line advertising media.

    摘要翻译: 为个人用户追踪在线和/或离线广告交互。 然后可以利用该信息来调整广告的显示参数。 跟踪可以通过客户端跟踪机制和/或服务器端跟踪机制来实现。 广告互动允许广告客户调整他们的广告活动,以更好地定位他们的广告。 跟踪的交互可以包括但不限于选择(点击等)和/或转换(购买)等。 一些实例包括可以使用用户特定交互信息来自动调整例如位置,频率和/或广告被显示给谁的显示组件。 交互信息还可以通过向广告商收取信息和/或调整其广告活动等来用于创收。 实例可以与在线和/或离线广告媒体一起使用。

    SEARCH QUERY MONETIZATION-BASED RANKING AND FILTERING
    7.
    发明申请
    SEARCH QUERY MONETIZATION-BASED RANKING AND FILTERING 审中-公开
    搜索查询基于功能的排序和筛选

    公开(公告)号:US20080033797A1

    公开(公告)日:2008-02-07

    申请号:US11461552

    申请日:2006-08-01

    IPC分类号: G06Q30/00

    摘要: Advertiser monetization information is utilized to determine a search query monetization value that can be employed in web-search ranking to facilitate in ranking search results and/or in email spam filtering to reduce unsolicited emails and the like. Various methods can be employed to filter and/or rank and the like based on the search query monetization value. This can include biasing based on high values and/or low values. The search query monetization value can be determined based on, for example, independent phrases and/or bids. In other instances, personal user advertising interactions can be employed as well to facilitate search result ranking and/or email spam filtering. Employment of search query monetization value techniques can substantially reduce various types of subversive/undesired information.

    摘要翻译: 广告商获利信息用于确定可以在网页搜索排名中使用的搜索查询营利价值,以便于排名搜索结果和/或电子邮件垃圾邮件过滤以减少未经请求的电子邮件等。 可以使用各种方法来基于搜索查询营利值来过滤和/或排名等。 这可以包括基于高值和/或低值的偏置。 可以基于例如独立短语和/或出价来确定搜索查询营利值。 在其他情况下,也可以使用个人用户广告交互来促进搜索结果排名和/或邮件垃圾邮件过滤。 采用搜索查询营利价值技术可以大大减少各种类型的颠覆性/不需要的信息。

    MANAGING COMMITMENTS OF TIME ACROSS A NETWORK
    8.
    发明申请
    MANAGING COMMITMENTS OF TIME ACROSS A NETWORK 审中-公开
    管理网络时间的承诺

    公开(公告)号:US20080010124A1

    公开(公告)日:2008-01-10

    申请号:US11426679

    申请日:2006-06-27

    IPC分类号: G06Q30/00

    摘要: A service manager manages connection tokens in a network of users. The connection token has a plurality of defined terms and can be representative of a commitment of time for a user in the network. Connection tokens can be used to engage in a real-time communication with another user in exchange for a fee. The service manager manages possession of the connection tokens amongst the users of the network and executes the connection token in accordance with the defined terms. Additionally, the service manager can facilitate real-time communication among users based on the connection tokens.

    摘要翻译: 服务管理器管理用户网络中的连接令牌。 连接令牌具有多个定义的术语,并且可以代表网络中用户对时间的承诺。 连接令牌可用于与其他用户进行实时通信,以交换费用。 服务管理器管理在网络的用户中拥有连接令牌,并根据定义的术语执行连接令牌。 此外,服务管理器可以促进基于连接令牌的用户之间的实时通信。

    MANAGING INFORMATION SOLICITATIONS ACROSS A NETWORK
    9.
    发明申请
    MANAGING INFORMATION SOLICITATIONS ACROSS A NETWORK 审中-公开
    通过网络管理信息索引

    公开(公告)号:US20080005011A1

    公开(公告)日:2008-01-03

    申请号:US11424120

    申请日:2006-06-14

    IPC分类号: G06Q40/00

    CPC分类号: G06Q30/08 G06Q40/04

    摘要: A service manager manages information solicitations in a network of users. An information solicitation is posted that is received from an information consumer. The posted information solicitation is provided to at least a portion of the users of the network for auction. The information solicitation includes a request to engage in a real-time communication with an information provider about a particular subject. Bids are received from a plurality of information providers. The bids are provided to the information consumer for selection. The information consumer is connected with a selected one of the plurality of information providers.

    摘要翻译: 服务管理器管理用户网络中的信息请求。 从信息消费者那里收到信息请求。 发布的信息请求被提供给用于拍卖的网络的至少一部分用户。 信息征集包括与信息提供商就特定主题进行实时通信的请求。 从多个信息提供者接收出价。 出价提供给信息消费者进行选择。 所述信息使用者与所述多个信息提供者中选择的一个相关联。

    Apparatus and accompanying methods for visualizing clusters of data and hierarchical cluster classifications
    10.
    发明授权
    Apparatus and accompanying methods for visualizing clusters of data and hierarchical cluster classifications 有权
    用于可视化数据集群和分级集群分类的装置和相关方法

    公开(公告)号:US06742003B2

    公开(公告)日:2004-05-25

    申请号:US09845151

    申请日:2001-04-30

    IPC分类号: G06F1730

    摘要: A system that incorporates an interactive graphical user interface for visualizing clusters (categories) and segments (summarized clusters) of data. Specifically, the system automatically categorizes incoming case data into clusters, summarizes those clusters into segments, determines similarity measures for the segments, scores the selected segments through the similarity measures, and then forms and visually depicts hierarchical organizations of those selected clusters. The system also automatically and dynamically reduces, as necessary, a depth of the hierarchical organization, through elimination of unnecessary hierarchical levels and inter-nodal links, based on similarity measures of segments or segment groups. Attribute/value data that tends to meaningfully characterize each segment is also scored, rank ordered based on normalized scores, and then graphically displayed. The system permits a user to browse through the hierarchy, and, to readily comprehend segment inter-relationships, selectively expand and contract the displayed hierarchy, as desired, as well as to compare two selected segments or segment groups together and graphically display the results of that comparison. An alternative discriminant-based cluster scoring technique is also presented.

    摘要翻译: 一个包含交互式图形用户界面的系统,用于可视化数据的集群(类别)和分段(聚合集群)。 具体来说,系统将传入的病例数据自动分类为群集,将这些群集合成段,确定段的相似性度量,通过相似性度量对所选段进行分类,然后形成并可视地描绘这些群集的层次结构。 基于片段或段组的相似性度量,系统还可以根据需要自动和动态地减少层次组织的深度,通过消除不必要的层级和节点间链接。 倾向于对每个段进行有意义表征的属性/值数据也被划分,基于归一化分数进行排序,然后以图形方式显示。 该系统允许用户浏览层次结构,并且为了容易地理解分段相互关系,根据需要选择性地扩展和收缩所显示的层次结构,以及将两个选定的分段或分段组进行比较,并以图形方式显示 那个比较。 还提出了一种替代的基于判别式的聚类评分技术。