-
公开(公告)号:US20110313994A1
公开(公告)日:2011-12-22
申请号:US12818919
申请日:2010-06-18
申请人: Roy Varshavsky , Kfir Karmon , Daniel Sitton , Limor Lahiani , David Heckerman , Robert Davidson
发明人: Roy Varshavsky , Kfir Karmon , Daniel Sitton , Limor Lahiani , David Heckerman , Robert Davidson
IPC分类号: G06F17/30
CPC分类号: G06F16/9535
摘要: A particular method of content personalization based on user information includes receiving data representing an information retrieval task. The data is received at a server from a computing device associated with a user. The information retrieval task is executed to generate result information. Personalization information associated with the user that is relevant to the information retrieval task is retrieved. The personalization information associated with the user includes information associated with at least one of a genotype of the user and a phenotype of the user. The method includes modifying the result information based on the retrieved personalization information to generate personalized result information. The personalized result information is transmitted to the computing device associated with the user
摘要翻译: 基于用户信息的特定的内容个性化方法包括接收表示信息检索任务的数据。 在与用户相关联的计算设备的服务器处接收数据。 执行信息检索任务以生成结果信息。 检索与用户相关的与信息检索任务相关的个性化信息。 与用户相关联的个性化信息包括与用户的基因型和用户的表型中的至少一个相关联的信息。 该方法包括基于所检索的个性化信息来修改结果信息以生成个性化结果信息。 将个性化结果信息发送到与用户相关联的计算设备
-
公开(公告)号:US20070188901A1
公开(公告)日:2007-08-16
申请号:US11353382
申请日:2006-02-14
IPC分类号: G11B5/02
CPC分类号: G09B19/00
摘要: A unique recording system and method that facilitates recording live meetings, discussions or conversations whereby such recordings are available for immediate or near immediate playback is provided. As a result, a user who has momentarily become distracted or inattentive during the meeting can quickly re-listen to what was missed or misunderstood in order to readily catch up to the current discussion. The current discussion can continue to be recorded during playback of any previously recorded data. User behavior can be monitored to estimate when the user has started to become inattentive and likely segments or time points of the recordings can be suggested for playback. One or more portions of the recordings can be filtered or selected for playback so that any desired content can be eliminated or skipped in the playback version.
摘要翻译: 提供了一种独特的记录系统和方法,其便于记录实时会议,讨论或对话,从而可以立即或即时重放这样的记录。 结果,在会议中暂时变得分心或不注意的用户可以快速重新听取被遗漏或被误解的内容,以便能够赶上当前的讨论。 在播放任何先前记录的数据期间,可以继续记录当前的讨论。 可以监视用户行为,以估计用户何时开始变得不注意,并且可能建议录制的段或时间点进行播放。 可以对记录的一个或多个部分进行过滤或选择以进行重放,以便可以在回放版本中消除或跳过任何期望的内容。
-
公开(公告)号:US20070010966A1
公开(公告)日:2007-01-11
申请号:US11519317
申请日:2006-09-11
申请人: Pyungchul Kim , Zhaohui Tang , David Heckerman , Scott Oveson
发明人: Pyungchul Kim , Zhaohui Tang , David Heckerman , Scott Oveson
CPC分类号: G06F17/18 , G06F16/2465
摘要: Systems and methods are provided for producing displays of the accuracy of data mining or statistical models that produce associative predictions. For all cases in a testing data set, the model makes predictions and provides associated probabilities. The cases are sorted by their probability of making accurate predictions and a graph is made of the accuracy of the model over various subsets containing the highest probability cases as evaluated by the model. Where a number of probabilities are presented for the predictions in a basket of predictions, those probabilities are combined to yield a probability score for the entire basket. Additionally, the accuracy of a model over different basket sizes may be graphed. The accuracy graph may also be produced for any models making a prediction, by graphing the probability of making accurate predictions and a graph made of the accuracy of the model over various subsets of the data containing the highest probability cases.
摘要翻译: 提供系统和方法用于产生数据挖掘的准确性的显示或产生关联预测的统计模型。 对于测试数据集中的所有情况,模型进行预测并提供相关概率。 这些案例按照准确预测的概率进行排序,并且通过模型评估,对包含最高概率案例的各种子集进行模型的精度图。 在对一篮子预测中的预测提出若干概率的情况下,将这些概率组合起来以产生整个篮子的概率得分。 此外,可以绘制不同篮子尺寸的模型的精度。 也可以通过绘制准确预测的概率和通过包含最高概率情况的数据的各种子集对模型的精度进行绘制的图形来产生准确度图。
-
公开(公告)号:US20060160071A1
公开(公告)日:2006-07-20
申请号:US11324634
申请日:2005-12-30
申请人: David Heckerman , Simon Mallal , Carl Kadie , Corey Moore , Nebojsa Jojic
发明人: David Heckerman , Simon Mallal , Carl Kadie , Corey Moore , Nebojsa Jojic
CPC分类号: G06F19/22 , C12Q1/6883 , C12Q2537/165 , G06F19/00 , G06F19/14 , G06F19/18 , G06F19/24 , G16H50/20 , Y02A90/26
摘要: A system comprising a machine learning classifier trained on a plurality of associations between a host and a pathogen to predict a pathogen characteristic is described herein. The pathogen characteristic can relate to a disease state of the host. Computer-executable instructions for performing a method of forecasting a portion of a target molecule anticipated to influence an organism's condition also are described herein. The method comprises employing population data to automatically analyze one or more areas of the target molecule to determine the portion of the target molecule anticipated to influence the organism's condition. The population data can pertain to at least one relationship between at least one diverse organism trait and the target molecule. One or more epitopes forecast by employing the method also are contemplated.
摘要翻译: 这里描述了一种系统,其包括由主机和病原体之间的多个关联训练的机器学习分类器来预测病原体特征。 病原体特征可以与宿主的疾病状态有关。 本文还描述了用于执行预测影响生物体状况的目标分子的一部分的预测方法的计算机可执行指令。 该方法包括使用群体数据来自动分析目标分子的一个或多个区域以确定预期影响生物体状况的目标分子的部分。 人口数据可以涉及至少一种不同的生物性状和目标分子之间的至少一种关系。 也考虑了采用该方法预测的一个或多个表位。
-
公开(公告)号:US20060160070A1
公开(公告)日:2006-07-20
申请号:US11324467
申请日:2005-12-30
申请人: Simon Mallal , David Heckerman , Nebojsa Jojic , Vladimir Jojic , Christopher Meek , Corey Moore , Carl Kadie
发明人: Simon Mallal , David Heckerman , Nebojsa Jojic , Vladimir Jojic , Christopher Meek , Corey Moore , Carl Kadie
摘要: Systems that facilitate immunogen design are described herein. An optimization component is provided to determine an immunogen according to at least one criterion. The immunogen comprises a set of overlapping sequences comprising sequences that are known to be and/or are likely to be immunogenic. At least one of the sequences that are likely to be immunogenic can be determined by analyzing associations between a host and a pathogen at a population level. Methods of determining an epitome are described herein. A plurality of sequences are received. At least one of the sequences is predicted to be an epitope based on a relationship between a diverse trait of a population and a mutation of a pathogen. A collection of the plurality of sequences is optimized according to one or more criteria to determine the epitome. Epitomes and immunogens determined by the systems and methods described herein are also contemplated.
摘要翻译: 本文描述了促进免疫原设计的系统。 提供优化组件以根据至少一个标准确定免疫原。 免疫原包含一组重叠序列,其包含已知是和/或可能是免疫原性的序列。 可能通过在群体水平上分析宿主和病原体之间的关联来确定可能是免疫原性的序列中的至少一个。 本文描述了确定缩影的方法。 接收多个序列。 基于群体的不同性状和病原体的突变之间的关系,至少有一个序列被预测为表位。 根据一个或多个标准来优化多个序列的集合以确定缩写。 还考虑了通过本文所述的系统和方法确定的病原体和免疫原。
-
公开(公告)号:US20060106560A1
公开(公告)日:2006-05-18
申请号:US11299539
申请日:2005-12-12
申请人: Allan Folting , Bo Thiesson , David Heckerman , David Chickering , Eric Vigesaa
发明人: Allan Folting , Bo Thiesson , David Heckerman , David Chickering , Eric Vigesaa
IPC分类号: G06F19/00
CPC分类号: G06F17/30592 , G06N7/00 , Y10S707/957 , Y10S707/958 , Y10S707/99943
摘要: The present invention leverages curve fitting data techniques to provide automatic detection of data anomalies in a “data tube” from a data perspective, allowing, for example, detection of data anomalies such as on-screen, drill down, and drill across data anomalies in, for example, pivot tables and/or OLAP cubes. It determines if data substantially deviates from a predicted value established by a curve fitting process such as, for example, a piece-wise linear function applied to the data tube. A threshold value can also be employed by the present invention to facilitate in determining a degree of deviation necessary before a data value is considered anomalous. The threshold value can be supplied dynamically and/or statically by a system and/or a user via a user interface. Additionally, the present invention provides an indication to a user of the type and location of a detected anomaly from a top level data perspective.
-
公开(公告)号:US20050041027A1
公开(公告)日:2005-02-24
申请号:US10954743
申请日:2004-09-30
申请人: David Chickering , Zhaohui Tang , David Heckerman , Robert Rounthwaite , Alexei Bocharov , Scott Oveson
发明人: David Chickering , Zhaohui Tang , David Heckerman , Robert Rounthwaite , Alexei Bocharov , Scott Oveson
CPC分类号: G06T11/206 , G06F17/30256 , G06F17/30259 , G06F17/30265 , Y10S707/917 , Y10S707/99945
摘要: Distribution displays for categories are provided which illuminate the distribution of continuous attributes over all cases in a category, and which provide a histogram of the population of the different states of categorical attributes. An array of such displays by attribute (in one dimension) and category (in another dimension) may be provided. Category diagram displays are also provided for visualizing the different categories, and their distributions, populations, and similarities. These are displayed through different shading of nodes and edges representing categories and the relationship between two categories, and through proximity of nodes.
摘要翻译: 提供了类别的分布显示,其显示了类别中所有情况下的连续属性的分布,并且提供了分类属性的不同状态的总体的直方图。 可以提供由属性(在一个维度)和类别(在另一维度中)的这种显示器的数组。 还提供类别图显示,用于可视化不同类别及其分布,人口和相似之处。 这些通过不同的节点和边缘的阴影显示,表示类别和两个类别之间的关系,以及通过节点的接近。
-
公开(公告)号:US20120159620A1
公开(公告)日:2012-06-21
申请号:US13159978
申请日:2011-06-14
申请人: Christian Seifert , Jack Stokes , Long Lu , David Heckerman , Christina Colcernian , Sasi Parthasarathy , Navaneethan Santhanam
发明人: Christian Seifert , Jack Stokes , Long Lu , David Heckerman , Christina Colcernian , Sasi Parthasarathy , Navaneethan Santhanam
IPC分类号: G06F21/00
CPC分类号: H04L63/1483 , H04L63/1416 , H04L63/168
摘要: A machine-implemented method for detecting scareware includes the steps of accessing one or more landing pages to be evaluated, extracting one or more features from the landing pages, and providing a classifier to compare the features extracted from the landing pages with features of known scareware and non-scareware pages. The classifier determines a likelihood that the landing page is scareware. If determined to be scareware, the landing page is removed from search results generated by a search engine. The features can be URLs, text, image interest points, image descriptors, a number of pop-ups generated, IP addresses, hostnames, domain names, text derived from images, images, metadata, identifiers of executables, and combinations thereof.
摘要翻译: 用于检测scareware的机器实现的方法包括访问要评估的一个或多个着陆页,从着陆页提取一个或多个特征并提供分类器以将从着陆页提取的特征与已知scareware的特征进行比较的步骤 和非scareware页面。 分类器确定着陆页是scareware的可能性。 如果确定是scareware,则着陆页将从搜索引擎生成的搜索结果中删除。 特征可以是URL,文本,图像兴趣点,图像描述符,生成的弹出窗口的数量,IP地址,主机名,域名,从图像,图像,元数据,可执行文件的标识符以及它们的组合导出的文本。
-
公开(公告)号:US20070112597A1
公开(公告)日:2007-05-17
申请号:US11556069
申请日:2006-11-02
申请人: David Heckerman , Craig Mundie , Nebojsa Jojic , Randy Hinrichs
发明人: David Heckerman , Craig Mundie , Nebojsa Jojic , Randy Hinrichs
CPC分类号: G06Q30/0273 , G06N20/00 , G06Q30/0201 , G06Q30/0203 , G06Q30/0207 , G06Q30/0256 , G06Q30/0277 , G06Q50/22
摘要: The subject matter described herein facilitates monetizing a database of unclean health-related data collected on a large-scale and pertaining to a non-selected population. At least one pattern can be automatically ascertained from the unclean health-related data at least in part by applying a statistical technique, a data mining technique and/or a machine-learning technique to the database. The use of the database can be tracked and fees determined accordingly.
摘要翻译: 本文描述的主题有助于通过大规模收集的和与非选择群体有关的不洁净的健康相关数据的数据库进行货币化。 至少一部分模式可以至少部分地通过对数据库应用统计技术,数据挖掘技术和/或机器学习技术从不洁净的健康相关数据自动确定。 可以跟踪使用数据库,并相应地确定费用。
-
公开(公告)号:US20060112190A1
公开(公告)日:2006-05-25
申请号:US11324960
申请日:2006-01-03
IPC分类号: G06F15/173
CPC分类号: G06F17/30687 , G06F17/30536 , G06N7/005 , G06Q30/00
摘要: A dependency network is created from a training data set utilizing a scalable method. A statistical model (or pattern), such as for example a Bayesian network, is then constructed to allow more convenient inferencing. The model (or pattern) is employed in lieu of the training data set for data access. The computational complexity of the method that produces the model (or pattern) is independent of the size of the original data set. The dependency network directly returns explicitly encoded data in the conditional probability distributions of the dependency network. Non-explicitly encoded data is generated via Gibbs sampling, approximated, or ignored.
摘要翻译: 从使用可伸缩方法的训练数据集创建依赖网络。 然后构建统计模型(或模式),例如贝叶斯网络,以允许更方便的推论。 采用模型(或模式)代替用于数据访问的训练数据集。 产生模型(或模式)的方法的计算复杂度与原始数据集的大小无关。 依赖网络直接在依赖网络的条件概率分布中返回显式编码的数据。 通过Gibbs采样,近似或忽略来生成非显式编码数据。
-
-
-
-
-
-
-
-
-