Efficient development of a rule-based system using crowd-sourcing
    2.
    发明授权
    Efficient development of a rule-based system using crowd-sourcing 有权
    使用群众采购高效地开发基于规则的系统

    公开(公告)号:US08949204B2

    公开(公告)日:2015-02-03

    申请号:US13597589

    申请日:2012-08-29

    IPC分类号: G06F17/00 G06F17/30

    摘要: Described herein are methods, systems, apparatuses and products for efficient development of a rule-based system. An aspect provides a method including accessing data records; converting said data records to an intermediate form; utilizing intermediate forms to compute similarity scores for said data records; and selecting as an example to be provided for rule making at least one record of said data records having a maximum dissimilarity score indicative of dissimilarity to already considered examples.

    摘要翻译: 这里描述了用于有效开发基于规则的系统的方法,系统,设备和产品。 一方面提供了一种包括访问数据记录的方法; 将所述数据记录转换成中间形式; 利用中间形式来计算所述数据记录的相似度分数; 并且选择为规则提供用于规则制作所述数据记录的至少一个记录,其具有指示与已经考虑的示例的不相似性的最大不相似性分数。

    Technique for searching for keywords determining event occurrence
    3.
    发明授权
    Technique for searching for keywords determining event occurrence 失效
    搜索确定事件发生的关键技术

    公开(公告)号:US08005829B2

    公开(公告)日:2011-08-23

    申请号:US12044378

    申请日:2008-03-07

    IPC分类号: G06F7/00 G06F17/30 G06Q30/00

    CPC分类号: G06F17/30985 G06Q30/0256

    摘要: A keyword search system including a text input unit for inputting subtexts obtained by dividing each text into parts, while associating the subtexts with an event through a process recorded in the text; a prediction device adjuster for adjusting a corresponding event prediction device to maximize the percentage of text in which the inputted event is identical to a prediction result in a first text group selected from the subtexts; a prediction processor for generating a prediction result for each section, by inputting each text in a second text group selected from the corresponding subtexts in the adjusted event prediction device; and a search unit for calculating the prediction precision for the second text group of the event prediction device using a comparison between the inputted event and the prediction result for each subtext, and searching for keywords in sections with a certain degree of prediction precision.

    摘要翻译: 一种关键词搜索系统,包括:文本输入单元,用于输入通过将每个文本分割成多个部分而获得的子文本,同时通过文本中记录的处理将文本与事件相关联; 预测装置调整器,用于调整相应的事件预测装置,以使输入的事件与从所述子文件中选择的第一文本组中的预测结果相同的文本的百分比最大化; 一个预测处理器,用于通过在从被调整的事件预测装置中的相应子文件中选出的第二文本组中输入每个文本来产生每个部分的预测结果; 以及搜索单元,用于使用所输入的事件和每个子文本的预测结果之间的比较来计算事件预测装置的第二文本组的预测精度,并且以一定程度的预测精度搜索关键字。

    RESOURCES MANAGEMENT IN DISTRIBUTED COMPUTING ENVIRONMENT
    4.
    发明申请
    RESOURCES MANAGEMENT IN DISTRIBUTED COMPUTING ENVIRONMENT 有权
    分布式计算环境中的资源管理

    公开(公告)号:US20110191781A1

    公开(公告)日:2011-08-04

    申请号:US12697228

    申请日:2010-01-30

    IPC分类号: G06F9/50

    CPC分类号: G06F9/50

    摘要: A method, system and a computer program product for determining resources allocation in a distributed computing environment. An embodiment may include identifying resources in a distributed computing environment, computing provisioning parameters, computing configuration parameters and quantifying service parameters in response to a set of service level agreements (SLA). The embodiment may further include iteratively computing a completion time required for completion of the assigned task and a cost. Embodiments may further include computing an optimal resources configuration and computing at least one of an optimal completion time and an optimal cost corresponding to the optimal resources configuration. Embodiments may further include dynamically modifying the optimal resources configuration in response to at least one change in at least one of provisioning parameters, computing parameters and quantifying service parameters.

    摘要翻译: 一种用于在分布式计算环境中确定资源分配的方法,系统和计算机程序产品。 一个实施例可以包括在分布式计算环境中识别资源,计算供应参数,计算配置参数和响应一组服务水平协议(SLA)量化服务参数。 该实施例还可以包括迭代地计算完成分配的任务所需的完成时间和成本。 实施例还可以包括计算最佳资源配置并计算与最佳资源配置相对应的最佳完成时间和最优成本中的至少一个。 实施例还可以包括响应于供应参数,计算参数和量化服务参数中的至少一个的至少一个变化来动态地修改最佳资源配置。

    Method for segmenting communication transcripts using unsupervised and semi-supervised techniques
    5.
    发明授权
    Method for segmenting communication transcripts using unsupervised and semi-supervised techniques 有权
    使用无监督和半监督技术分割沟通成绩单的方法

    公开(公告)号:US07912714B2

    公开(公告)日:2011-03-22

    申请号:US12060469

    申请日:2008-04-01

    IPC分类号: G10L15/06

    CPC分类号: G06F17/3071 G10L15/04

    摘要: A method is provided for forming discrete segment clusters of one or more sequential sentences from a corpus of communication transcripts of transactional communications that comprises dividing the communication transcripts of the corpus into a first set of sentences spoken by a caller and a second set of sentences spoken by a responder; generating a set of sentence clusters by grouping the first and second sets of sentences according to a measure of lexical similarity using an unsupervised partitional clustering method; generating a collection of sequences of sentence types by assigning a distinct sentence type to each sentence cluster and representing each sentence of each communication transcript of the corpus with the sentence type assigned to the sentence cluster into which the sentence is grouped; and generating a specified number of discrete segment clusters by successively merging sentence clusters according to a proximity-based measure between the sentence types assigned to the sentence clusters within sequences of the collection.

    摘要翻译: 提供了一种用于从事务通信的通信转录语料库形成一个或多个顺序句子的离散段聚类的方法,其包括将语料库的通信记录分成由呼叫者说出的第一组句子和第二组句子 由答复者 通过使用无监督分数聚类方法,根据词汇相似度的度量,对第一和第二组句子进行分组,从而生成一组句子群; 通过为每个句子集分配不同的句子类型并以分配给句子分组的句子集合的句子类型表示语料库的每个通信录音的每个句子来生成句子序列的集合; 以及通过根据在集合的序列内分配给句子集群的句子类型之间的基于邻近度的度量连续地合并语句集群来生成指定数量的离散分段集群。

    Method and apparatus for determining decision points for streaming conversational data
    6.
    发明授权
    Method and apparatus for determining decision points for streaming conversational data 有权
    用于确定流对话数据的决策点的方法和装置

    公开(公告)号:US07904399B2

    公开(公告)日:2011-03-08

    申请号:US11940551

    申请日:2007-11-15

    IPC分类号: G06E1/00

    CPC分类号: G06F17/279 G10L15/1822

    摘要: A method for determining a decision point in real-time for a data stream from a conversation includes receiving streaming conversational data; and determining when to classify the streaming conversational data, using a measure of certainty, by performing certainty calculations at a plurality of time instances during the conversation and by selecting a decision point in response to the certainty calculations, the decision point not being based on a fixed window of conversational data but being based on accumulated conversational data available at different ones of the plurality of time instances. Systems and computer program products are also provided.

    摘要翻译: 一种用于从对话确定数据流的实时决策点的方法包括接收流对话数据; 以及通过在对话期间在多个时间实例进行确定性计算以及响应于所述确定性计算选择一个决定点来确定何时对所述流对话数据进行分类,使用确定性的度量,所述决策点不是基于 固定的对话数据窗口,但是基于在多个时间实例中的不同时间可用的累积会话数据。 还提供系统和计算机程序产品。

    Method and Apparatus for Determining Decision Points for Streaming Conversational Data
    7.
    发明申请
    Method and Apparatus for Determining Decision Points for Streaming Conversational Data 有权
    用于确定流对话数据的决策点的方法和装置

    公开(公告)号:US20090132442A1

    公开(公告)日:2009-05-21

    申请号:US11940551

    申请日:2007-11-15

    IPC分类号: G06F15/18

    CPC分类号: G06F17/279 G10L15/1822

    摘要: A method for determining a decision point in real-time for a data stream from a conversation includes receiving streaming conversational data; and determining when to classify the streaming conversational data, using a measure of certainty, by performing certainty calculations at a plurality of time instances during the conversation and by selecting a decision point in response to the certainty calculations, the decision point not being based on a fixed window of conversational data but being based on accumulated conversational data available at different ones of the plurality of time instances. Systems and computer program products are also provided.

    摘要翻译: 一种用于从对话确定数据流的实时决策点的方法包括接收流对话数据; 以及通过在对话期间在多个时间实例进行确定性计算以及响应于所述确定性计算选择一个决定点来确定何时对所述流对话数据进行分类,使用确定性的度量,所述决策点不是基于 固定的对话数据窗口,但是基于在多个时间实例中的不同时间可用的累积会话数据。 还提供系统和计算机程序产品。

    TECHNIQUE FOR SEARCHING FOR KEYWORDS DETERMINING EVENT OCCURRENCE
    8.
    发明申请
    TECHNIQUE FOR SEARCHING FOR KEYWORDS DETERMINING EVENT OCCURRENCE 失效
    搜索关键词确定事件发生的技巧

    公开(公告)号:US20080256063A1

    公开(公告)日:2008-10-16

    申请号:US12044378

    申请日:2008-03-07

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30985 G06Q30/0256

    摘要: A keyword search system including a text input unit for inputting subtexts obtained by dividing each text into parts, while associating the subtexts with an event through a process recorded in the text; a prediction device adjuster for adjusting a corresponding event prediction device to maximize the percentage of text in which the inputted event is identical to a prediction result in a first text group selected from the subtexts; a prediction processor for generating a prediction result for each section, by inputting each text in a second text group selected from the corresponding subtexts in the adjusted event prediction device; and a search unit for calculating the prediction precision for the second text group of the event prediction device using a comparison between the inputted event and the prediction result for each subtext, and searching for keywords in sections with a certain degree of prediction precision.

    摘要翻译: 一种关键词搜索系统,包括:文本输入单元,用于输入通过将每个文本分割成多个部分而获得的子文本,同时通过文本中记录的处理将文本与事件相关联; 预测装置调整器,用于调整相应的事件预测装置,以使输入的事件与从所述子文件中选择的第一文本组中的预测结果相同的文本的百分比最大化; 一个预测处理器,用于通过在从被调整的事件预测装置中的相应子文件中选出的第二文本组中输入每个文本来产生每个部分的预测结果; 以及搜索单元,用于使用所输入的事件和每个子文本的预测结果之间的比较来计算事件预测装置的第二文本组的预测精度,并且以一定程度的预测精度搜索关键字。

    Cleansing a database system to improve data quality
    9.
    发明授权
    Cleansing a database system to improve data quality 有权
    清理数据库系统以提高数据质量

    公开(公告)号:US09104709B2

    公开(公告)日:2015-08-11

    申请号:US13422280

    申请日:2012-03-16

    IPC分类号: G06F7/00 G06F17/00 G06F17/30

    摘要: According to one embodiment of the present invention, a system controls cleansing of data within a database system, and comprises a computer system including at least one processor. The system receives a data set from the database system, and one or more features of the data set are selected for determining values for one or more characteristics of the selected features. The determined values are applied to a data quality estimation model to determine data quality estimates for the data set. Problematic data within the data set are identified based on the data quality estimates, where the cleansing is adjusted to accommodate the identified problematic data. Embodiments of the present invention further include a method and computer program product for controlling cleansing of data within a database system in substantially the same manner described above.

    摘要翻译: 根据本发明的一个实施例,系统控制数据库系统内的数据清理,并且包括包括至少一个处理器的计算机系统。 系统从数据库系统接收数据集,并且选择数据集的一个或多个特征以确定所选特征的一个或多个特征的值。 将确定的值应用于数据质量估计模型以确定数据集的数据质量估计。 基于数据质量估计来识别数据集中的有问题的数据,其中调整清洁以适应所识别的有问题的数据。 本发明的实施例还包括一种方法和计算机程序产品,用于以与上述基本相同的方式控制数据库系统内的数据清洗。