Patent search ap:"L Venkata Subramaniam" Page 1

1.

Granted invention patent
In-querying data cleansing with semantic standardization (in force)

Publication No.: US10120916B2

Publication date: 2018-11-06

Application No.: US13493945

Filing date: 2012-06-11

Applicants: Tanveer A. Faruquie, Mukesh K. Mohania, L. Venkata Subramaniam, Charles D. Wolfson

Inventors: Tanveer A. Faruquie, Mukesh K. Mohania, L. Venkata Subramaniam, Charles D. Wolfson

IPC classification: G06F17/30

Abstract: The present invention relates to data cleansing, and in particular performing the semantic standardization process within a database before the transform portion of the extract-transform-load (ETL) process. Provided are a method, system and computer program product for standardizing data within a database engine, configuring the standardization function to determine at least one standardized value for at least one data value by applying the standardization table in a context of at least one data value, receiving a database query identifying the standardization function, at least one database value and the context of the data, and invoking the standardization function.
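As a rough illustration of the idea in this abstract (a standardization function that lives inside the database engine and is named in the query together with a value and its context), the sketch below registers a Python function with an in-memory SQLite database. It is not the patented method: the table contents, the `standardize` name, and the `addresses` schema are all assumptions made up for the example.

```python
import sqlite3

# Hypothetical standardization table: (context, raw value) -> standardized value.
STANDARDIZATION_TABLE = {
    ("street", "st"): "Street",
    ("street", "rd"): "Road",
    ("title", "st"): "Saint",
}


def standardize(value, context):
    """Return the standardized form of value in the given context.

    Values with no entry for that context are returned unchanged.
    """
    key = (context, value.strip().lower().rstrip("."))
    return STANDARDIZATION_TABLE.get(key, value)


def run_demo():
    """Call the standardization function from inside a SQL query."""
    conn = sqlite3.connect(":memory:")
    # Register the Python function so that queries can invoke it by name.
    conn.create_function("standardize", 2, standardize)
    conn.execute("CREATE TABLE addresses (suffix TEXT)")
    conn.executemany(
        "INSERT INTO addresses VALUES (?)", [("St.",), ("rd",), ("Ave",)]
    )
    # The query names the function, the data value, and the context.
    rows = conn.execute(
        "SELECT standardize(suffix, 'street') FROM addresses ORDER BY rowid"
    ).fetchall()
    conn.close()
    return [row[0] for row in rows]
```

Note that the same raw value ("St") maps to different standardized values depending on the context passed in the query, which is the point of applying the table "in a context".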

2.

Granted invention patent
Efficient development of a rule-based system using crowd-sourcing (in force)

Publication No.: US08949204B2

Publication date: 2015-02-03

Application No.: US13597589

Filing date: 2012-08-29

Applicants: Snigdha Chaturvedi, Tanveer Afzal Faruquie, L. Venkata Subramaniam

Inventors: Snigdha Chaturvedi, Tanveer Afzal Faruquie, L. Venkata Subramaniam

IPC classification: G06F17/00, G06F17/30

CPC classification: G06F17/00, G06F17/30, G06F17/30303

Abstract: Described herein are methods, systems, apparatuses and products for efficient development of a rule-based system. An aspect provides a method including accessing data records; converting said data records to an intermediate form; utilizing intermediate forms to compute similarity scores for said data records; and selecting as an example to be provided for rule making at least one record of said data records having a maximum dissimilarity score indicative of dissimilarity to already considered examples.
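The selection step in this abstract can be sketched as follows. This is only a minimal illustration under stated assumptions: the intermediate form is taken to be a set of lowercase tokens, and similarity is taken to be Jaccard similarity. The abstract specifies neither, and the function names are hypothetical.

```python
def to_intermediate(record):
    """Intermediate form (assumed): the set of lowercase tokens in the record."""
    return frozenset(record.lower().split())


def similarity(a, b):
    """Jaccard similarity between two token sets (an assumed similarity score)."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def pick_next_example(records, considered):
    """Return the record most dissimilar to the already considered examples."""
    seen = [to_intermediate(r) for r in considered]
    best_record, best_score = None, -1.0
    for record in records:
        if record in considered:
            continue
        form = to_intermediate(record)
        # Dissimilarity is measured against the closest considered example.
        closest = max((similarity(form, s) for s in seen), default=0.0)
        dissimilarity = 1.0 - closest
        if dissimilarity > best_score:
            best_record, best_score = record, dissimilarity
    return best_record
```

For example, with `"12 main st"` already considered, `"po box 77"` shares no tokens with it and is chosen ahead of `"14 main st"`, which shares two.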

3.

Granted invention patent
Technique for searching for keywords determining event occurrence (lapsed)

Publication No.: US08005829B2

Publication date: 2011-08-23

Application No.: US12044378

Filing date: 2008-03-07

Applicants: Tetsuya Nasukawa, Shourya Roy, L. Venkata Subramaniam, Hironori Takeuchi

Inventors: Tetsuya Nasukawa, Shourya Roy, L. Venkata Subramaniam, Hironori Takeuchi

IPC classification: G06F7/00, G06F17/30, G06Q30/00

CPC classification: G06F17/30985, G06Q30/0256

Abstract: A keyword search system including a text input unit for inputting subtexts obtained by dividing each text into parts, while associating the subtexts with an event through a process recorded in the text; a prediction device adjuster for adjusting a corresponding event prediction device to maximize the percentage of text in which the inputted event is identical to a prediction result in a first text group selected from the subtexts; a prediction processor for generating a prediction result for each section, by inputting each text in a second text group selected from the corresponding subtexts in the adjusted event prediction device; and a search unit for calculating the prediction precision for the second text group of the event prediction device using a comparison between the inputted event and the prediction result for each subtext, and searching for keywords in sections with a certain degree of prediction precision.

4.

Invention patent application
RESOURCES MANAGEMENT IN DISTRIBUTED COMPUTING ENVIRONMENT (in force)

Publication No.: US20110191781A1

Publication date: 2011-08-04

Application No.: US12697228

Filing date: 2010-01-30

Applicants: Hima P. Karanam, Tanveer A. Faruquie, L. Venkata Subramaniam, Mukesh K. Mohania, Girish Venkatachaliah

Inventors: Hima P. Karanam, Tanveer A. Faruquie, L. Venkata Subramaniam, Mukesh K. Mohania, Girish Venkatachaliah

IPC classification: G06F9/50

CPC classification: G06F9/50

Abstract: A method, system and a computer program product for determining resources allocation in a distributed computing environment. An embodiment may include identifying resources in a distributed computing environment, computing provisioning parameters, computing configuration parameters and quantifying service parameters in response to a set of service level agreements (SLA). The embodiment may further include iteratively computing a completion time required for completion of the assigned task and a cost. Embodiments may further include computing an optimal resources configuration and computing at least one of an optimal completion time and an optimal cost corresponding to the optimal resources configuration. Embodiments may further include dynamically modifying the optimal resources configuration in response to at least one change in at least one of provisioning parameters, computing parameters and quantifying service parameters.

5.

Granted invention patent
Method for segmenting communication transcripts using unsupervised and semi-supervised techniques (in force)

Publication No.: US07912714B2

Publication date: 2011-03-22

Application No.: US12060469

Filing date: 2008-04-01

Applicants: Krishna Kummamuru, Deepak S. Padmanaban, Shourya Roy, L. Venkata Subramaniam

Inventors: Krishna Kummamuru, Deepak S. Padmanaban, Shourya Roy, L. Venkata Subramaniam

IPC classification: G10L15/06

CPC classification: G06F17/3071, G10L15/04

Abstract: A method is provided for forming discrete segment clusters of one or more sequential sentences from a corpus of communication transcripts of transactional communications that comprises dividing the communication transcripts of the corpus into a first set of sentences spoken by a caller and a second set of sentences spoken by a responder; generating a set of sentence clusters by grouping the first and second sets of sentences according to a measure of lexical similarity using an unsupervised partitional clustering method; generating a collection of sequences of sentence types by assigning a distinct sentence type to each sentence cluster and representing each sentence of each communication transcript of the corpus with the sentence type assigned to the sentence cluster into which the sentence is grouped; and generating a specified number of discrete segment clusters by successively merging sentence clusters according to a proximity-based measure between the sentence types assigned to the sentence clusters within sequences of the collection.

6.

Granted invention patent
Method and apparatus for determining decision points for streaming conversational data (in force)

Publication No.: US07904399B2

Publication date: 2011-03-08

Application No.: US11940551

Filing date: 2007-11-15

Applicants: L. Venkata Subramaniam, Ganesh Ramakrishnan, Tanveer A Faruquie

Inventors: L. Venkata Subramaniam, Ganesh Ramakrishnan, Tanveer A Faruquie

IPC classification: G06E1/00

CPC classification: G06F17/279, G10L15/1822

Abstract: A method for determining a decision point in real-time for a data stream from a conversation includes receiving streaming conversational data; and determining when to classify the streaming conversational data, using a measure of certainty, by performing certainty calculations at a plurality of time instances during the conversation and by selecting a decision point in response to the certainty calculations, the decision point not being based on a fixed window of conversational data but being based on accumulated conversational data available at different ones of the plurality of time instances. Systems and computer program products are also provided.
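The contrast this abstract draws (deciding from accumulated data when certainty is high enough, rather than after a fixed window) can be illustrated with a toy sketch. Everything here is an assumption for illustration only: the cue-word lists, the smoothed count-based certainty measure, and the 0.7 threshold are hypothetical and are not taken from the patent.

```python
def posterior(counts):
    """Turn per-class cue counts into probabilities using add-one smoothing."""
    total = sum(count + 1 for count in counts.values())
    return {label: (count + 1) / total for label, count in counts.items()}


def find_decision_point(utterances, cue_words, threshold=0.7):
    """Return (step, label) for the first step whose certainty meets threshold.

    Evidence accumulates over all utterances seen so far, so the decision
    point depends on the conversation itself, not on a fixed window size.
    Returns (None, None) if the threshold is never reached.
    """
    counts = {label: 0 for label in cue_words}
    for step, utterance in enumerate(utterances, start=1):
        tokens = utterance.lower().split()
        for label, cues in cue_words.items():
            counts[label] += sum(1 for token in tokens if token in cues)
        # Certainty calculation at this time instance, on accumulated data.
        probs = posterior(counts)
        label, certainty = max(probs.items(), key=lambda item: item[1])
        if certainty >= threshold:
            return step, label
    return None, None
```

With two classes and cue words such as `{"billing": {"invoice", "charge", "refund"}, "technical": {"error", "crash", "reboot"}}`, a conversation that mentions a charge and an invoice in its second utterance is classified at step 2, without waiting for the remaining utterances.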

7.

Invention patent application
Method and Apparatus for Determining Decision Points for Streaming Conversational Data (in force)

Publication No.: US20090132442A1

Publication date: 2009-05-21

Application No.: US11940551

Filing date: 2007-11-15

Applicants: L. Venkata Subramaniam, Ganesh Ramakrishnan, Tanveer A. Faruquie

Inventors: L. Venkata Subramaniam, Ganesh Ramakrishnan, Tanveer A. Faruquie

IPC classification: G06F15/18

CPC classification: G06F17/279, G10L15/1822

Abstract: A method for determining a decision point in real-time for a data stream from a conversation includes receiving streaming conversational data; and determining when to classify the streaming conversational data, using a measure of certainty, by performing certainty calculations at a plurality of time instances during the conversation and by selecting a decision point in response to the certainty calculations, the decision point not being based on a fixed window of conversational data but being based on accumulated conversational data available at different ones of the plurality of time instances. Systems and computer program products are also provided.

8.

Invention patent application
TECHNIQUE FOR SEARCHING FOR KEYWORDS DETERMINING EVENT OCCURRENCE (lapsed)

Publication No.: US20080256063A1

Publication date: 2008-10-16

Application No.: US12044378

Filing date: 2008-03-07

Applicants: Tetsuya Nasukawa, Shourya Roy, L. Venkata Subramaniam, Hironori Takeuchi

Inventors: Tetsuya Nasukawa, Shourya Roy, L. Venkata Subramaniam, Hironori Takeuchi

IPC classification: G06F17/30

CPC classification: G06F17/30985, G06Q30/0256

Abstract: A keyword search system including a text input unit for inputting subtexts obtained by dividing each text into parts, while associating the subtexts with an event through a process recorded in the text; a prediction device adjuster for adjusting a corresponding event prediction device to maximize the percentage of text in which the inputted event is identical to a prediction result in a first text group selected from the subtexts; a prediction processor for generating a prediction result for each section, by inputting each text in a second text group selected from the corresponding subtexts in the adjusted event prediction device; and a search unit for calculating the prediction precision for the second text group of the event prediction device using a comparison between the inputted event and the prediction result for each subtext, and searching for keywords in sections with a certain degree of prediction precision.

9.

Granted invention patent
Cleansing a database system to improve data quality (in force)

Publication No.: US09104709B2

Publication date: 2015-08-11

Application No.: US13422280

Filing date: 2012-03-16

Applicants: Snigdha Chaturvedi, Tanveer A Faruquie, Hima P Karanam, Mukesh K Mohania, L Venkata Subramaniam

Inventors: Snigdha Chaturvedi, Tanveer A Faruquie, Hima P Karanam, Mukesh K Mohania, L Venkata Subramaniam

IPC classification: G06F7/00, G06F17/00, G06F17/30

CPC classification: G06F17/30306, G06F17/30303, G06F17/30536

Abstract: According to one embodiment of the present invention, a system controls cleansing of data within a database system, and comprises a computer system including at least one processor. The system receives a data set from the database system, and one or more features of the data set are selected for determining values for one or more characteristics of the selected features. The determined values are applied to a data quality estimation model to determine data quality estimates for the data set. Problematic data within the data set are identified based on the data quality estimates, where the cleansing is adjusted to accommodate the identified problematic data. Embodiments of the present invention further include a method and computer program product for controlling cleansing of data within a database system in substantially the same manner described above.

10.

Granted invention patent
Automatically mining patterns for rule based data standardization systems (in force)

Publication No.: US08996524B2

Publication date: 2015-03-31

Application No.: US13415144

Filing date: 2012-03-08

Applicants: Snigdha Chaturvedi, Tanveer A Faruquie, Hima P. Karanam, Marvin Mendelssohn, Mukesh K. Mohania, L. Venkata Subramaniam

Inventors: Snigdha Chaturvedi, Tanveer A Faruquie, Hima P. Karanam, Marvin Mendelssohn, Mukesh K. Mohania, L. Venkata Subramaniam

IPC classification: G06F7/00, G06F17/30

CPC classification: G06F17/30705, G06F17/2775, G06F17/30675, G06F2216/03, G06Q10/06, G06Q10/10, G06Q30/02

Abstract: Methods, computer program products and systems are provided for mining for sub-patterns within a text data set. The embodiments facilitate finding a set of N frequently occurring sub-patterns within the data set, extracting the N sub-patterns from the data set, and clustering the extracted sub-patterns into K groups, where each extracted sub-pattern is placed within the same group with other extracted sub-patterns based upon a distance value D that determines a degree of similarity between the sub-pattern and every other sub-pattern within the same group.
