Document text processing using edge detection
    11.
    发明授权
    Document text processing using edge detection 有权
    使用边缘检测记录文本处理

    公开(公告)号:US09569413B2

    公开(公告)日:2017-02-14

    申请号:US13465833

    申请日:2012-05-07

    CPC classification number: G06F17/2241 G06F17/30719

    Abstract: A document is received that has a plurality of lines with text. This document includes text associated with at least one topic of interest and text not associated with the at least one topic of interest. Thereafter, it is determined, for each line in the document, a length of the line and a number of off-topic indicators with the off-topic indicators characterizing portions of the document as likely being not being associated with the at least one topic of interest. Thereafter, a density for each line can be determined based on the determined line length and the determined number of off-topic indicators. The determined densities for each line are used to identify portions of the documents likely associated with the at least one topic of interest so that data characterizing the identified portions of the document can be provided. Related apparatus, systems, techniques and articles are also described.

    Abstract translation: 收到具有多行文本的文档。 本文档包括与至少一个感兴趣的主题相关联的文本和与该至少一个感兴趣的主题不相关联的文本。 此后,对于文档中的每一行,确定具有表示文档部分的偏离主题指示符的线的长度和偏离主题指示符的数量可能不与至少一个主题相关联 利益。 此后,可以基于确定的行长度和确定的脱离主题指示数来确定每行的密度。 用于确定每行的密度用于识别可能与所述至少一个感兴趣的主题相关联的文档的部分,从而可以提供表征文档的所识别的部分的数据。 还描述了相关设备,系统,技术和物品。

    Document Text Processing Using Edge Detection
    12.
    发明申请
    Document Text Processing Using Edge Detection 有权
    使用边缘检测的文档处理

    公开(公告)号:US20130297999A1

    公开(公告)日:2013-11-07

    申请号:US13465833

    申请日:2012-05-07

    CPC classification number: G06F17/2241 G06F17/30719

    Abstract: A document is received that has a plurality of lines with text. This document includes text associated with at least one topic of interest and text not associated with the at least one topic of interest. Thereafter, it is determined, for each line in the document, a length of the line and a number of off-topic indicators with the off-topic indicators characterizing portions of the document as likely being not being associated with the at least one topic of interest. Thereafter, a density for each line can be determined based on the determined line length and the determined number of off-topic indicators. The determined densities for each line are used to identify portions of the documents likely associated with the at least one topic of interest so that data characterizing the identified portions of the document can be provided. Related apparatus, systems, techniques and articles are also described.

    Abstract translation: 收到具有多行文本的文档。 本文档包括与至少一个感兴趣的主题相关联的文本和与该至少一个感兴趣的主题不相关联的文本。 此后,对于文档中的每一行,确定具有表示文档部分的偏离主题指示符的线的长度和偏离主题指示符的数量可能不与至少一个主题相关联 利益。 此后,可以基于确定的行长度和确定的脱离主题指示数来确定每行的密度。 用于确定每行的密度用于识别可能与所述至少一个感兴趣的主题相关联的文档的部分,从而可以提供表征文档的所识别的部分的数据。 还描述了相关设备,系统,技术和物品。

    Entity Matching Using Machine Learning
    13.
    发明申请
    Entity Matching Using Machine Learning 有权
    实体匹配使用机器学习

    公开(公告)号:US20130185306A1

    公开(公告)日:2013-07-18

    申请号:US13350429

    申请日:2012-01-13

    Applicant: Sherif Botros

    Inventor: Sherif Botros

    CPC classification number: G06F17/30522

    Abstract: Techniques for information retrieval include receiving a plurality of data records, each data record including data fields associated with a business enterprise, the data fields including a name of the business enterprise; updating a plurality of database records associated with the received plurality of data records stored in a database, each database record including attributes including the name of the business enterprise and an alias associated with the name of the business enterprise; receiving a query for a particular database record, the query including at least one of the name of the business enterprise or the alias associated with the name of the business enterprise; and preparing for display, in response to the query, one or more of the database records based on at least one of the name of the business enterprise or the alias associated with the name of the business enterprise.

    Abstract translation: 用于信息检索的技术包括接收多个数据记录,每个数据记录包括与商业企业相关联的数据字段,所述数据字段包括商业企业的名称; 更新与存储在数据库中的所接收的多个数据记录相关联的多个数据库记录,每个数据库记录包括包括商业企业的名称和与商业企业的名称相关联的别名的属性; 接收对特定数据库记录的查询,所述查询包括至少一个所述商业企业的名称或与所述商业企业的名称相关联的别名; 以及基于所述商业企业的名称或与所述商业企业的名称相关联的别名中的至少一个来准备显示所述数据库记录中的一个或多个数据库记录。

    Customized Reporting and Mining of Event Data
    14.
    发明申请
    Customized Reporting and Mining of Event Data 有权
    定制报告和挖掘事件数据

    公开(公告)号:US20080172409A1

    公开(公告)日:2008-07-17

    申请号:US11623010

    申请日:2007-01-12

    CPC classification number: G06F17/30991 Y10S707/99943

    Abstract: Event data (e.g., log messages) are represented as sets of attribute/value pairs. An index maps each attribute/value pair or attribute/value tuple to a pointer that points to event data which contains the attribute/value pair or attribute/value tuple. An attribute co-occurrence map or matrix can be generated that includes attribute names that co-occur together. Queries and custom reports can be generated by projecting event data into one or more attributes or attribute/value pairs, and then determining statistics on other attributes using a combination of the inverted index, the attribute co-occurrence map or matrix, operations on sets and/or math and statistical functions.

    Abstract translation: 事件数据(例如,日志消息)被表示为属性/值对的集合。 索引将每个属性/值对或属性/值元组映射到指向包含属性/值对或属性/值元组的事件数据的指针。 可以生成包括共同出现的属性名称的属性共现映射或矩阵。 可以通过将事件数据投影到一个或多个属性或属性/值对中来生成查询和自定义报告,然后使用反向索引,属性共现映射或矩阵的组合来确定其他属性的统计信息,集合上的操作和 /或数学和统计功能。

    Adaptive record linking in a distributed computing system
    15.
    发明授权
    Adaptive record linking in a distributed computing system 有权
    分布式计算系统中的自适应记录链接

    公开(公告)号:US09552393B2

    公开(公告)日:2017-01-24

    申请号:US13350429

    申请日:2012-01-13

    Applicant: Sherif Botros

    Inventor: Sherif Botros

    CPC classification number: G06F17/30522

    Abstract: Techniques for information retrieval include the features of receiving a plurality of data records, updating a plurality of database records associated with the received plurality of data records stored in a database, receiving a query for a particular database record, and preparing for display, in response to the query, one or more of the database records based on at least one of the name of the business enterprise or the alias associated with the name of the business enterprise. Each data record includes data fields associated with a business enterprise. The data fields include a name of the business enterprise. Each database record includes attributes including the name of the business enterprise and an alias associated with the name of the business enterprise. The query includes at least one of the name of the business enterprise or the alias associated with the name of the business enterprise.

    Abstract translation: 用于信息检索的技术包括接收多个数据记录的特征,更新与存储在数据库中的所接收的多个数据记录相关联的多个数据库记录,接收特定数据库记录的查询,以及作为响应的准备显示 基于至少一个商业企业的名称或与商业企业的名称相关联的别名,查询一个或多个数据库记录。 每个数据记录包括与企业有关的数据字段。 数据字段包括商业企业的名称。 每个数据库记录包括属性,包括企业名称和与企业名称相关联的别名。 该查询至少包括一个商业企业的名称或与该企业名称相关联的别名。

    CLASSIFYING DATA USING MACHINE LEARNING
    16.
    发明申请
    CLASSIFYING DATA USING MACHINE LEARNING 有权
    使用机器学习分类数据

    公开(公告)号:US20130304740A1

    公开(公告)日:2013-11-14

    申请号:US13945720

    申请日:2013-07-18

    Applicant: Sherif Botros

    Inventor: Sherif Botros

    CPC classification number: G06F17/30598 G06Q10/00

    Abstract: Techniques for data classification include matching one or more attributes of a commodity with one or more terms of a plurality of terms in a word matrix; generating, based on the matching, a vector for the commodity; and identifying, based on the vector, one or more classification regions that each define a classification of the commodity.

    Abstract translation: 用于数据分类的技术包括将商品的一个或多个属性与单词矩阵中的多个术语中的一个或多个术语进行匹配; 基于匹配产生商品的向量; 以及基于所述向量来识别每个定义所述商品分类的一个或多个分类区域。

    Enterprise Resource Planning System Entity Event Monitoring
    17.
    发明申请
    Enterprise Resource Planning System Entity Event Monitoring 审中-公开
    企业资源规划系统实体事件监控

    公开(公告)号:US20130297361A1

    公开(公告)日:2013-11-07

    申请号:US13465869

    申请日:2012-05-07

    CPC classification number: G06Q10/0631

    Abstract: A company is associated, in an enterprise resource planning system, with a plurality of business entities that each have at least one structured record used by the enterprise resource planning system to characterize the business entity. Thereafter, documents are obtained from a plurality of information sources that characterize events associated with each business entity. It is then determined, using pre-defined business rules, which of the events are pertinent to the company so that enhancement records can be generated for the events determined to be pertinent to the company. These enhancement records characterize the corresponding event and are linked to the structured record for the corresponding business entity. Related apparatus, systems, techniques and articles are also described.

    Abstract translation: 公司在企业资源计划系统中与多个业务实体相关联,每个业务实体至少有一个由企业资源规划系统使用的结构化记录来表征业务实体。 此后,从表示与每个业务实体相关联的事件的多个信息源获得文档。 然后,使用预定义的业务规则确定哪个事件与公司相关,以便可以为确定与公司相关的事件生成增强记录。 这些增强记录表征相应的事件,并链接到相应业务实体的结构化记录。 还描述了相关设备,系统,技术和物品。

    Searching for associated events in log data
    18.
    发明授权
    Searching for associated events in log data 有权
    在日志数据中搜索关联的事件

    公开(公告)号:US08306967B2

    公开(公告)日:2012-11-06

    申请号:US11866337

    申请日:2007-10-02

    CPC classification number: G06F17/30424 G06F17/30637 G06F17/30666

    Abstract: To retrieve a sequence of associated events in log data, a request expression is parsed to retrieve types of dependencies between events which are searched, and the constraints (e.g., keywords) which characterize each event. Based on the parsing results, query components can be formed, expressing the constraints for individual events and interrelations (e.g., time spans) between events. A resultant span query comprising the query components can then be run against an index of events, which encodes a mutual location of associated events in storage.

    Abstract translation: 为了在日志数据中检索关联事件序列,解析请求表达式以检索搜索的事件之间的依赖关系类型以及表征每个事件的约束(例如,关键字)。 基于解析结果,可以形成查询组件,表示事件之间的各个事件和相互关系(例如,时间跨度)的约束。 然后可以针对事件索引来运行包括查询组件的合成跨度查询,该事件索引编码存储器中相关联的事件的相互位置。

    SEARCHING FOR ASSOCIATED EVENTS IN LOG DATA
    19.
    发明申请
    SEARCHING FOR ASSOCIATED EVENTS IN LOG DATA 有权
    在日志数据中搜索相关事件

    公开(公告)号:US20090089252A1

    公开(公告)日:2009-04-02

    申请号:US11866337

    申请日:2007-10-02

    CPC classification number: G06F17/30424 G06F17/30637 G06F17/30666

    Abstract: To retrieve a sequence of associated events in log data, a request expression is parsed to retrieve types of dependencies between events which are searched, and the constraints (e.g., keywords) which characterize each event. Based on the parsing results, query components can be formed, expressing the constraints for individual events and interrelations (e.g., time spans) between events. A resultant span query comprising the query components can then be run against an index of events, which encodes a mutual location of associated events in storage.

    Abstract translation: 为了在日志数据中检索关联事件序列,解析请求表达式以检索搜索的事件之间的依赖关系类型以及表征每个事件的约束(例如,关键字)。 基于解析结果,可以形成查询组件,表示事件之间的各个事件和相互关系(例如,时间跨度)的约束。 然后可以针对事件索引来运行包括查询组件的合成跨度查询,该事件索引编码存储器中相关联的事件的相互位置。

    Decentralized peer-to-peer advertisement
    20.
    发明授权
    Decentralized peer-to-peer advertisement 有权
    分散式对等广告

    公开(公告)号:US07263560B2

    公开(公告)日:2007-08-28

    申请号:US10231544

    申请日:2002-08-30

    Abstract: Embodiments of a shared resource distributed index mechanism that peers in a peer-to-peer network may utilize to distribute index entries corresponding to resources to indexes of shared resources among one or more other peers. These indexes may be used to direct queries to peers where the queries are most likely to be answered. When a query is received by a rendezvous peer including one or more indexes, contents of the query may be “looked up” in the index to find matches. The results of the lookup may include information on one or peer(s) that may hold advertisement(s) to the resource requested by the query. The query may then be forwarded to one or more peers that may hold the advertisement for the resource. Embodiments may provide “loosely-coupled” distribution of index entries for use in querying for resources in the peer-to-peer network.

    Abstract translation: 对等网络中的对等体可以利用的共享资源分布式索引机制的实施例将与资源相对应的索引条目分发到一个或多个其他对等体中的共享资源的索引。 这些索引可用于将查询引导到查询最有可能应答的对等体。 当包含一个或多个索引的会合对等体接收到查询时,查询的内容可能在索引中“查找”以查找匹配项。 查找的结果可以包括可以将广告保存到由查询请求的资源的一个或多个对等体上的信息。 然后可以将查询转发到可以保存资源的广告的一个或多个对等体。 实施例可以提供用于查询对等网络中的资源的索引条目的“松散耦合”分配。

Patent Agency Ranking