AUTOMATED ASSISTANCE FOR GENERATING RELEVANT AND VALUABLE SEARCH RESULTS FOR AN ENTITY OF INTEREST

    Publication No.: US20220138272A1

    Publication date: 2022-05-05

    Application No.: US17564056

    Filing date: 2021-12-28

    Abstract: Systems and methods are provided for identifying relevant information for an entity, referred to as a seed entity. A plurality of search queries can be generated, each comprising a property of the seed entity or of one of the entities associated with the seed entity (seed-linked entities). Preferably, the collection of search queries includes queries representing different properties of the seed entity and properties of different seed-linked entities. Optionally, the collection of search queries is optimized to reduce search burden. Searches can then be conducted with the search queries in one or more data sources to obtain a plurality of search results, wherein each search result comprises a hit entity and one or more entities associated with the hit entity (hit-linked entities). For each of the search results, a score can be determined taking as input (a) the likelihood of a match between the seed entity and the hit entity or between a seed-linked entity and a hit-linked entity, (b) the presence of a new entity in the search result not present in the search queries, or a difference between the new entity and an entity present in the search queries, and (c) a characteristic of the new entity in the search result. Based on the scores, high-priority search results can be presented to a user for further analysis.
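
The scoring step in this abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the patent: the `SearchResult` class, the linear weighting `w_match`/`w_new`, the default weight of 0.1 for unknown entities, and the use of a caller-supplied `entity_weight` table as the "characteristic" of a new entity.

```python
from dataclasses import dataclass, field


@dataclass
class SearchResult:
    hit_entity: str
    hit_linked: set = field(default_factory=set)
    match_likelihood: float = 0.0  # input (a), assumed precomputed in [0, 1]


def score_result(result, query_entities, entity_weight, w_match=0.6, w_new=0.4):
    """Combine inputs (a), (b), and (c) into one score (hypothetical weighting)."""
    # (b) entities in the result that did not appear in any search query
    new_entities = ({result.hit_entity} | result.hit_linked) - query_entities
    # (c) a characteristic of each new entity, here a caller-supplied weight
    novelty = sum(entity_weight.get(e, 0.1) for e in new_entities)
    return w_match * result.match_likelihood + w_new * min(novelty, 1.0)


def prioritize(results, query_entities, entity_weight, top_k=2):
    """Return the top_k results in descending score order."""
    ranked = sorted(
        results,
        key=lambda r: score_result(r, query_entities, entity_weight),
        reverse=True,
    )
    return ranked[:top_k]


results = [
    SearchResult("Acme Ltd", {"J. Doe"}, 0.9),
    SearchResult("Acme Ltd", {"J. Doe", "Offshore Co"}, 0.8),
    SearchResult("Other", set(), 0.1),
]
query_entities = {"Acme Ltd", "J. Doe"}
entity_weight = {"Offshore Co": 1.0}
top = prioritize(results, query_entities, entity_weight)
```

In this toy data the second result ranks first: its match likelihood is slightly lower, but it surfaces an entity that no query contained.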

    Systems and methods for validating data

    Publication No.: US11221898B2

    Publication date: 2022-01-11

    Application No.: US16675056

    Filing date: 2019-11-05

    Abstract: Systems and methods are provided for validating data in a data set. A data set including data to validate and a validator to use in validating the data are selected based on user input generated from a user's interactions with a graphical user interface. The validator is applied to the data to determine whether one or more statistics generated through application of the validator to the data are valid or invalid based on a validation routine associated with the validator. A data quality report indicating whether the data set is valid or invalid, based on a determination of whether the one or more statistics are valid or invalid, is generated and selectively presented to the user through the graphical user interface.
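
As a rough sketch of the flow this abstract describes (a validator produces statistics, a validation routine judges them, and a report summarizes the outcome), consider the following. The validator names, the thresholds, and the report layout are hypothetical; the graphical user interface is left out entirely.

```python
from statistics import mean


def null_fraction(values):
    """Statistic: share of missing (None) entries."""
    return sum(v is None for v in values) / len(values)


def column_mean(values):
    """Statistic: mean of the non-missing entries."""
    present = [v for v in values if v is not None]
    return mean(present)


# Hypothetical validators: a statistic paired with the routine that judges it.
VALIDATORS = {
    "max_null_fraction": (null_fraction, lambda s: s <= 0.25),
    "mean_in_range": (column_mean, lambda s: 0 <= s <= 100),
}


def validate(data, validator_names):
    """Apply the chosen validators and build a simple data quality report."""
    checks = {}
    for name in validator_names:
        statistic, routine = VALIDATORS[name]
        value = statistic(data)
        checks[name] = {"statistic": value, "valid": routine(value)}
    # The data set counts as valid only if every statistic passed its routine.
    return {"valid": all(c["valid"] for c in checks.values()), "checks": checks}


good = validate([10, 20, None, 30], ["max_null_fraction", "mean_in_range"])
bad = validate([None, None, 5, 500], ["max_null_fraction", "mean_in_range"])
```

Here `good["valid"]` is true and `bad["valid"]` is false, since the second data set fails both the null-fraction and the mean checks.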

    SYSTEMS AND METHODS FOR VALIDATING DATA
    13.
    Patent application

    Publication No.: US20200073743A1

    Publication date: 2020-03-05

    Application No.: US16675056

    Filing date: 2019-11-05

    Abstract: Systems and methods are provided for validating data in a data set. A data set including data to validate and a validator to use in validating the data are selected based on user input generated from a user's interactions with a graphical user interface. The validator is applied to the data to determine whether one or more statistics generated through application of the validator to the data are valid or invalid based on a validation routine associated with the validator. A data quality report indicating whether the data set is valid or invalid, based on a determination of whether the one or more statistics are valid or invalid, is generated and selectively presented to the user through the graphical user interface.

    FEATURE CLUSTERING OF USERS, USER CORRELATION DATABASE ACCESS, AND USER INTERFACE GENERATION SYSTEM
    14.
    Patent application
    Status: under examination (published)

    Publication No.: US20170060930A1

    Publication date: 2017-03-02

    Application No.: US15239585

    Filing date: 2016-08-17

    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for a feature clustering of users, user correlation database access, and user interface generation system. The system can obtain information stored in different databases located across geographic regions, and determine unique users from the different information. The information can be included in unique records in the databases, with each record describing a particular user, and with each user described with imperfect identifying information. The system can analyze the different information utilizing machine learning models, and can associate each record with a particular unique user. The system can obtain identifications of items associated with each user, and determine the propensity of the user to disassociate with one or more items, or determine likelihoods of future association with different items not presently associated with the user.
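
The step of associating records that carry imperfect identifying information with unique users can be sketched as a small record-linkage routine. This is not the patented method: plain string similarity from `difflib` stands in for the machine learning models, and the `name`/`email` fields and the 0.8 threshold are assumptions chosen for illustration.

```python
from difflib import SequenceMatcher


def similarity(a, b):
    """Average string similarity of name and email, in [0, 1]."""
    name = SequenceMatcher(None, a["name"].lower(), b["name"].lower()).ratio()
    email = SequenceMatcher(None, a["email"].lower(), b["email"].lower()).ratio()
    return (name + email) / 2


def assign_users(records, threshold=0.8):
    """Give each record a user id; records linked by similarity share an id."""
    parent = list(range(len(records)))

    def find(i):
        # Union-find lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i in range(len(records)):
        for j in range(i + 1, len(records)):
            if similarity(records[i], records[j]) >= threshold:
                parent[find(j)] = find(i)
    return [find(i) for i in range(len(records))]


records = [
    {"name": "Jane Smith", "email": "jane.smith@example.com"},
    {"name": "Jane Smyth", "email": "jane.smith@example.com"},
    {"name": "Robert Brown", "email": "rbrown@example.org"},
]
user_ids = assign_users(records)
```

The first two records end up with the same user id despite the misspelled surname, while the third stays separate. The pairwise comparison is quadratic in the number of records, so a real system spanning several databases would need blocking or indexing first.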

    Crime risk forecasting
    15.
    Granted patent
    Status: in force

    Publication No.: US09129219B1

    Publication date: 2015-09-08

    Application No.: US14319161

    Filing date: 2014-06-30

    Abstract: A computer-based crime risk forecasting system and corresponding method are provided for generating crime risk forecasts and conveying the forecasts to a user. With the conveyed forecasts, the user can more effectively gauge both the level of increased crime threat and its potential duration. The user can then leverage the information conveyed by the forecasts to take a more proactive approach to law enforcement in the affected areas during the period of increased crime threat.

    SYSTEMS AND METHODS FOR SELECTING MACHINE LEARNING TRAINING DATA

    Publication No.: US20230008175A1

    Publication date: 2023-01-12

    Application No.: US17930046

    Filing date: 2022-09-06

    Abstract: Systems and methods are provided for selecting training examples to increase the efficiency of supervised active machine learning processes. Training examples for presentation to a user may be selected according to a measure of the model's uncertainty in labeling the examples. The number of training examples presented may be chosen to minimize user downtime in the machine learning process, increasing efficiency between the user and the processing system.
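
Selecting examples by model uncertainty is commonly done with entropy-based uncertainty sampling, sketched below. The abstract does not say which uncertainty measure is used, so entropy is an assumption, and `batch_size_for` is a hypothetical heuristic for the batch-size idea, not the patent's rule.

```python
import math


def entropy(probs):
    """Shannon entropy of a predicted class distribution (higher = less certain)."""
    return -sum(p * math.log(p) for p in probs if p > 0)


def select_batch(predictions, batch_size):
    """Return ids of the batch_size examples the model is least certain about."""
    ranked = sorted(
        predictions,
        key=lambda ex_id: entropy(predictions[ex_id]),
        reverse=True,
    )
    return ranked[:batch_size]


def batch_size_for(retrain_seconds, seconds_per_label):
    """Hypothetical heuristic: queue enough examples to occupy the user while the model retrains."""
    return max(1, math.ceil(retrain_seconds / seconds_per_label))


# Example: predicted class probabilities per unlabeled example id.
predictions = {
    "a": [0.5, 0.5],
    "b": [0.9, 0.1],
    "c": [0.6, 0.4],
    "d": [1.0, 0.0],
}
batch = select_batch(predictions, 2)
size = batch_size_for(retrain_seconds=30, seconds_per_label=4)
```

With these probabilities the batch is `["a", "c"]`, the two examples closest to a coin flip, and the heuristic suggests a batch of 8 when retraining takes 30 seconds and a label takes about 4.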
