DATA CONVERSION APPARATUS, DATA CONVERSION METHOD AND PROGRAM

    公开(公告)号:US20240241886A1

    公开(公告)日:2024-07-18

    申请号:US18561925

    申请日:2021-05-18

    摘要: Provided is a data conversion device 1 that converts log data into structured data, the device including:



    a determination unit 21 configured to determine, based on an appearance frequency of natural or non-natural language characters appearing in a document, whether the log data is first log data written in a natural language or second log data output mechanically output from a device;
    a classification unit 25 configured to generate a classifier for classifying first log data into a category based on several pieces of first log data, as well as a plurality of categories, classify each piece of first log data into one of the plurality of categories using the classifier, and assign a vector obtained by vectorizing the meaning of a word contained in the several pieces of first log data;
    a generation unit 26 configured to replace a plurality of words with a specific word, wherein the plurality of words have a vector similarity not less than a threshold and are regarded as the same word, among a plurality of words contained in the several pieces of first log data, for each category, and to generate log data composed of sentences shared by the several pieces of post-replacement first log data as a category template; and
    a second extraction unit 22 configured to specify, in a case where it is determined that to-be-converted log data is the first log data, a category into which the to-be-converted log data will be classified using the classifier, to extract a unique variable for the to-be-converted log data by comparing the to-be-converted log data and a category template of the specified category, and to output the category template and the unique variable as structured data of the to-be-converted log data.

    Machine learning-based generation of synthesized documents

    公开(公告)号:US12039256B2

    公开(公告)日:2024-07-16

    申请号:US17969862

    申请日:2022-10-20

    摘要: An apparatus comprises a processing device configured to receive a request to generate a synthesized document comprising one or more search terms, and to extract, utilizing a first machine learning model, keywords from a set of documents. The processing device is also configured to select first content for inclusion in a first section of the synthesized document based on a similarity of the search terms and the extracted keywords from corresponding first sections of the set of documents, and to determine, utilizing a second machine learning model that takes as input the selected first content, a set of terms for a second section of the synthesized document. The processing device is further configured to select second content for inclusion in the second section of the synthesized document based on a similarity of the determined set of terms and the extracted keywords from corresponding sections of the set of documents.

    Inline categorizing of events
    55.
    发明授权

    公开(公告)号:US12033008B2

    公开(公告)日:2024-07-09

    申请号:US18363031

    申请日:2023-08-01

    申请人: PagerDuty, Inc.

    摘要: A learner object that incorporates indications of agreements and disagreements with determinations obtained from a clustering engine of adding incoming events to one or more events groups is generated. An event is received based on monitored conditions. A determination is made not to add the event to an events group based on a first similarity score obtained from the clustering engine between the event and the events group not exceeding a threshold value. In response to determining not to add the event to the events group, a determination to add the event to the events group is obtained based on the learner object. In response to the determination obtained based on the learner object, the event is added with to the events group. A user interface configured to visually display and obtain feedback regarding additions of events to the events groups based on determinations of the clustering engine is generated.

    INFORMATION DISPLAY METHOD, DEVICE, COMPUTER APPARATUS AND STORAGE MEDIUM

    公开(公告)号:US20240220084A1

    公开(公告)日:2024-07-04

    申请号:US18523829

    申请日:2023-11-29

    摘要: The present disclosure provides an information display method, device, computer apparatus and storage medium, wherein the method comprises: receiving an access request for book encyclopedia information of a target book; acquiring the book encyclopedia information of the target book, wherein, the book encyclopedia information comprises a plurality of information modules, each information module corresponding to at least one book attribute dimension, and the book encyclopedia information belonging to each book attribute dimension is determined according to user's original innovative information and/or information obtained by automatically identifying book-related content of the target book; acquiring and displaying a book encyclopedia page matching book category of the target book, and displaying the book encyclopedia information of each information module in the book encyclopedia page.

    Confidentiality filter for AI content enhancement

    公开(公告)号:US12026452B1

    公开(公告)日:2024-07-02

    申请号:US18351923

    申请日:2023-07-13

    发明人: Bryan J. Jakovcic

    摘要: A method of enabling a remotely hosted AI to enhance a document while withholding sensitive information from the AI includes identifying sensitive terminology associated with the sensitive information and replacing by an exclusion filter of the sensitive terminology with redaction markers that cannot be interpreted as misspelled words or coined terms, thereby creating a redacted draft that is submitted to the AI. Upon receiving an enhanced, redacted draft from the AI, the original sensitive terminology is restored in place of the redaction markers, and the resulting enhanced document is delivered to a user. Sensitive terms can be local or global. Globally sensitive terms can be stored in databases directed to categories of sensitive information. Sensitive terms can include indicators directed to sensitive numerical quantities and/or other targets. In embodiments, the exclusion filter automatically corrects any grammatical errors arising from replacement of the redaction markers by the original sensitive terminology.