INFORMATION PROCESSING METHOD AND DEVICE, ELECTRONIC APPARATUS AND STORAGE MEDIUM

    Publication No.: US20240338392A1

    Publication Date: 2024-10-10

    Application No.: US18625945

    Filing Date: 2024-04-03

    IPC Classification: G06F16/31 G06F16/338

    CPC Classification: G06F16/322 G06F16/338

    Abstract: An information processing method and device, an electronic apparatus and a storage medium are disclosed. The information processing method includes: displaying a directory tree corresponding to documents in a document repository, wherein nodes are displayed in the directory tree according to a hierarchical relationship of the documents, and each of the nodes corresponds to at least one document; in response to a first operation event on the nodes in the directory tree, selecting a target node; and in response to a second operation event on the target node, performing a bulk operation corresponding to the second operation event on a target document corresponding to the target node; wherein the target document comprises at least two documents, and the target node comprises at least two nodes corresponding to the at least two documents.
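The two-event flow in this abstract can be illustrated with a minimal sketch. The `Node` class, the `select_nodes` and `bulk_operation` helpers, and the "archive" operation are all hypothetical names chosen for illustration; the abstract does not specify a data model or the kind of bulk operation.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """One directory-tree node; each node corresponds to at least one document."""
    name: str
    documents: list[str]
    children: list["Node"] = field(default_factory=list)


def select_nodes(root: Node, names: set[str]) -> list[Node]:
    """First operation event: collect the picked nodes in depth-first display order."""
    selected = []
    stack = [root]
    while stack:
        node = stack.pop()
        if node.name in names:
            selected.append(node)
        # Push children in reverse so they are popped in display order.
        stack.extend(reversed(node.children))
    return selected


def bulk_operation(targets: list[Node], operation) -> list[str]:
    """Second operation event: apply one operation to every target document."""
    if len(targets) < 2:
        raise ValueError("a bulk operation needs at least two target nodes")
    return [operation(doc) for node in targets for doc in node.documents]


# Hypothetical repository: root > (guides > setup), api
tree = Node("root", ["index.md"], [
    Node("guides", ["guides.md"], [Node("setup", ["setup.md"])]),
    Node("api", ["api.md"]),
])
targets = select_nodes(tree, {"setup", "api"})
archived = bulk_operation(targets, lambda doc: f"archive/{doc}")
```

The guard in `bulk_operation` mirrors the abstract's requirement that the target node comprises at least two nodes.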

    CONTENT PROCESSING METHOD AND COMPUTER-READABLE MEDIUM

    Publication No.: US20240248923A1

    Publication Date: 2024-07-25

    Application No.: US18627150

    Filing Date: 2024-04-04

    IPC Classification: G06F16/338

    CPC Classification: G06F16/338

    Abstract: Provided is a content processing method for determining a degree of priority of presentation of each of a plurality of contents, comprising: identifying the plurality of contents; receiving, from among the plurality of contents, a first set including contents given positive evaluations by an operator and a second set including contents given negative evaluations by the operator; extracting a first word set included in the first set and a second word set included in the second set; identifying, according to a first evaluation criterion, a plurality of keywords including a plurality of positive keywords related to the first set and a plurality of negative keywords related to the second set, the plurality of positive keywords being identified based on the first word set, the plurality of negative keywords being identified based on the second word set; giving weights to the plurality of keywords according to a second evaluation criterion so as to give a weight of zero or more to each of the plurality of positive keywords and a weight of zero or less to each of the plurality of negative keywords; deriving a total for each of the plurality of contents by summing, over the plurality of keywords, a product of a frequency of appearance of each keyword in the content and the weight given to that keyword; and determining the degree of priority of presentation of each of the plurality of contents based on the total for that content.
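The scoring step described above is a weighted sum of keyword frequencies. The sketch below assumes the keywords and their weights have already been produced by the two evaluation criteria, which the abstract does not define; the function names, the sample weights, and whitespace tokenization are illustrative assumptions.

```python
from collections import Counter


def total_score(text: str, weights: dict[str, float]) -> float:
    """Sum of (frequency of keyword in the content) x (weight of keyword)."""
    counts = Counter(text.lower().split())  # assumed tokenization: whitespace
    return sum(counts[keyword] * weight for keyword, weight in weights.items())


def rank_contents(contents: dict[str, str], weights: dict[str, float]) -> list[str]:
    """Order content IDs from highest to lowest total (presentation priority)."""
    return sorted(contents, key=lambda cid: total_score(contents[cid], weights), reverse=True)


# Hypothetical weights: positive keywords >= 0, negative keywords <= 0.
weights = {"battery": 2.0, "solar": 1.0, "recall": -3.0}
contents = {
    "doc_a": "solar battery storage with battery management",
    "doc_b": "battery recall notice",
    "doc_c": "solar panel pricing",
}
ranking = rank_contents(contents, weights)
```

Here `doc_a` totals 2 x 2.0 + 1 x 1.0 = 5.0, `doc_c` totals 1.0, and `doc_b` totals 2.0 - 3.0 = -1.0, so contents resembling the positively evaluated set are presented first.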

    METHOD, APPARATUS, AND COMPUTER PROGRAM PRODUCT FOR CLASSIFICATION AND TAGGING OF TEXTUAL DATA

    Publication No.: US20240220526A1

    Publication Date: 2024-07-04

    Application No.: US18409278

    Filing Date: 2024-01-10

    Applicant: Groupon, Inc.

    Inventor: Nick PENDAR

    Abstract: Provided herein are systems, methods and computer readable media for classification and tagging of textual data. An example method may include accessing a corpus comprising a plurality of documents, each document having one or more labels indicative of services offered by a merchant, generating a query based on extracted features and the documents, generating a precision score for at least a portion of the generated query and selecting a subset of the generated queries based on an assigned precision score satisfying a precision score threshold, the selected subset of the generated queries configured to provide an indication of one or more labels to be applied to machine readable text. A second example method, utilized for tagging machine readable text with unknown labels, may include assigning a label to textual portions of the machine readable text based on results of the application of the queries.
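A minimal sketch of precision-based query selection follows. It assumes each query is a single term paired with a label and that precision is the fraction of matching documents carrying that label; the abstract does not define the query form or the precision formula, so both are assumptions, as are all names and sample data.

```python
def query_precision(term: str, label: str, corpus: list[tuple[str, set[str]]]) -> float:
    """Fraction of documents matching `term` that carry `label` (assumed definition)."""
    matched = [labels for text, labels in corpus if term in text.lower().split()]
    if not matched:
        return 0.0
    return sum(label in labels for labels in matched) / len(matched)


def select_queries(candidates, corpus, threshold: float):
    """Keep only the queries whose precision score satisfies the threshold."""
    return [(t, l) for t, l in candidates if query_precision(t, l, corpus) >= threshold]


def tag(text: str, queries) -> set[str]:
    """Second method: label unlabeled text with every selected query that matches."""
    words = text.lower().split()
    return {label for term, label in queries if term in words}


# Hypothetical labeled corpus of merchant service descriptions.
corpus = [
    ("deep tissue massage and facial", {"spa"}),
    ("swedish massage session", {"spa"}),
    ("oil change and tire rotation", {"auto"}),
    ("facial oil treatment", {"spa"}),
]
candidates = [("massage", "spa"), ("oil", "auto"), ("tire", "auto")]
selected = select_queries(candidates, corpus, threshold=0.8)
labels = tag("hot stone massage", selected)
```

The query `("oil", "auto")` is dropped because "oil" also matches a spa document, giving it a precision of 0.5.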

    QUERY PROCESSING AND VISUALIZATION APPARATUSES, METHODS AND SYSTEMS

    Publication No.: US20240193198A1

    Publication Date: 2024-06-13

    Application No.: US18078034

    Filing Date: 2022-12-08

    IPC Classification: G06F16/338 G06F16/31

    Abstract: The QUERY PROCESSING AND VISUALIZATION APPARATUSES, METHODS AND SYSTEMS (“QPAV”) provides a platform that, in various embodiments, is configurable to receive, evaluate, and respond to queries over collections of structured and unstructured data, such as call records having associated metadata. Implementations provide for the generation of graphical representations of call networks, comprising nodes and links, in response to a received query which may comprise terms spoken in one or more call transcripts. The visual representation of query results may be enhanced by metadata, and may be configurable by the user to highlight particular connections, behaviors, or other insights associated with callers in the network.
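One way to picture the node-and-link result is a small sketch that filters call records by a spoken term and counts calls per caller pair. The record fields (`caller`, `callee`, `transcript`), the function name, and the sample data are hypothetical; the abstract does not describe QPAV's actual data model.

```python
def call_network(records: list[dict], term: str):
    """Build nodes and weighted links from calls whose transcript contains `term`."""
    nodes: set[str] = set()
    links: dict[tuple[str, str], int] = {}
    for rec in records:
        if term.lower() in rec["transcript"].lower().split():
            pair = (rec["caller"], rec["callee"])
            nodes.update(pair)
            # Link weight = number of matching calls between the pair.
            links[pair] = links.get(pair, 0) + 1
    return nodes, links


# Hypothetical call records with transcripts.
records = [
    {"caller": "alice", "callee": "bob", "transcript": "please wire the refund today"},
    {"caller": "bob", "callee": "carol", "transcript": "refund approved"},
    {"caller": "alice", "callee": "bob", "transcript": "refund status"},
    {"caller": "dave", "callee": "erin", "transcript": "lunch tomorrow"},
]
nodes, links = call_network(records, "refund")
```

The link weights could then drive visual emphasis, for example line thickness, when the network is rendered.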

    PRACTICAL SUPERVISED CLASSIFICATION OF DATA SETS

    Publication No.: US20240160652A1

    Publication Date: 2024-05-16

    Application No.: US18412703

    Filing Date: 2024-01-15

    Applicant: BASF SE

    Abstract: The present invention relates to information retrieval. In order to facilitate a search and identification of documents, there is provided a computer-implemented method for training a classifier model for data classification in response to a search query. The computer-implemented method comprises:

    a) obtaining a dataset that comprises a seed set of labeled data representing a training dataset;
    b) training the classifier model by using the training dataset to fit parameters of the classifier model;
    c) evaluating a quality of the classifier model using a test dataset that comprises unlabeled data from the obtained dataset to generate a classifier confidence score indicative of a probability of correctness of the classifier model working on the test dataset;
    d) determining a global risk value of misclassification and a reward value based on the classifier confidence score on the test dataset;
    e) iteratively updating the parameters of the classifier model and performing steps b) to d) until the global risk value falls within a predetermined risk limit value or an expected reward value is reached.
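
Steps a) to e) can be sketched as a training loop with a risk-based stopping rule. This is an illustration under stated assumptions, not the claimed method: the classifier is a one-dimensional logistic model, the confidence score is the mean probability assigned to the predicted class, and risk and reward are taken as `1 - confidence` and `confidence`. None of these definitions, names, or default limits come from the abstract.

```python
import math


def sigmoid(z: float) -> float:
    return 1.0 / (1.0 + math.exp(-z))


def train(params, labeled, lr=0.5, steps=50):
    """Step b): fit (w, b) of a 1-D logistic model by gradient descent."""
    w, b = params
    n = len(labeled)
    for _ in range(steps):
        grad_w = sum((sigmoid(w * x + b) - y) * x for x, y in labeled) / n
        grad_b = sum(sigmoid(w * x + b) - y for x, y in labeled) / n
        w, b = w - lr * grad_w, b - lr * grad_b
    return w, b


def confidence(params, unlabeled):
    """Step c): mean probability the model assigns to its own predicted class."""
    w, b = params
    probs = [sigmoid(w * x + b) for x in unlabeled]
    return sum(max(p, 1.0 - p) for p in probs) / len(probs)


def fit_until_safe(labeled, unlabeled, risk_limit=0.1, expected_reward=0.95, max_rounds=100):
    """Step e): repeat b) to d) until the risk limit or expected reward is met."""
    params = (0.0, 0.0)
    for rounds in range(1, max_rounds + 1):
        params = train(params, labeled)        # steps b) and e): update parameters
        score = confidence(params, unlabeled)  # step c)
        risk, reward = 1.0 - score, score      # step d): assumed definitions
        if risk <= risk_limit or reward >= expected_reward:
            break
    return params, risk, reward, rounds


# Step a): hypothetical seed set of labeled points plus unlabeled test points.
labeled = [(-2.0, 0), (-1.0, 0), (1.0, 1), (2.0, 1)]
unlabeled = [-3.0, -1.5, 1.5, 3.0]
params, risk, reward, rounds = fit_until_safe(labeled, unlabeled)
```

The `max_rounds` cap is an added safeguard so the loop terminates even if neither stopping condition is ever reached.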