INFORMATION PROCESSING APPARATUS, SYSTEM, METHOD, AND NON-TRANSITORY COMPUTER READABLE MEDIUM IN WHICH PROGRAM IS STORED

    Publication No.: US20240303433A1

    Publication date: 2024-09-12

    Application No.: US18269177

    Filing date: 2021-10-08

    Inventor: Kazuki Nakajima

    IPC classification: G06F40/216

    CPC classification: G06F40/216

    Abstract: An information processing apparatus includes: an input unit configured to input a menu name of a restaurant; an output unit configured to output a category name; a storage unit configured to store the menu name and the category name corresponding to the menu name so as to be associated with each other; a morphological analysis unit configured to execute morphological analysis to divide the inputted menu name into words and determine parts of speech; and a specification unit configured to generate, in a case where no category name corresponding to the inputted menu name is stored, a feature amount vector characterizing whether each of the words is included in the menu name or not as a result of the morphological analysis, and specify a category name corresponding to the inputted menu name based on a result of learning the menu name and the category name.
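The lookup-then-fallback flow in this abstract can be sketched as follows. This is a minimal illustration, not the patented implementation: the stored pairs, the whitespace tokenizer standing in for morphological analysis, and the overlap vote standing in for the learned model are all assumptions made for the example.

```python
from collections import Counter

# Hypothetical stored associations (menu name -> category name).
STORED = {
    "grilled chicken skewer": "grill",
    "grilled beef tongue": "grill",
    "draft beer": "drink",
    "lemon sour": "drink",
}


def tokenize(menu_name):
    # Assumption: a lowercase whitespace split stands in for morphological analysis.
    return menu_name.lower().split()


# Vocabulary of every word seen in the stored menu names, in a fixed order.
VOCAB = sorted({word for name in STORED for word in tokenize(name)})


def feature_vector(menu_name):
    # Binary vector: 1 if the vocabulary word occurs in the menu name, else 0.
    words = set(tokenize(menu_name))
    return [1 if word in words else 0 for word in VOCAB]


def specify_category(menu_name):
    # Step 1: return the stored category when the menu name is already known.
    key = menu_name.lower()
    if key in STORED:
        return STORED[key]
    # Step 2: otherwise vote by word overlap with the stored examples.
    # Assumption: this simple vote replaces whatever learned model is used.
    query = feature_vector(menu_name)
    scores = Counter()
    for name, category in STORED.items():
        stored_vec = feature_vector(name)
        scores[category] += sum(q & s for q, s in zip(query, stored_vec))
    category, best = scores.most_common(1)[0]
    return category if best > 0 else None
```

Calling `specify_category("grilled pork belly")` takes the fallback path because that name is not stored, and the shared word "grilled" decides the vote. A name with no known words yields `None` rather than an arbitrary category.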

    CLUSTERED METASEARCH
    Published patent application

    Publication No.: US20240265054A1

    Publication date: 2024-08-08

    Application No.: US18626067

    Filing date: 2024-04-03

    Abstract: A clustered metasearch system receives a search query from a user. The system uses Natural Language Processing to identify an object of the search query and descriptors of the search query. The system sorts the search into an applicable realm based on the object of the search query. The system then conducts the search across a variety of search engines and collects root domains from the search results. Root domains within the same realm as the search query are prioritized and additional factors such as the presence of descriptors in the result, the recency of the result, the search engine rank of the result, and the distance from the center of the realm are used to determine the final ranking of the results. The results are then displayed to a user.
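The ranking factors named in this abstract could be combined roughly as in the sketch below. The `Result` fields, the weights, and the two-label `root_domain` heuristic are hypothetical choices for illustration; the abstract does not specify how the factors are weighted or how root domains are extracted.

```python
from dataclasses import dataclass
from urllib.parse import urlparse


@dataclass
class Result:
    url: str
    snippet: str
    engine_rank: int       # 1 = top result from the originating search engine
    age_days: int          # recency of the result
    realm_distance: float  # 0.0 = center of the realm (hypothetical measure)


def root_domain(url):
    # Assumption: keep the last two host labels. A production system would
    # need a public suffix list to handle hosts such as "example.co.uk".
    host = urlparse(url).hostname or ""
    return ".".join(host.split(".")[-2:])


def score(result, realm_domains, descriptors):
    total = 0.0
    # Root domains in the same realm as the query are prioritized.
    if root_domain(result.url) in realm_domains:
        total += 10.0
    # Presence of query descriptors in the result text.
    text = result.snippet.lower()
    total += sum(1.0 for d in descriptors if d.lower() in text)
    # Newer results score higher.
    total += 1.0 / (1 + result.age_days / 365)
    # Better search engine rank scores higher.
    total += 1.0 / result.engine_rank
    # Results far from the center of the realm are penalized.
    total -= result.realm_distance
    return total


def rank(results, realm_domains, descriptors):
    return sorted(
        results,
        key=lambda r: score(r, realm_domains, descriptors),
        reverse=True,
    )
```

With these illustrative weights, a realm match outweighs all other factors combined, which mirrors the abstract's statement that in-realm root domains are prioritized and the remaining factors refine the order.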

    Detecting random and/or algorithmically-generated character sequences in domain names

    Publication No.: US12026469B2

    Publication date: 2024-07-02

    Application No.: US17529947

    Filing date: 2021-11-18

    Applicant: Proofpoint, Inc.

    Abstract: Aspects of the disclosure relate to detecting random and/or algorithmically-generated character sequences in domain names. A computing platform may train a machine learning model based on a set of semantically-meaningful words. Subsequently, the computing platform may receive a seed string and a set of domains to be analyzed in connection with the seed string. Based on the machine learning model, the computing platform may apply a classification algorithm to the seed string and the set of domains, where applying the classification algorithm to the seed string and the set of domains produces a classification result. Thereafter, the computing platform may store the classification result.
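One way to picture the train-then-classify flow in this abstract is a character bigram model, sketched below. This is an assumed stand-in, not the method claimed in the patent: the word list, the smoothing constant, the threshold, and the interpretation of the seed string (removed from the domain label so that only the remaining characters are scored) are all hypothetical.

```python
import math
from collections import Counter

# Hypothetical training set of semantically meaningful words.
WORDS = ["secure", "login", "account", "mail", "portal",
         "support", "online", "service", "update", "center"]

# Assumed symbol count for add-one smoothing:
# 26 letters + 10 digits + hyphen + start and end markers.
ALPHABET_SIZE = 39


def train(words):
    # Count character bigrams, with "^" and "$" marking word boundaries.
    pair_counts = Counter()
    first_counts = Counter()
    for word in words:
        padded = f"^{word}$"
        for a, b in zip(padded, padded[1:]):
            pair_counts[(a, b)] += 1
            first_counts[a] += 1
    return pair_counts, first_counts


def avg_log_prob(text, model):
    # Average smoothed log-probability of each character given the previous one.
    pair_counts, first_counts = model
    padded = f"^{text}$"
    pairs = list(zip(padded, padded[1:]))
    total = 0.0
    for a, b in pairs:
        p = (pair_counts[(a, b)] + 1) / (first_counts[a] + ALPHABET_SIZE)
        total += math.log(p)
    return total / len(pairs)


def classify(seed, domains, model, threshold=-3.4):
    # Assumption: the seed is stripped from the first label and the rest is scored.
    # The threshold is a hypothetical value tuned to this toy word list.
    results = {}
    for domain in domains:
        label = domain.lower().split(".")[0]
        rest = label.replace(seed.lower(), "").strip("-")
        verdict = "meaningful" if avg_log_prob(rest, model) >= threshold else "random"
        results[domain] = verdict
    return results


MODEL = train(WORDS)
```

For example, `classify("acme", ["acme-login.example", "acmexqzvkj.example"], MODEL)` scores the remainders "login" and "xqzvkj". The returned dictionary plays the role of the classification result that the abstract says is stored afterwards.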