Multilingual content recommendation pipeline

    公开(公告)号:US12124812B2

    公开(公告)日:2024-10-22

    申请号:US17510850

    申请日:2021-10-26

    CPC classification number: G06F40/56 G06F40/284 G06F40/47

    Abstract: A data processing system implements obtaining first textual content in a first language from a first client device; determining that the first language is supported by a first machine learning model; obtaining a guard list of prohibited terms associated with the first language; determining that the textual content does not include one or more prohibited terms associated based on the guard list; providing the first textual content as an input to the first machine learning model responsive to the textual content not including the one or more prohibited terms; analyzing the first textual content with the first machine learning model to obtain a first content recommendation; obtaining a first content recommendation policy that identifies content associated with the first language that may not be provided as a content recommendation; determining that the first content recommendation is not prohibited; and providing the first content recommendation to the first client device.

    Generating document summary
    3.
    发明授权

    公开(公告)号:US12050636B2

    公开(公告)日:2024-07-30

    申请号:US17056728

    申请日:2019-06-17

    Abstract: According to implementations of the subject matter described herein, there is provided a solution for generating a summary of a document. In this solution, feature information of pages comprised in a document is extracted, which characterizes at least one type of content contained in each page. Respective importance of the pages is determined at least based on the extracted feature information. A summary of the document is generated for the document by selecting a predetermined number of pages less than the number of the pages based on the respective importance. Through the solution, instead of providing all the pages, pages containing important content may be determined automatically to serve as the summary of the document. This summary allows the user to learn quickly main content of the document, shorten the time consumed in browsing all documents and/or facilitate location of a document of interest as soon as possible.

Patent Agency Ranking