Generating time-based recaps of documents using a deep learning sequence to sequence model

    公开(公告)号:US11604924B2

    公开(公告)日:2023-03-14

    申请号:US17148125

    申请日:2021-01-13

    申请人: Adobe, Inc.

    摘要: Techniques are provided herein for generating improved document summaries that consider the amount of time that has passed since the user last accessed the document. The length of time that has passed since the user has accessed each previous portion of the document is used as a variable to determine how much the summary should focus on each of the previously read sections of the document. When a document is accessed by a user, a relevance score is assigned to content from previously accessed sections of that document, where the relevance score is weighted based on how long ago each of the sections was accessed by the user. Once the various content items of previous sections have been provided relevance scores, selected sentences with the highest relevance scores are fed to a deep learning sequence-to-sequence model is used to build the document summary.

    Digital Content Text Processing and Review Techniques

    公开(公告)号:US20220114624A1

    公开(公告)日:2022-04-14

    申请号:US17066886

    申请日:2020-10-09

    申请人: Adobe Inc.

    摘要: Digital content text processing techniques are described. In one example, a text corpus is extracted from digital content and text corpus keywords are identified that are included in the text corpus. A plurality of clusters is formed based on the text corpus keywords. Cluster scores are generated for reviews that define a probability the review belongs to a respective cluster, e.g., based on review keywords extracted from the reviews. Sentiment values and sentiment weights are also generated. The sentiment values describe a sentiment that each of the reviews exhibits towards a respective cluster, e.g., a type of sentiment such as positive, neutral, or negative. The sentiment weights describe an amount of weight to be applied for each sentiment with respect to that cluster. The service provider system then generates ranking scores based on the cluster scores and the sentiment scores which are used to control output of the reviews.

    GENERATING TIME-BASED RECAPS OF DOCUMENTS USING A DEEP LEARNING SEQUENCE TO SEQUENCE MODEL

    公开(公告)号:US20220222439A1

    公开(公告)日:2022-07-14

    申请号:US17148125

    申请日:2021-01-13

    申请人: Adobe, Inc.

    IPC分类号: G06F40/289

    摘要: Techniques are provided herein for generating improved document summaries that consider the amount of time that has passed since the user last accessed the document. The length of time that has passed since the user has accessed each previous portion of the document is used as a variable to determine how much the summary should focus on each of the previously read sections of the document. When a document is accessed by a user, a relevance score is assigned to content from previously accessed sections of that document, where the relevance score is weighted based on how long ago each of the sections was accessed by the user. Once the various content items of previous sections have been provided relevance scores, selected sentences with the highest relevance scores are fed to a deep learning sequence-to-sequence model is used to build the document summary.

    Asides detection in documents
    4.
    发明授权

    公开(公告)号:US11256913B2

    公开(公告)日:2022-02-22

    申请号:US16598680

    申请日:2019-10-10

    申请人: Adobe Inc.

    摘要: Techniques are disclosed for identifying asides within a document, and detecting a display order of contents based of the identified asides. In a document, an “aside” represents a content region of the document that is distinct from the main content regions, and may be visually distinguishable from the main content region. In an example, a document is received, where the document lacks identification of asides. The document is analyzed to identify asides within the document. A display order of contents within the document is then determined, based on the identified asides. For example, in the display order, the asides are ordered between two segments of the main content and/or at a beginning or an end of the main content, but may not be ordered to be embedded in between a segment of the main content. The document is displayed in accordance with the display order.

    ASIDES DETECTION IN DOCUMENTS
    5.
    发明申请

    公开(公告)号:US20220172501A1

    公开(公告)日:2022-06-02

    申请号:US17651433

    申请日:2022-02-17

    申请人: Adobe Inc.

    摘要: Techniques are disclosed for identifying asides within a document, and detecting a display order of contents based of the identified asides. In a document, an “aside” represents a content region of the document that is distinct from the main content regions, and may be visually distinguishable from the main content region. In an example, a document is received, where the document lacks identification of asides. The document is analyzed to identify asides within the document. A display order of contents within the document is then determined, based on the identified asides. For example, in the display order, the asides are ordered between two segments of the main content and/or at a beginning or an end of the main content, but may not be ordered to be embedded in between a segment of the main content. The document is displayed in accordance with the display order.

    ASIDES DETECTION IN DOCUMENTS
    6.
    发明申请

    公开(公告)号:US20210110151A1

    公开(公告)日:2021-04-15

    申请号:US16598680

    申请日:2019-10-10

    申请人: Adobe Inc.

    IPC分类号: G06K9/00 G06F17/21

    摘要: Techniques are disclosed for identifying asides within a document, and detecting a display order of contents based of the identified asides. In a document, an “aside” represents a content region of the document that is distinct from the main content regions, and may be visually distinguishable from the main content region. In an example, a document is received, where the document lacks identification of asides. The document is analyzed to identify asides within the document. A display order of contents within the document is then determined, based on the identified asides. For example, in the display order, the asides are ordered between two segments of the main content and/or at a beginning or an end of the main content, but may not be ordered to be embedded in between a segment of the main content. The document is displayed in accordance with the display order.