CONSTRUCTING CONTENT BASED ON MULTI-SENTENCE COMPRESSION OF SOURCE CONTENT

    公开(公告)号:US20190197184A1

    公开(公告)日:2019-06-27

    申请号:US15854320

    申请日:2017-12-26

    Applicant: ADOBE INC.

    CPC classification number: G06F16/334 G06F16/338 G06F17/2705 G06F17/277

    Abstract: Embodiments of the present invention provide systems, methods, and computer storage media directed to facilitating corpus-based content generation, in particular, using graph-based multi-sentence compression to generate a final content output. In one embodiment, pre-existing source content is identified and retrieved from a corpus. The source content is then parsed into sentence tokens, mapped and weighted. The sentence tokens are further parsed into word tokens and weighted. The mapped word tokens are then compressed into candidate sentences to be used in a final content. The final content is assembled using ranked candidate sentences, such that the final content is organized to reduce information redundancy and optimize content cohesion.

    SOCIAL BIAS MITIGATION IN TEXTUAL MODELS

    公开(公告)号:US20220147713A1

    公开(公告)日:2022-05-12

    申请号:US17092230

    申请日:2020-11-07

    Applicant: Adobe Inc.

    Abstract: A system for generating text using a trained language model comprises an encoder that includes a debiased language model that penalizes generated text based on an equalization loss that quantifies first and second probabilities of respective first and second tokens occurring at a first point in the generated text. The first and second tokens define respective first and second groups of people. The system further comprises a decoder configured to generate text using the debiased language model. The decoder is further configured to penalize the generated text based on a bias penalization loss that quantifies respective probabilities of the first and second tokens co-occurring with a generated word. The encoder and decoder are trained to produce the generated text using a task-specific training corpus.

    Stylistic Text Rewriting for a Target Author

    公开(公告)号:US20210264109A1

    公开(公告)日:2021-08-26

    申请号:US16800018

    申请日:2020-02-25

    Applicant: Adobe Inc.

    Abstract: Rewriting text in the writing style of a target author is described. A stylistic rewriting system receives input text and an indication of the target author. The system trains a language model to understand the target author's writing style using a corpus of text associated with the target author. The language model may be transformer-based, and is first trained on a different corpus of text associated with a range of different authors to understand linguistic nuances of a particular language. Copies of the language model are then cascaded into an encoder-decoder framework, which is further trained using a masked language modeling objective and a noisy version of the target author corpus. After training, the encoder-decoder framework of the trained language model automatically rewrites input text in the writing style of the target author and outputs the rewritten text as stylized text.

    Generating summary content tuned to a target characteristic using a word generation model

    公开(公告)号:US11062087B2

    公开(公告)日:2021-07-13

    申请号:US16262655

    申请日:2019-01-30

    Applicant: Adobe Inc.

    Abstract: Certain embodiments involve tuning summaries of input text to a target characteristic using a word generation model. For example, a method for generating a tuned summary using a word generation model includes generating a learned subspace representation of input text and a target characteristic token associated with the input text by applying an encoder to the input text and the target characteristic token. The method also includes generating, by a decoder, each word of a tuned summary of the input text from the learned subspace representation and from a feedback about preceding words of the tuned summary. The tuned summary is tuned to target characteristics represented by the target characteristic token.

Patent Agency Ranking