GENERATING DIGITAL CONTENT
    1.
    发明公开

    公开(公告)号:US20240362427A1

    公开(公告)日:2024-10-31

    申请号:US18308907

    申请日:2023-04-28

    Applicant: Adobe Inc.

    CPC classification number: G06F40/56 G06F40/106 G06F40/169

    Abstract: In implementations of systems for generating digital content, a computing device implements a generation system to receive a user input specifying a characteristic for digital content. The generation system generates input text based on the characteristic for processing by a first machine learning model. Output text generated by the first machine learning model based on processing the input text is received. The output text describes a digital content component. The generation system generates the digital content component by processing the output text using a second machine learning model. The generation system generates the digital content including the digital content component for display in a user interface based on the characteristic.

    Pose-invariant Visual Speech Recognition Using A Single View Input

    公开(公告)号:US20200294507A1

    公开(公告)日:2020-09-17

    申请号:US16298933

    申请日:2019-03-11

    Applicant: Adobe Inc.

    Inventor: Yaman Kumar

    Abstract: A pose-invariant visual speech recognition system obtains a single view input of a speaker, such as a single video stream captured by a single camera. The single view input provides a particular pose of the speaker, which refers to a view angle, relative to the lens or image capture component of the camera that captured the video of the speaker, at which the speaker's face is captured. The pose of the speaker is used to select a visual speech recognition model to use to generate a text label that is the words spoken by the speaker. One or more additional view angles of the speaker are also generated from the single view input of the speaker. These one or more additional view angles, along with the single view input of the speaker, are used by the selected visual speech recognition model to generate the text label for the speaker.

    SYSTEMS AND METHODS FOR GENERATING SCANPATHS

    公开(公告)号:US20240273377A1

    公开(公告)日:2024-08-15

    申请号:US18109990

    申请日:2023-02-15

    Applicant: Adobe Inc.

    Abstract: Some embodiments described herein relate to a training module comprising a scanpath generation model training system. The training module may be used to generate a scanpath generation model. The training module may comprise an adversarial training neural network. Using training data, which includes a text input and a recorded scanpath corresponding to the text input, the adversarial training neural network is trained to generate a scanpath generation model. A scanpath may comprise a sequence of words and a corresponding sequence of fixation durations, wherein the sequence of words comprises one or more words comprising the text input. The training module may then output the trained scanpath generation model.

Patent Agency Ranking