Digital media environment for conversational image editing and enhancement

    Publication No.: US11972757B2

    Publication date: 2024-04-30

    Application No.: US18149286

    Application date: 2023-01-03

    Applicant: Adobe Inc.

    Abstract: Conversational image editing and enhancement techniques are described. For example, an indication of a digital image is received from a user. Aesthetic attribute scores for multiple aesthetic attributes of the image are generated. A computing device then conducts a natural language conversation with the user to edit the digital image. The computing device receives inputs from the user to refine the digital image as the natural language conversation progresses. The computing device generates natural language suggestions to edit the digital image based on the aesthetic attribute scores as part of the natural language conversation. The computing device provides feedback to the user that includes edits to the digital image based on the series of inputs. The computing device also includes as feedback natural language outputs indicating options for additional edits to the digital image based on the series of inputs and the previous edits to the digital image.
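
    Illustrative sketch (not taken from the patent): the abstract says natural language suggestions are generated from aesthetic attribute scores. One simple way this could look is shown below. The attribute names, the 0.5 threshold, the `Suggestion` type, and the `suggest_edits` function are all assumptions made for illustration; the patent abstract does not specify how scores map to suggestions.

```python
from dataclasses import dataclass


@dataclass
class Suggestion:
    attribute: str  # hypothetical aesthetic attribute name
    score: float    # assumed to lie in [0, 1], higher is better
    text: str       # natural language prompt shown to the user


def suggest_edits(scores: dict[str, float],
                  threshold: float = 0.5,
                  limit: int = 2) -> list[Suggestion]:
    """Turn the lowest-scoring attributes into conversational suggestions.

    Attributes scoring below `threshold` are considered weak; the weakest
    `limit` of them are offered to the user, worst first.
    """
    weak = sorted((score, name) for name, score in scores.items()
                  if score < threshold)
    return [
        Suggestion(
            attribute=name,
            score=score,
            text=(f"The {name} of this image scores low ({score:.2f}). "
                  f"Would you like me to adjust it?"),
        )
        for score, name in weak[:limit]
    ]


if __name__ == "__main__":
    example_scores = {
        "lighting": 0.30,
        "color harmony": 0.80,
        "composition": 0.45,
        "sharpness": 0.20,
    }
    for s in suggest_edits(example_scores):
        print(s.text)
```

    In a conversational loop, the user's reply to each suggestion would drive the next edit, and the scores would be recomputed on the edited image before further suggestions are offered.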

    Collecting multimodal image editing requests

    Publication No.: US10769495B2

    Publication date: 2020-09-08

    Application No.: US16052246

    Application date: 2018-08-01

    Applicant: Adobe Inc.

    Abstract: In implementations of collecting multimodal image editing requests (IERs), a user interface is generated that exposes an image pair including a first image and a second image including at least one edit to the first image. A user simultaneously speaks a voice command and performs a user gesture that describe an edit of the first image used to generate the second image. The user gesture and the voice command are simultaneously recorded and synchronized with timestamps. The voice command is played back, and the user transcribes their voice command based on the play back, creating an exact transcription of their voice command. Audio samples of the voice command with respective timestamps, coordinates of the user gesture with respective timestamps, and a transcription are packaged as a structured data object for use as training data to train a neural network to recognize multimodal IERs in an image editing application.
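
    Illustrative sketch (not taken from the patent): the abstract describes packaging timestamped audio samples, timestamped gesture coordinates, and a transcription into a structured data object. A minimal version of such an object might look like the following. The field names, the `IERRecord` class, the `package_ier` helper, and the JSON serialization are assumptions; the abstract does not specify a schema or format.

```python
import json
from dataclasses import asdict, dataclass, field


@dataclass
class AudioSample:
    t: float      # timestamp in seconds
    value: float  # sample amplitude (hypothetical representation)


@dataclass
class GesturePoint:
    t: float  # timestamp in seconds, on the same clock as the audio
    x: float  # gesture coordinates on the displayed image
    y: float


@dataclass
class IERRecord:
    image_pair_id: str  # hypothetical identifier for the image pair
    transcription: str  # the user's own transcription of the voice command
    audio: list[AudioSample] = field(default_factory=list)
    gesture: list[GesturePoint] = field(default_factory=list)

    def to_json(self) -> str:
        # asdict() recurses into the nested dataclass lists.
        return json.dumps(asdict(self), sort_keys=True)


def package_ier(image_pair_id: str,
                transcription: str,
                audio: list[AudioSample],
                gesture: list[GesturePoint]) -> IERRecord:
    """Bundle one multimodal request, ordering both streams by timestamp."""
    return IERRecord(
        image_pair_id=image_pair_id,
        transcription=transcription.strip(),
        audio=sorted(audio, key=lambda s: s.t),
        gesture=sorted(gesture, key=lambda p: p.t),
    )


if __name__ == "__main__":
    record = package_ier(
        "pair-001",
        "make the sky bluer",
        [AudioSample(0.00, 0.0), AudioSample(0.02, 0.1)],
        [GesturePoint(0.01, 10, 20), GesturePoint(0.03, 12, 24)],
    )
    print(record.to_json())
```

    Keeping both streams on a shared clock is what lets a training pipeline align a spoken phrase with the region the user gestured at.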

    Collecting Multimodal Image Editing Requests
    Invention application

    Publication No.: US20200042286A1

    Publication date: 2020-02-06

    Application No.: US16052246

    Application date: 2018-08-01

    Applicant: Adobe Inc.

    Abstract: In implementations of collecting multimodal image editing requests (IERs), a user interface is generated that exposes an image pair including a first image and a second image including at least one edit to the first image. A user simultaneously speaks a voice command and performs a user gesture that describe an edit of the first image used to generate the second image. The user gesture and the voice command are simultaneously recorded and synchronized with timestamps. The voice command is played back, and the user transcribes their voice command based on the play back, creating an exact transcription of their voice command. Audio samples of the voice command with respective timestamps, coordinates of the user gesture with respective timestamps, and a transcription are packaged as a structured data object for use as training data to train a neural network to recognize multimodal IERs in an image editing application.

    Method and apparatus for recommending hashtags

    Publication No.: US10235464B2

    Publication date: 2019-03-19

    Application No.: US14703889

    Application date: 2015-05-05

    Applicant: Adobe Inc.

    Abstract: A method for recommending hashtags includes determining keywords from a post planned for publishing by a publisher. An input criteria comprising at least one of age group, geographical location, date range, or a keyword is received. Previous posts associated with the keywords and satisfying the input criteria are obtained. The previous posts are categorized into one or more categories based on sentiment of each post and for each category hashtags used in the obtained previous posts in that category are determined. The hashtags are ranked based on predefined criteria comprising at least one of frequency of appearance of respective hashtag in posts, number of likes or shares or retweets of post comprising respective hashtag, number of followers of person who used respective hashtag, or sentiment of post comprising respective hashtag. The hashtags are then recommended, based on ranking, to the publisher for use with the post planned for publishing.
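
    Illustrative sketch (not taken from the patent): the abstract describes grouping previous posts by sentiment, collecting the hashtags used in each group, and ranking them on criteria such as frequency and likes. The sketch below ranks by frequency first and total likes second. The `Post` type, the sentiment labels, the `rank_hashtags` function, and this particular ordering of criteria are assumptions; keyword extraction, input-criteria filtering, and sentiment classification are assumed to have happened upstream.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass
class Post:
    sentiment: str       # category label, e.g. "positive" (assumed precomputed)
    hashtags: list[str]  # hashtags appearing in the post
    likes: int = 0       # engagement signal used as a tie-breaker


def rank_hashtags(posts: list[Post], top_n: int = 3) -> dict[str, list[str]]:
    """Return the top hashtags per sentiment category.

    Ranking key: posts containing the tag (descending), then total likes
    of those posts (descending), then the tag itself for a stable order.
    """
    # stats[category][tag] = [post_count, total_likes]
    stats = defaultdict(lambda: defaultdict(lambda: [0, 0]))
    for post in posts:
        for tag in set(post.hashtags):  # count each tag once per post
            entry = stats[post.sentiment][tag]
            entry[0] += 1
            entry[1] += post.likes

    ranked = {}
    for category, tags in stats.items():
        ordered = sorted(tags.items(),
                         key=lambda kv: (-kv[1][0], -kv[1][1], kv[0]))
        ranked[category] = [tag for tag, _ in ordered[:top_n]]
    return ranked


if __name__ == "__main__":
    history = [
        Post("positive", ["#sale", "#new"], likes=10),
        Post("positive", ["#sale"], likes=5),
        Post("positive", ["#launch"], likes=50),
        Post("negative", ["#fail"], likes=1),
    ]
    for category, tags in rank_hashtags(history, top_n=2).items():
        print(category, tags)
```

    Other criteria named in the abstract, such as shares, retweets, or follower counts, could be folded into the sort key in the same way.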

    Digital media environment for conversational image editing and enhancement

    Publication No.: US11574630B2

    Publication date: 2023-02-07

    Application No.: US17015765

    Application date: 2020-09-09

    Applicant: Adobe Inc.

    Abstract: Conversational image editing and enhancement techniques are described. For example, an indication of a digital image is received from a user. Aesthetic attribute scores for multiple aesthetic attributes of the image are generated. A computing device then conducts a natural language conversation with the user to edit the digital image. The computing device receives inputs from the user to refine the digital image as the natural language conversation progresses. The computing device generates natural language suggestions to edit the digital image based on the aesthetic attribute scores as part of the natural language conversation. The computing device provides feedback to the user that includes edits to the digital image based on the series of inputs. The computing device also includes as feedback natural language outputs indicating options for additional edits to the digital image based on the series of inputs and the previous edits to the digital image.
