Classifying Structural Features of a Digital Document by Feature Type using Machine Learning

    公开(公告)号:US20200302016A1

    公开(公告)日:2020-09-24

    申请号:US16359402

    申请日:2019-03-20

    Applicant: Adobe Inc.

    Abstract: Classifying structural features of a digital document by feature type using machine learning is leveraged in a digital medium environment. A document analysis system is leveraged to extract structural features from digital documents, and to classifying the structural features by respective feature types. To do this, the document analysis system employs a character analysis model and a classification model. The character analysis model takes text content from a digital document and generates text vectors that represent the text content. A vector sequence is generated based on the text vectors and position information for structural features of the digital document, and the classification model processes the vector sequence to classify the structural features into different feature types. The document analysis system can generate a modifiable version of the digital document that enables its structural features to be modified based on their respective feature types.

    Conversational agent for search
    12.
    发明授权

    公开(公告)号:US10713317B2

    公开(公告)日:2020-07-14

    申请号:US15419497

    申请日:2017-01-30

    Applicant: ADOBE INC.

    Abstract: A conversational agent facilitates conversational searches for users. The conversational agent is a reinforcement learning (RL) agent trained using a user model generated from existing session logs from a search engine. The user model is generated from the session logs by mapping entries from the session logs to user actions understandable by the RL agent and computing conditional probabilities of user actions occurring given previous user actions in the session logs. The RL agent is trained by conducting conversations with the user model in which the RL agent selects agent actions in response to user actions sampled using the conditional probabilities from the user model.

    Semantics-aware hybrid encoder for improved related conversations

    公开(公告)号:US12223002B2

    公开(公告)日:2025-02-11

    申请号:US17454445

    申请日:2021-11-10

    Applicant: ADOBE INC.

    Abstract: A method of finding online relevant conversing posts, comprises receiving, by a web server serving an online forum, a query post from an inquirer using the online forum, computing a contextual similarity score between each conversing post of a set of conversing posts with a query post, wherein the contextual similarity score is computed between the body of each of conversing posts and of the query post, wherein N1 conversing posts with a highest contextual similarity score are selected; computing a fine grained similarity score between the subject of the query post and of each of the N1 conversing posts, wherein N2 conversing posts with a highest fine grained similarity score are selected; and boosting the fine grained similarity score of the N2 conversing posts based on relevance metrics, wherein N3 highest ranked conversing posts are selected as a list of conversing posts most relevant to the query post.

    Form structure similarity detection

    公开(公告)号:US12124497B1

    公开(公告)日:2024-10-22

    申请号:US18190686

    申请日:2023-03-27

    Applicant: Adobe Inc.

    CPC classification number: G06F16/383 G06F16/332 G06V30/19147 G06V30/412

    Abstract: Form structure similarity detection techniques are described. A content processing system, for instance, receives a query snippet that depicts a query form structure. The content processing system generates a query layout string that includes semantic indicators to represent the query form structure and generates candidate layout strings that represent form structures from a target document. The content processing system calculates similarity scores between the query layout string and the candidate layout strings. Based on the similarity scores, the content processing system generates a target snippet for display that depicts a form structure that is structurally similar to the query form structure. The content processing system is further operable to generate a training dataset that includes image pairs of snippets depicting form structures that are structurally similar. The content processing system utilizes the training dataset to train a machine learning model to perform form structure similarity matching.

    Form structure extraction by predicting associations

    公开(公告)号:US12086728B2

    公开(公告)日:2024-09-10

    申请号:US18135948

    申请日:2023-04-18

    Applicant: Adobe Inc.

    CPC classification number: G06N5/04 G06N3/08 G06N20/00 G06N20/10 G06V10/82

    Abstract: Techniques described herein extract form structures from a static form to facilitate making that static form reflowable. A method described herein includes accessing low-level form elements extracted from a static form. The method includes determining, using a first set of prediction models, second-level form elements based on the low-level form elements. Each second-level form element includes a respective one or more low-level form elements. The method further includes determining, using a second set of prediction models, high-level form elements based on the second-level form elements and the low-level form elements. Each high-level form element includes a respective one or more second-level form elements or low-level form elements. The method further includes generating a reflowable form based on the static form by, for each high-level form element, linking together the respective one or more second-level form elements or low-level form elements.

    Language model with external knowledge base

    公开(公告)号:US11997056B2

    公开(公告)日:2024-05-28

    申请号:US17897419

    申请日:2022-08-29

    Applicant: ADOBE INC.

    CPC classification number: H04L51/02 G06F40/295 G06N5/022

    Abstract: The technology described herein receives a natural-language sequence of words comprising multiple entities. The technology then identifies a plurality of entities in the natural-language sequence. The technology generates a masked natural-language sequence by masking a first entity in the natural-language sequence. The technology retrieves, from a knowledge base, information related to a second entity in the plurality of entities. The technology then trains a natural-language model to respond to a query. The training uses a first representation of the masked natural-language sequence, a second representation of the information, and the first entity.

    SELF-SUPERVISED HIERARCHICAL EVENT REPRESENTATION LEARNING

    公开(公告)号:US20230154186A1

    公开(公告)日:2023-05-18

    申请号:US17455126

    申请日:2021-11-16

    Applicant: ADOBE INC.

    CPC classification number: G06K9/00718 G06K9/00751 G06N3/088 G06K2009/00738

    Abstract: Systems and methods for video processing are described. Embodiments of the present disclosure generate a plurality of image feature vectors corresponding to a plurality of frames of a video; generate a plurality of low-level event representation vectors based on the plurality of image feature vectors, wherein a number of the low-level event representation vectors is less than a number of the image feature vectors; generate a plurality of high-level event representation vectors based on the plurality of low-level event representation vectors, wherein a number of the high-level event representation vectors is less than the number of the low-level event representation vectors; and identify a plurality of high-level events occurring in the video based on the plurality of high-level event representation vectors.

    GENERATING COMMONSENSE CONTEXT FOR TEXT USING KNOWLEDGE GRAPHS

    公开(公告)号:US20230153534A1

    公开(公告)日:2023-05-18

    申请号:US17526824

    申请日:2021-11-15

    Applicant: ADOBE INC.

    CPC classification number: G06F40/295 G06F16/3329 G06N20/00

    Abstract: Methods and systems are provided for facilitating generation and utilization of a commonsense contextualizing machine learning (ML) model, in accordance with embodiments described herein. In embodiments, a commonsense contextual ML model is trained by fine-tuning a pre-trained language model using a set of training path-sentence pairs. Each training path-sentence pair includes a commonsense path, identified via a commonsense knowledge graph, and a natural language sentence identified as contextually related to the commonsense path. The trained commonsense contextualizing ML model can then be used to generate a commonsense inference path for a text input. Such a commonsense inference path can include a sequence of entities and relations that provide commonsense context to the text input. Thereafter, the commonsense inference path can be provided to a natural language processing system for use in performing a natural language processing task.

    Interactive search experience using machine learning

    公开(公告)号:US11971884B2

    公开(公告)日:2024-04-30

    申请号:US17656772

    申请日:2022-03-28

    Applicant: Adobe Inc.

    Abstract: An interactive search session is implemented using an artificial intelligence model. For example, when the artificial intelligence model receives a search query from a user, the model selects an action from a plurality of actions based on the search query. The selected action queries the user for more contextual cues about the search query (e.g., may enquire about use of the search results, may request to refine the search query, or otherwise engage the user in conversation to better understand the intent of the search). The interactive search session may be in the form, for example, of a chat session between the user and the system, and the chat session may be displayed along with the search results (e.g., in a separate section of display). The interactive search session may enable the system to better understand the user's search needs, and accordingly may help provide more focused search results.

Patent Agency Ranking