-
1.
公开(公告)号:US20190286978A1
公开(公告)日:2019-09-19
申请号:US15921369
申请日:2018-03-14
Applicant: Adobe Inc.
Inventor: Milan Aggarwal , Balaji Krishnamurthy , Shagun Sodhani
Abstract: Systems and techniques map an input field from a data schema to a hierarchical standard data model (XDM). The XDM includes a tree of single XDM fields and each of the single XDM fields includes a composition of single level XDM fields. An input field from a data schema is processed by an unsupervised learning algorithm to obtain a sequence of vectors representing the input field and a sequence of vectors representing single level hierarchical standard data model (XDM) fields. These vectors are processed by a neural network to obtain a similarity score between the input field and each of the single level XDM fields. A probability of a match is determined using the similarity score between the input field and each of the single level XDM fields. The input field is mapped to the XDM field having the probability of the match with a highest score.
-
2.
公开(公告)号:US20190188463A1
公开(公告)日:2019-06-20
申请号:US15843953
申请日:2017-12-15
Applicant: Adobe Inc.
Inventor: Shagun Sodhani , Kartikay Garg , Balaji Krishnamurthy
CPC classification number: G06K9/00456 , G06K9/00463 , G06K9/6262 , G06K9/627 , G06K2209/01 , G06N3/0445 , G06N3/0481 , G06N3/084
Abstract: Techniques for determining reading order in a document. A current labeled text run (R1), RIGHT text run (R1) and DOWN text run (R3) are generated. The R1 labeled text run is processed by a first LSTM, the R2 labeled text run is processed by a second LSTM, and the R3 labeled text run is processed by a third LSTM, wherein each of the LSTMs generates a respective internal representation (R1′, R2′ and R3′). Deep learning tools other than LSTMs can be used, as will be appreciated. The respective internal representations R1′, R2′ and R3′ are concatenated or otherwise combined into a vector or tensor representation and provided to a classifier network that generates a predicted label for a next text run as RIGHT, DOWN or EOS in the reading order of the document.
-
公开(公告)号:US10915701B2
公开(公告)日:2021-02-09
申请号:US15925059
申请日:2018-03-19
Applicant: Adobe Inc.
Inventor: Shagun Sodhani , Kartikay Garg , Balaji Krishnamurthy
IPC: G06F17/24 , G06F40/174 , G06N3/08 , G06F16/33
Abstract: Caption association techniques as part of digital content creation by a computing device are described. The computing device is configured to extract text features and bounding boxes from an input document. These text features and bounding boxes are processed to reduce a number of possible search spaces. The processing may involve generating and utilizing a language model that captures the semantic meaning of the text features to identify and filter static text, and may involve identifying and filtering inline captions. A number of bounding boxes are identified for a potential caption. The potential caption and corresponding identified bounding boxes are concatenated into a vector. The concatenated vector is used to identify relationships among the bounding boxes to determine a single bounding box associated with the caption. The determined association is utilized to generate an output digital document that includes a structured association between the caption and a data entry field.
-
公开(公告)号:US20190171906A1
公开(公告)日:2019-06-06
申请号:US15831160
申请日:2017-12-04
Applicant: Adobe Inc.
Inventor: Shagun Sodhani , Nikaash Puri
CPC classification number: G06K9/6215 , G06F16/583 , G06F16/5838 , G06K9/623
Abstract: In implementations of digital image search based on arbitrary image features, a server computing device maintains an images database of digital images, and includes an image search system that receives a search input as a digital image depicting image features, and receives search criteria of depicted image features in the digital image. The image search system can then determine similar images to the received digital image based on similarity criterion corresponding to the search criteria. A trained image model of the image search system is applied to determine an image feature representation of the received digital image. A feature mask model of the image search system is applied to the image feature representation to determine a masked feature representation of the received digital image. The masked feature representation of the received digital image is compared to a masked feature representation of each respective database image to identify the similar images.
-
公开(公告)号:US10558887B2
公开(公告)日:2020-02-11
申请号:US15831160
申请日:2017-12-04
Applicant: Adobe Inc.
Inventor: Shagun Sodhani , Nikaash Puri
IPC: G06K9/62 , G06F16/583
Abstract: In implementations of digital image search based on arbitrary image features, a server computing device maintains an images database of digital images, and includes an image search system that receives a search input as a digital image depicting image features, and receives search criteria of depicted image features in the digital image. The image search system can then determine similar images to the received digital image based on similarity criterion corresponding to the search criteria. A trained image model of the image search system is applied to determine an image feature representation of the received digital image. A feature mask model of the image search system is applied to the image feature representation to determine a masked feature representation of the received digital image. The masked feature representation of the received digital image is compared to a masked feature representation of each respective database image to identify the similar images.
-
6.
公开(公告)号:US10423828B2
公开(公告)日:2019-09-24
申请号:US15843953
申请日:2017-12-15
Applicant: Adobe Inc.
Inventor: Shagun Sodhani , Kartikay Garg , Balaji Krishnamurthy
Abstract: Techniques for determining reading order in a document. A current labeled text run (R1), RIGHT text run (R1) and DOWN text run (R3) are generated. The R1 labeled text run is processed by a first LSTM, the R2 labeled text run is processed by a second LSTM, and the R3 labeled text run is processed by a third LSTM, wherein each of the LSTMs generates a respective internal representation (R1′, R2′ and R3′). Deep learning tools other than LSTMs can be used, as will be appreciated. The respective internal representations R1′, R2′ and R3′ are concatenated or otherwise combined into a vector or tensor representation and provided to a classifier network that generates a predicted label for a next text run as RIGHT, DOWN or EOS in the reading order of the document.
-
公开(公告)号:US10902322B2
公开(公告)日:2021-01-26
申请号:US15660547
申请日:2017-07-26
Applicant: Adobe Inc.
Inventor: Shagun Sodhani , Balaji Krishnamurthy
Abstract: A standardized data model (“SDM”) includes standardized data types that indicate classifications of data elements. In a data service platform, such as a marketing data platform, a data standardization module classifies received data elements. One or more components included in the data standardization module are trained using supervised or unsupervised learning techniques to classify received data elements into a standardized data type included in the SDM. In some cases, an output of an unsupervised learning phase is provided as an input to a supervised learning phase. In some cases, a classified data element is modified by the data standardization module to indicate the standardized data type into which the data element is classified.
-
公开(公告)号:US10536580B2
公开(公告)日:2020-01-14
申请号:US15705042
申请日:2017-09-14
Applicant: ADOBE INC.
Inventor: Nikaash Puri , Shagun Sodhani
Abstract: Some implementations provide a feature recommendation system that receives sequences from user sessions with applications, where each sequence is of features of the applications in an order the features were used by a user. The sequences are applied to a feature embedding model that learns semantic similarities between the features based on occurrences of the features in the sequences in a same user session. A request is received for a feature recommendation that identifies a feature of an application used by a given user in a user session. A recommended feature for the feature recommendation is determined from a set of the semantic similarities that are between the identified feature and others of the features. The feature recommendation is presented on a user device associated with the given user.
-
公开(公告)号:US10713317B2
公开(公告)日:2020-07-14
申请号:US15419497
申请日:2017-01-30
Applicant: ADOBE INC.
Inventor: Balaji Krishnamurthy , Shagun Sodhani , Aarushi Arora , Milan Aggarwal
IPC: G06F16/9535 , G06F16/9032 , G06N3/00 , G06N20/00 , G06F40/30 , G06F40/35 , G06N3/08 , G06N7/00
Abstract: A conversational agent facilitates conversational searches for users. The conversational agent is a reinforcement learning (RL) agent trained using a user model generated from existing session logs from a search engine. The user model is generated from the session logs by mapping entries from the session logs to user actions understandable by the RL agent and computing conditional probabilities of user actions occurring given previous user actions in the session logs. The RL agent is trained by conducting conversations with the user model in which the RL agent selects agent actions in response to user actions sampled using the conditional probabilities from the user model.
-
公开(公告)号:US20190286691A1
公开(公告)日:2019-09-19
申请号:US15925059
申请日:2018-03-19
Applicant: Adobe Inc.
Inventor: Shagun Sodhani , Kartikay Garg , Balaji Krishnamurthy
Abstract: Caption association techniques as part of digital content creation by a computing device are described. The computing device is configured to extract text features and bounding boxes from an input document. These text features and bounding boxes are processed to reduce a number of possible search spaces. The processing may involve generating and utilizing a language model that captures the semantic meaning of the text features to identify and filter static text, and may involve identifying and filtering inline captions. A number of bounding boxes are identified for a potential caption. The potential caption and corresponding identified bounding boxes are concatenated into a vector. The concatenated vector is used to identify relationships among the bounding boxes to determine a single bounding box associated with the caption. The determined association is utilized to generate an output digital document that includes a structured association between the caption and a data entry field.
-
-
-
-
-
-
-
-
-