Generating electronic summary documents for landing pages

    公开(公告)号:US11921766B2

    公开(公告)日:2024-03-05

    申请号:US17901885

    申请日:2022-09-02

    IPC分类号: G06F16/34 G06F16/33 G06N3/08

    摘要: Described herein are technologies related to constructing supplemental content items that summarize electronic landing pages. A sequence to sequence model that is configured to construct supplemental content items is trained based upon a corpus of electronic landing pages and supplemental content items that have been constructed by domain experts, wherein each landing page has a respective supplemental content item assigned thereto. The sequence to sequence model is additionally trained using self critical sequence training, where estimated click through rates of supplemental content items generated by the sequence to sequence model are employed to train the sequence to sequence model.

    Query rewriting and interactive inquiry framework

    公开(公告)号:US10654380B2

    公开(公告)日:2020-05-19

    申请号:US15612555

    申请日:2017-06-02

    摘要: The present application describes a system and method for converting a natural language query to a standard query using a sequence-to-sequence neural network. As described herein, when a natural language query is receive, the natural language query is converted to a standard query using a sequence-to-sequence model. In some cases, the sequence-to-sequence model is associated with an attention layer. A search using the standard query is performed and various documents may be returned. The documents that result from the search are scored based, at least in part, on a determined conditional entropy of the document. The conditional entropy is determined using the natural language query and the document.

    Query rewriting and interactive inquiry framework

    公开(公告)号:US11603017B2

    公开(公告)日:2023-03-14

    申请号:US16877410

    申请日:2020-05-18

    摘要: The present application describes a system and method for converting a natural language query to a standard query using a sequence-to-sequence neural network. As described herein, when a natural language query is receive, the natural language query is converted to a standard query using a sequence-to-sequence model. In some cases, the sequence-to-sequence model is associated with an attention layer. A search using the standard query is performed and various documents may be returned. The documents that result from the search are scored based, at least in part, on a determined conditional entropy of the document. The conditional entropy is determined using the natural language query and the document.

    Identifying relevant content items using a deep-structured neural network

    公开(公告)号:US10354182B2

    公开(公告)日:2019-07-16

    申请号:US14926617

    申请日:2015-10-29

    摘要: A computer-implemented technique is described herein for identifying one or more content items that are relevant to an input linguistic item (e.g., an input query) using a deep-structured neural network, trained based on a corpus of click-through data. The input linguistic item has a collection of input tokens. The deep-structured neural network includes a first part that produces word embeddings associated with the respective input tokens, a second part that generates state vectors that capture context information associated with the input tokens, and a third part which distinguishes important parts of the input linguistic item from less important parts. The second part of the deep-structured neural network can be implemented as a recurrent neural network, such as a bi-directional neural network. The third part of the deep-structured neural network can generate a concept vector by forming a weighted sum of the state vectors.

    Learning graph representations using hierarchical transformers for content recommendation

    公开(公告)号:US11676001B2

    公开(公告)日:2023-06-13

    申请号:US17093426

    申请日:2020-11-09

    IPC分类号: G06N3/045

    CPC分类号: G06N3/045

    摘要: Knowledge graphs can greatly improve the quality of content recommendation systems. There is a broad variety of knowledge graphs in the domain including clicked user-ad graphs, clicked query-ad graphs, keyword-display URL graphs etc. A hierarchical Transformer model learns entity embeddings in knowledge graphs. The model consists of two different Transformer blocks where the bottom block generates relation-dependent embeddings for the source entity and its neighbors, and the top block aggregates the outputs from the bottom block to produce the target entity embedding. To balance the information from contextual entities and the source entity itself, a masked entity model (MEM) task is combined with a link prediction task in model training.

    Pipeline for identifying supplemental content items that are related to objects in images

    公开(公告)号:US11163940B2

    公开(公告)日:2021-11-02

    申请号:US16422992

    申请日:2019-05-25

    摘要: Technologies are described herein that relate to identifying supplemental content items that are related to objects captured in images of webpages. A computing system receives an indication that a client computing device has a webpage displayed thereon that includes an image. The image is provided to a first DNN that is configured to identify a portion of the image that includes an object of a type from amongst a plurality of predefined types. Once the portion of the image is identified, the portion of the image is provided to a plurality of DNNs, with each of the DNNs configured to output a word or phrase that represents a value of a respective attribute of the object. A sequence of words or phrases output by the plurality of DNNs is provided to a search computing system, which identifies a supplemental content item based upon the sequence of words or phrases.

    Dynamic tensor attention for information retrieval scoring

    公开(公告)号:US10459928B2

    公开(公告)日:2019-10-29

    申请号:US15379262

    申请日:2016-12-14

    摘要: A technique of scoring a query against a document using sequence to sequence neural networks. The technique comprises: receiving a query comprising a plurality of words from a user; performing a search for a document comprising words based on the query; feeding the words of the document as the input of an encoder of a multilayer sequence to sequence converter; generating a plurality of vectors at a decoder of the multilayer sequence to sequence converter, each vector comprising a probability associated with a respective word in the query; looking up in the respective vector each word's probability of being associated with the document; multiplying every word's probability together to determine an overall probability of the query being associated with the document; and returning the document to the user if the overall probability of the query being associated with the document is greater than a threshold value.