-
公开(公告)号:US10242114B2
公开(公告)日:2019-03-26
申请号:US14144213
申请日:2013-12-30
发明人: Riham Hassan Abdel-Moneim Mansour , Joseph W. Pepper , Nesma Abd El-Hakim Refaei , Diaa Mohamed Abdel Moneim Abdallah , Vanessa Graham Murdock
IPC分类号: G06F17/30
摘要: A method is provided of enriching an entry for an entity in a local index of a search engine with tags. The method comprises obtaining location-related social media messages from within a neighborhood of an entity; determining from the obtained messages one or more terms that are unique to the entity; individually determining one or more co-occurring terms for the one or more unique terms; and using the one or more co-occurring term as tags to label the entity in the local index. Furthermore, a method is provided of retrieving social media messages associated with search results.
-
公开(公告)号:US09904727B2
公开(公告)日:2018-02-27
申请号:US15289846
申请日:2016-10-10
发明人: Riham Hassan Abdel-Moneim Mansour , Ahmed Ael Mohamed Abdel Kader Ashour , Hesham Saad Mohamed Abdelwahab El Baz
CPC分类号: G06F17/30663 , G06F17/30011 , G06F17/30613 , G06F17/30713 , G06F17/30716 , G06K9/00 , G06K9/00483 , G06K9/6215 , G06K9/68
摘要: A system for retrieving/identifying a document comprising text stored in a document repository is described. A memory stores a graphical structure comprising a first plurality of nodes each representing a person, and a second plurality of nodes each representing a document in the document repository, the nodes being connected by edges according to automatically observed interactions between the represented people and documents. At least some of the nodes have one or more annotations each denoting a topic. A node relatedness calculator computes distances between nodes of the graphical structure using the topic annotations. An input receives an identifier of a user who is represented by one of the first plurality of nodes. An identifier/retriever identifies one or more documents from the document repository by using the identifier and using the computed distances between nodes.
-
公开(公告)号:US20210192281A1
公开(公告)日:2021-06-24
申请号:US16721652
申请日:2019-12-19
发明人: Saurabh Sanjay Deshpande , Mina Mikhail , Matthew Francis Hurst , Riham Hassan Abdel-Moneim Mansour
IPC分类号: G06K9/62 , G06F16/22 , G06N20/00 , G06F16/2453
摘要: The present disclosure relates to processing operations configured to uniquely utilize indexing of content to improve content retrieval processing, particularly when working with large data sets. The techniques described herein enables efficient content retrieval when working with large data sets such as those that may be associated with a plurality of tenants of a data storage application/service. Among other technical advantages, the present disclosure is applicable to train a classifier using relevant samples based on text search in tenant-specific scenarios, where accurate searching can be executed for content associated with one or more tenant accounts of an application/service concurrently in milliseconds even in instances where there may be millions of documents to be searched. As an example, exemplary data shards may be generated and managed for efficient and scalable content retrieval processing including training of a classifier (e.g., artificial intelligence classifier) and real-time (or near real-time) query processing.
-
公开(公告)号:US09483474B2
公开(公告)日:2016-11-01
申请号:US14615156
申请日:2015-02-05
发明人: Riham Hassan Abdel-Moneim Mansour , Ahmed Adel Mohamed Abdel Kader Ashour , Hesham Saad Mohamed Abdelwahab El Baz
CPC分类号: G06F17/30663 , G06F17/30011 , G06F17/30613 , G06F17/30713 , G06F17/30716 , G06K9/00 , G06K9/00483 , G06K9/6215 , G06K9/68
摘要: A system for retrieving/identifying a document comprising text stored in a document repository is described. A memory stores a graphical structure comprising a first plurality of nodes each representing a person, and a second plurality of nodes each representing a document in the document repository, the nodes being connected by edges according to automatically observed interactions between the represented people and documents. At least some of the nodes have one or more annotations each denoting a topic. A node relatedness calculator computes distances between nodes of the graphical structure using the topic annotations. An input receives an identifier of a user who is represented by one of the first plurality of nodes. An identifier/retriever identifies one or more documents from the document repository by using the identifier and using the computed distances between nodes.
摘要翻译: 描述用于检索/识别包含存储在文档库中的文本的系统。 存储器存储图形结构,其包括每个表示人的第一多个节点,以及每个表示文档库中的文档的第二多个节点,所述节点根据所代表的人和文档之间的自动观察到的交互而被边缘连接。 至少一些节点具有每个表示主题的一个或多个注释。 节点相关性计算器使用主题注释计算图形结构的节点之间的距离。 输入接收由第一多个节点之一表示的用户的标识符。 标识符/检索者通过使用标识符并且使用所计算的节点之间的距离来从文档存储库识别一个或多个文档。
-
公开(公告)号:US20160232157A1
公开(公告)日:2016-08-11
申请号:US14615156
申请日:2015-02-05
发明人: Riham Hassan Abdel-Moneim Mansour , Ahmed Adel Mohamed Abdel Kader Ashour , Hesham Saad Mohamed Abdelwahab El Baz
CPC分类号: G06F17/30663 , G06F17/30011 , G06F17/30613 , G06F17/30713 , G06F17/30716 , G06K9/00 , G06K9/00483 , G06K9/6215 , G06K9/68
摘要: A system for retrieving/identifying a document comprising text stored in a document repository is described. A memory stores a graphical structure comprising a first plurality of nodes each representing a person, and a second plurality of nodes each representing a document in the document repository, the nodes being connected by edges according to automatically observed interactions between the represented people and documents. At least some of the nodes have one or more annotations each denoting a topic. A node relatedness calculator computes distances between nodes of the graphical structure using the topic annotations. An input receives an identifier of a user who is represented by one of the first plurality of nodes. An identifier/retriever identifies one or more documents from the document repository by using the identifier and using the computed distances between nodes.
摘要翻译: 描述用于检索/识别包含存储在文档库中的文本的系统。 存储器存储图形结构,其包括每个表示人的第一多个节点,以及每个表示文档库中的文档的第二多个节点,所述节点根据所代表的人和文档之间的自动观察到的交互而被边缘连接。 至少一些节点具有每个表示主题的一个或多个注释。 节点相关性计算器使用主题注释计算图形结构的节点之间的距离。 输入接收由第一多个节点之一表示的用户的标识符。 标识符/检索者通过使用标识符并且使用所计算的节点之间的距离来从文档存储库识别一个或多个文档。
-
公开(公告)号:US11544502B2
公开(公告)日:2023-01-03
申请号:US16721652
申请日:2019-12-19
发明人: Saurabh Sanjay Deshpande , Mina Mikhail , Matthew Francis Hurst , Riham Hassan Abdel-Moneim Mansour
IPC分类号: G06F16/30 , G06K9/62 , G06F16/22 , G06N20/00 , G06F16/2453
摘要: The present disclosure relates to processing operations configured to uniquely utilize indexing of content to improve content retrieval processing, particularly when working with large data sets. The techniques described herein enables efficient content retrieval when working with large data sets such as those that may be associated with a plurality of tenants of a data storage application/service. Among other technical advantages, the present disclosure is applicable to train a classifier using relevant samples based on text search in tenant-specific scenarios, where accurate searching can be executed for content associated with one or more tenant accounts of an application/service concurrently in milliseconds even in instances where there may be millions of documents to be searched. As an example, exemplary data shards may be generated and managed for efficient and scalable content retrieval processing including training of a classifier (e.g., artificial intelligence classifier) and real-time (or near real-time) query processing.
-
公开(公告)号:US09881023B2
公开(公告)日:2018-01-30
申请号:US14337574
申请日:2014-07-22
发明人: Riham Hassan Abdel-Moneim Mansour , Mohamed Farouk Abdel-Handy , Hesham Saad Mohamed Abdelwahab El Baz
IPC分类号: G06K9/54 , G10L15/00 , G06F17/30 , G10L15/183
CPC分类号: G06F17/30247 , G06F17/30017 , G06F17/30023 , G06F17/30265 , G06F17/30554 , G06F17/30598 , G06F17/30684 , G06F17/30705 , G06F17/30976 , G10L15/183
摘要: Retrieving and/or storing images associated with events is described. For example, streams of event data comprising text are analyzed to detect an event and a language component builds an event language model for the event, comprising a plurality of words. In various examples, images extracted from web or other sources have associated text. In examples, images with associated text that is similar to the event language model are identified as images of the event. In various examples, associations between images and events are used to update an image retrieval system and/or an image storage system. In various examples, query terms about an event are received at an image retrieval system which returns images related to the event on the basis of associations between image text and event language models.
-
公开(公告)号:US20170154100A1
公开(公告)日:2017-06-01
申请号:US15289846
申请日:2016-10-10
发明人: Riham Hassan Abdel-Moneim Mansour , Admed Ael Modmed Abdel Kader Ashour , Hesham Saad Modamed Abdelwahab El Baz
IPC分类号: G06F17/30
CPC分类号: G06F17/30663 , G06F17/30011 , G06F17/30613 , G06F17/30713 , G06F17/30716 , G06K9/00 , G06K9/00483 , G06K9/6215 , G06K9/68
摘要: A system for retrieving/identifying a document comprising text stored in a document repository is described. A memory stores a graphical structure comprising a first plurality of nodes each representing a person, and a second plurality of nodes each representing a document in the document repository, the nodes being connected by edges according to automatically observed interactions between the represented people and documents. At least some of the nodes have one or more annotations each denoting a topic. A node relatedness calculator computes distances between nodes of the graphical structure using the topic annotations. An input receives an identifier of a user who is represented by one of the first plurality of nodes. An identifier/retriever identifies one or more documents from the document repository by using the identifier and using the computed distances between nodes.
-
-
-
-
-
-
-