- 专利标题: Management of indexed data to improve content retrieval processing
-
申请号: US16721652申请日: 2019-12-19
-
公开(公告)号: US11544502B2公开(公告)日: 2023-01-03
- 发明人: Saurabh Sanjay Deshpande , Mina Mikhail , Matthew Francis Hurst , Riham Hassan Abdel-Moneim Mansour
- 申请人: Microsoft Technology Licensing, LLC
- 申请人地址: US WA Redmond
- 专利权人: Microsoft Technology Licensing, LLC
- 当前专利权人: Microsoft Technology Licensing, LLC
- 当前专利权人地址: US WA Redmond
- 主分类号: G06F16/30
- IPC分类号: G06F16/30 ; G06K9/62 ; G06F16/22 ; G06N20/00 ; G06F16/2453
摘要:
The present disclosure relates to processing operations configured to uniquely utilize indexing of content to improve content retrieval processing, particularly when working with large data sets. The techniques described herein enables efficient content retrieval when working with large data sets such as those that may be associated with a plurality of tenants of a data storage application/service. Among other technical advantages, the present disclosure is applicable to train a classifier using relevant samples based on text search in tenant-specific scenarios, where accurate searching can be executed for content associated with one or more tenant accounts of an application/service concurrently in milliseconds even in instances where there may be millions of documents to be searched. As an example, exemplary data shards may be generated and managed for efficient and scalable content retrieval processing including training of a classifier (e.g., artificial intelligence classifier) and real-time (or near real-time) query processing.
公开/授权文献
信息查询