Method, apparatus, and computer program product for classification and tagging of textual data
    71.
    发明授权
    Method, apparatus, and computer program product for classification and tagging of textual data 有权
    用于文本数据分类和标记的方法,设备和计算机程序产品

    公开(公告)号:US09330167B1

    公开(公告)日:2016-05-03

    申请号:US13893044

    申请日:2013-05-13

    申请人: Groupon, Inc.

    发明人: Nick Pendar

    IPC分类号: G06F17/30

    摘要: Provided herein are systems, methods and computer readable media for classification and tagging of textual data. An example method may include accessing a corpus comprising a plurality of documents, each document having one or more labels indicative of services offered by a merchant, generating a query based on extracted features and the documents, generating a precision score for at least a portion of the generated query and selecting a subset of the generated queries based on an assigned precision score satisfying a precision score threshold, the selected subset of the generated queries configured to provide an indication of one or more labels to be applied to machine readable text. A second example method, utilized for tagging machine readable text with unknown labels, may include assigning a label to textual portions of the machine readable text based on results of the application of the queries.

    摘要翻译: 本文提供了用于分类和标记文本数据的系统,方法和计算机可读介质。 示例性方法可以包括访问包括多个文档的语料库,每个文档具有指示商家提供的服务的一个或多个标签,基于提取的特征和文档生成查询,生成至少一部分 所生成的查询和基于分配的精度分数选择所生成的查询的子集,满足精度分数阈值,所选择的生成查询的子集被配置为提供要应用于机器可读文本的一个或多个标签的指示。 用于标记具有未知标签的机器可读文本的第二示例性方法可以包括基于查询的应用结果将标签分配给机器可读文本的文本部分。

    Search method, search device and storage medium
    73.
    发明授权
    Search method, search device and storage medium 有权
    搜索方式,搜索设备和存储介质

    公开(公告)号:US09317590B2

    公开(公告)日:2016-04-19

    申请号:US14347776

    申请日:2012-12-06

    IPC分类号: G06F17/30

    摘要: Disclosed are a search method, a search device and a storage medium. The method comprises obtaining all relevant documents of information to be sought; calculating a relevancy between each relevant document and the information to be sought based on a word matching algorithm and a semantics matching algorithm; performing sequencing processing on all the relevant documents according to the relevancy obtained through calculation, and displaying a sequencing result. Further disclosed is a search device. The present invention comprehensively considers matching between words, and matching of the semantics relationship between words, obtains an accurate relevancy calculation result, provides an ideal search result to a user, and improves satisfaction of the user.

    摘要翻译: 公开了搜索方法,搜索装置和存储介质。 该方法包括获取所要求的所有相关信息文件; 基于字匹配算法和语义匹配算法计​​算每个相关文档与要搜索的信息之间的相关性; 根据通过计算获得的相关性,对所有相关文件进行排序处理,并显示排序结果。 另外公开了一种搜索装置。 本发明综合考虑词之间的匹配以及词之间的语义关系的匹配,获得准确的相关性计算结果,为用户提供理想的搜索结果,并提高用户的满意度。

    LANDING PAGE SEARCH RESULTS
    74.
    发明申请
    LANDING PAGE SEARCH RESULTS 审中-公开
    着陆页搜索结果

    公开(公告)号:US20160098489A1

    公开(公告)日:2016-04-07

    申请号:US14968138

    申请日:2015-12-14

    IPC分类号: G06F17/30

    摘要: Systems and methods for providing content are disclosed. In an embodiment, information encoding at least one keyword that is associated with first content accessed by a user is received. A search query based at least in part on at least one keyword is executed to identify items. In response to a request from the user to access second content, a response is generated to the request that includes item information associated with at least a subset of the identified items. The response is provided to the user.

    摘要翻译: 公开了用于提供内容的系统和方法。 在一个实施例中,接收编码与由用户访问的第一内容相关联的至少一个关键字的信息。 执行至少部分地至少一个关键字的搜索查询以识别项目。 响应于来自用户访问第二内容的请求,对包括与所识别的项目的至少一个子集相关联的项目信息的请求生成响应。 该响应被提供给用户。

    REFERENCED CONTENT INDEXING
    76.
    发明申请
    REFERENCED CONTENT INDEXING 审中-公开
    参考内容索引

    公开(公告)号:US20160085780A1

    公开(公告)日:2016-03-24

    申请号:US14489667

    申请日:2014-09-18

    IPC分类号: G06F17/30

    摘要: One or more techniques and/or systems are provided for indexing referenced content and/or for deep content searching. In an example, parent content (e.g., an instant message from a friend about a celebrity) may be evaluated to identify a reference (e.g., a URL) to referenced content hosted by a content source (e.g., a photo shared through a photo sharing service). The referenced content may be acquired from the content source, and may be evaluated to identify a search term that is descriptive of the referenced content (e.g., a name of the celebrity in the photo). The parent content and the referenced content may be indexed into a search index using the search term. In an example, responsive to a search query corresponding to the parent content and/or the search term, the parent content and/or the referenced content may be provided as search results.

    摘要翻译: 提供一个或多个技术和/或系统用于索引所引用的内容和/或用于深层内容搜索。 在一个示例中,可以评估父内容(例如,来自朋友关于名人的即时消息)以识别由内容源(例如,通过照片共享共享的照片)的引用内容的引用(例如,URL) 服务)。 引用的内容可以从内容源获取,并且可以被评估以识别描述所引用的内容的搜索词(例如,照片中名人的名称)。 父内容和参考内容可以使用搜索项索引到搜索索引中。 在一个例子中,响应于与父内容和/或搜索词相对应的搜索查询,可以将父内容和/或引用内容提供为搜索结果。

    IDENTIFYING AND SCORING DATA VALUES

    公开(公告)号:US20160085755A1

    公开(公告)日:2016-03-24

    申请号:US14494114

    申请日:2014-09-23

    IPC分类号: G06F17/30 G06Q10/00

    摘要: Text including at least a first term can be presented on a display. An enterprise glossary is queried to identify other terms that match the first term. Data assets to which each of the other terms are linked and which include data values for the other terms can be identified. A first score indicating a level of relevance of the respective data asset to an enterprise is assigned to each of the data assets. A frequency distribution of the data values in the data assets is determined. Based at least on the first scores indicating the level of relevance of the respective data assets to the enterprise and the frequency distribution of the data values in the data assets, second scores are assigned to each of the data values. A plurality the data values which are assigned highest of the second scores are presented on the display.

    State-Specific External Functionality for Software Developers
    78.
    发明申请
    State-Specific External Functionality for Software Developers 有权
    软件开发人员的具体国家外部功能

    公开(公告)号:US20160085521A1

    公开(公告)日:2016-03-24

    申请号:US14588351

    申请日:2014-12-31

    申请人: Quixey, Inc.

    摘要: A system includes a user interface presented to a developer. The developer selects a first function to supplement functionality of a first application with external functionality available from third party applications. A code generation module provides a software object to the developer for incorporation into a first state of the first application. The first state includes a user interface element associated with an entity. User selection of the user interface element initiates preparation of a query wrapper including a combination of the entity's name and a predefined text string corresponding to the first function. The query wrapper is transmitted to a search system and a result set is received and displayed. A first item of the result set includes an access mechanism for a specified state of a target application. User selection of the first item causes the access mechanism to open the target application to the specified state.

    摘要翻译: 系统包括向开发者呈现的用户界面。 开发人员选择第一功能来补充具有可从第三方应用获得的外部功能的第一应用的功能。 代码生成模块向开发者提供软件对象以将其并入第一应用的第一状态。 第一状态包括与实体相关联的用户界面元素。 用户选择用户界面元素启动查询包装器的准备,该查询包装器包括实体名称与对应于第一功能的预定文本串的组合。 查询包装器被发送到搜索系统,并且接收并显示结果集。 结果集的第一项包括用于目标应用的指定状态的访问机制。 用户选择第一个项目导致访问机制将目标应用程序打开到指定的状态。

    UNSTRUCTURED SECURITY THREAT INFORMATION ANALYSIS
    79.
    发明申请
    UNSTRUCTURED SECURITY THREAT INFORMATION ANALYSIS 有权
    未经构造的安全威胁信息分析

    公开(公告)号:US20160065599A1

    公开(公告)日:2016-03-03

    申请号:US14473743

    申请日:2014-08-29

    IPC分类号: H04L29/06 H04L29/08

    摘要: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for creating structured data using data received from unstructured textual data sources. One of the methods includes receiving unstructured textual data, identifying one or more keywords in the unstructured textual data, determining one or more patterns included in the unstructured textual data using the identified keywords, identifying one or more intelligence types that correspond with the unstructured textual data using the determined patterns, and associating, for each of the identified intelligence types, a data subset from the unstructured textual data with the respective intelligence type.

    摘要翻译: 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于使用从非结构化文本数据源接收的数据创建结构化数据。 其中一种方法包括接收非结构化文本数据,识别非结构化文本数据中的一个或多个关键字,使用所识别的关键字确定包含在非结构化文本数据中的一个或多个模式,识别与非结构化文本数据相对应的一个或多个智能类型 使用所确定的模式,并且针对每个所识别的智能类型,将来自非结构化文本数据的数据子集与相应的智能类型相关联。

    LANGUAGE LEARNING EXCHANGE
    80.
    发明申请
    LANGUAGE LEARNING EXCHANGE 审中-公开
    语言学习交流

    公开(公告)号:US20160027334A1

    公开(公告)日:2016-01-28

    申请号:US14875005

    申请日:2015-10-05

    申请人: WESPEKE, INC.

    发明人: Michael E. Elchik

    摘要: Systems and methods for identifying and connecting complementary users of a language learning exchange are provided herein. One aspect includes registering through a computing device one or more users in a user community of an online language learning platform, the one or more users associated with profile information comprising user name, native language, and language of interest elements; receiving search terms associated with a first user; matching the first user with a complementary user based on the search terms and profile information; and presenting the first user and the complementary user with a learning exchange interface via which they each use local devices to communicate over a network.

    摘要翻译: 本文提供了用于识别和连接语言学习交换机的辅助用户的系统和方法。 一个方面包括通过计算设备登记在线语言学习平台的用户社区中的一个或多个用户,所述一个或多个用户与包括用户名,本地语言和感兴趣的语言元素的简档信息相关联; 接收与第一用户相关联的搜索词; 根据搜索词和简档信息,将第一用户与补充用户进行匹配; 以及向所述第一用户和所述补充用户呈现学习交换界面,通过所述学习交换界面,他们每个使用本地设备通过网络进行通信。