Language input user interface
    51.
    发明授权
    Language input user interface 有权
    语言输入用户界面

    公开(公告)号:US07403888B1

    公开(公告)日:2008-07-22

    申请号:US09606811

    申请日:2000-06-28

    IPC分类号: G10L11/00

    摘要: A language input architecture receives input text (e.g., phonetic text of a character-based language) entered by a user from an input device (e.g., keyboard, voice recognition). The input text is converted to an output text (e.g., written language text of a character-based language). The language input architecture has a user interface that displays the output text and unconverted input text in line with one another. As the input text is converted, it is replaced in the UI with the converted output text. In addition to this in-line input feature, the UI enables in-place editing or error correction without requiring the user to switch modes from an entry mode to an edit mode. To assist with this in-place editing, the UI presents pop-up windows containing the phonetic text from which the output text was converted as well as first and second candidate lists that contain small and large sets of alternative candidates that might be used to replace the current output text. The language input user interface also allows a user to enter a mixed text of different languages.

    摘要翻译: 语言输入架构从输入设备(例如,键盘,语音识别)接收用户输入的输入文本(例如,基于字符的语言的语音文本)。 输入文本被转换为输出文本(例如,基于字符的语言的书面语言文本)。 语言输入架构具有用于显示输出文本和未转换的输入文本的用户界面。 当输入文本被转换时,它将在UI中被替换为转换的输出文本。 除了这种在线输入功能之外,UI还可以进行就地编辑或纠错,而无需用户将模式从入门模式切换到编辑模式。 为了协助进行就地编辑,用户界面将显示弹出窗口,其中包含输出文本被转换的语音文本以及第一个和第二个候选列表,其中包含可用于替换的小组和大组替代候选项 当前的输出文本。 语言输入用户界面还允许用户输入不同语言的混合文本。

    Extracting semantic attributes
    52.
    发明申请
    Extracting semantic attributes 有权
    提取语义属性

    公开(公告)号:US20070239697A1

    公开(公告)日:2007-10-11

    申请号:US11392761

    申请日:2006-03-30

    IPC分类号: G06F17/30

    摘要: Extraction of semantic information and the generation of semantic attributes allows for improved organization and management of data. Semantic attributes are automatically generated and eliminate the need for manual entry of attribute information. A semantic file network may further be constructed based on similarities between files that are based on the semantic attribute information. Semantic links representing a semantic relationship may be built between similar or relevant files. In addition, user operations and user operation patterns may also be considered in building the file network. Semantic attributes and information may further facilitate browsing the file systems as well as improve the accuracy and speed of queries.

    摘要翻译: 语义信息的提取和语义属性的产生可以改善数据的组织和管理。 自动生成语义属性,无需手动输入属性信息。 还可以基于基于语义属性信息的文件之间的相似性来构建语义文件网络。 表示语义关系的语义链接可以在相似或相关文件之间建立。 此外,在构建文件网络时也可以考虑用户操作和用户操作模式。 语义属性和信息可以进一步促进文件系统的浏览以及提高查询的准确性和速度。

    Text classification by weighted proximal support vector machine
    53.
    发明申请
    Text classification by weighted proximal support vector machine 有权
    通过加权近端支持向量机进行文本分类

    公开(公告)号:US20070239638A1

    公开(公告)日:2007-10-11

    申请号:US11384889

    申请日:2006-03-20

    IPC分类号: G06F15/18

    CPC分类号: G06F17/30707 G06K9/6269

    摘要: Embodiments of the invention relate to improvements to the support vector machine (SVM) classification model. When text data is significantly unbalanced (i.e., positive and negative labeled data are in disproportion), the classification quality of standard SVM deteriorates. Embodiments of the invention are directed to a weighted proximal SVM (WPSVM) model that achieves substantially the same accuracy as the traditional SVM model while requiring significantly less computational time. A weighted proximal SVM (WPSVM) model in accordance with embodiments of the invention may include a weight for each training error and a method for estimating the weights, which automatically solves the unbalanced data problem. And, instead of solving the optimization problem via the KKT (Karush-Kuhn-Tucker) conditions and the Sherman-Morrison-Woodbury formula, embodiments of the invention use an iterative algorithm to solve an unconstrained optimization problem, which makes WPSVM suitable for classifying relatively high dimensional data.

    摘要翻译: 本发明的实施例涉及对支持向量机(SVM)分类模型的改进。 当文本数据显着不平衡(即正负标签数据不成比例)时,标准SVM的分类质量恶化。 本发明的实施例涉及一种加权近端SVM(WPSVM)模型,其实现与传统SVM模型基本相同的精度,同时需要显着更少的计算时间。 根据本发明的实施例的加权近端SVM(WPSVM)模型可以包括每个训练误差的权重以及用于估计权重的方法,其自动地解决不平衡数据问题。 而且,不是通过KKT(Karush-Kuhn-Tucker)条件和Sherman-Morrison-Woodbury公式来解决优化问题,而是本发明的实施例使用迭代算法来解决无约束优化问题,这使得WPSVM适合于相对分类 高维数据。

    Collaborative filtering using cluster-based smoothing
    54.
    发明申请
    Collaborative filtering using cluster-based smoothing 审中-公开
    使用基于群集的平滑的协同过滤

    公开(公告)号:US20070239553A1

    公开(公告)日:2007-10-11

    申请号:US11377130

    申请日:2006-03-16

    IPC分类号: G06Q30/00

    摘要: In an embodiment, a method of predicting an active user's rating for an item is disclosed. A database of users may be sorted into clusters. The data associated with the users in each cluster may be smoothed to filling in ratings for items that the users have not personally rated. An active user may then be compared to a set of users, where the set may be all or some portion of the database, to determine the K users that are most similar to the active user. The ratings of the K users regarding the item may be used to predict the active user's rating for the item. In an embodiment, the rating of each of the K users is assigned a confidence value associated with whether the user personally rated the item or if the rating was generated by the data smoothing process.

    摘要翻译: 在一个实施例中,公开了一种用于预测项目的活跃用户评级的方法。 可以将用户的数据库分类为群集。 可以平滑与每个群集中的用户相关联的数据,以填充用户未被评估的项目的评级。 然后可以将活动用户与一组用户进行比较,其中该集合可以是数据库的全部或部分,以确定与活动用户最相似的K个用户。 关于该项目的K个用户的评级可以用于预测该项目的活动用户的评级。 在一个实施例中,每个K个用户的评级被分配与用户个人评价该项目相关联的置信度值,或者如果该评级是由数据平滑处理产生的。

    Language conversion and display
    55.
    发明申请
    Language conversion and display 审中-公开
    语言转换和显示

    公开(公告)号:US20050060138A1

    公开(公告)日:2005-03-17

    申请号:US10898407

    申请日:2004-07-23

    摘要: A language input architecture receives input text (e.g., phonetic text of a character-based language) entered by a user from an input device (e.g., keyboard, voice recognition). The input text is converted to an output text (e.g., written language text of a character-based language). The language input architecture has a user interface that displays the output text and unconverted input text in line with one another. As the input text is converted, it is replaced in the UI with the converted output text. In addition to this in-line input feature, the UI enables in-place editing or error correction without requiring the user to switch modes from an entry mode to an edit mode. To assist with this in-place editing, the UI presents pop-up windows containing the phonetic text from which the output text was converted as well as first and second candidate lists that contain small and large sets of alternative candidates that might be used to replace the current output text. The language input user interface also allows a user to enter a mixed text of different languages.

    摘要翻译: 语言输入架构从输入设备(例如,键盘,语音识别)接收用户输入的输入文本(例如,基于字符的语言的语音文本)。 输入文本被转换为输出文本(例如,基于字符的语言的书面语言文本)。 语言输入架构具有用于显示输出文本和未转换的输入文本的用户界面。 当输入文本被转换时,它将在UI中被替换为转换的输出文本。 除了这种在线输入功能之外,UI还可以进行就地编辑或纠错,而无需用户将模式从入门模式切换到编辑模式。 为了协助进行就地编辑,用户界面将显示弹出窗口,其中包含输出文本被转换的语音文本以及第一个和第二个候选列表,其中包含可用于替换的小组和大组替代候选项 当前的输出文本。 语言输入用户界面还允许用户输入不同语言的混合文本。

    Identifying influential persons in a social network
    56.
    发明授权
    Identifying influential persons in a social network 有权
    识别社会网络中有影响力的人物

    公开(公告)号:US08359276B2

    公开(公告)日:2013-01-22

    申请号:US11533742

    申请日:2006-09-20

    IPC分类号: G06Q99/00

    CPC分类号: G06Q30/02 G06Q10/10

    摘要: An influential persons identification system and method for identifying a set of influential persons (or influencers) in a social network (such as an online social network). The influential persons set is generated such that by sending a message to the set the message will be propagated through the network at the greatest speed and coverage. A ranking of users is generated, and a pruning process is performed starting with the top-ranked user and working down the list. For each user on the list, the user is identified as an influencer and then the user and each of his friends are deleted from the social network users list. Next, the same process is performed for the second-ranked user, the third-ranked user, and so forth. The process terminates when the list of users of the social network is exhausted or the desired number of influencers on the influential person set is reached.

    摘要翻译: 在社交网络(如在线社交网络)中识别一组有影响力的人(或影响者)的有影响力的人员识别系统和方法。 产生有影响力的人员,通过发送消息给消息集,消息将以最大的速度和覆盖率通过网络传播。 生成用户排名,并从顶级用户开始执行修剪过程,并在列表中执行操作。 对于列表中的每个用户,用户被识别为影响者,然后从社交网络用户列表中删除用户和他的每个朋友。 接下来,对于第二等级的用户,第三等级的用户等执行相同的处理。 当社交网络的用户列表用完或者达到期望数量的有影响力的人集合的影响者时,该过程终止。

    Identification of similar queries based on overall and partial similarity of time series
    57.
    发明授权
    Identification of similar queries based on overall and partial similarity of time series 有权
    基于时间序列的总体和部分相似性识别类似查询

    公开(公告)号:US08290921B2

    公开(公告)日:2012-10-16

    申请号:US11770505

    申请日:2007-06-28

    IPC分类号: G06F7/00 G06F17/30

    CPC分类号: G06F17/30864 G06F17/3064

    摘要: Techniques for identifying similar queries based on their overall similarity and partial similarity of time series of frequencies of the queries are provided. To identify queries that are similar to a target query, the query analysis system generates, for each query, an overall similarity score for that query and the target query based on the time series of the query and the target query. The query analysis system also generates, for each query, partial similarity scores for the query and the target query based on various time sub-series of the overall time series of the queries. The query analysis system then identifies queries as being similar to the target query based on the overall similarity scores and the partial similarity scores of the queries.

    摘要翻译: 提供了基于其查询的时间序列的总体相似性和部分相似性来识别类似查询的技术。 为了识别类似于目标查询的查询,查询分析系统根据查询和目标查询的时间序列为每个查询生成该查询和目标查询的总体相似性得分。 查询分析系统还根据查询的整个时间序列的各种时间子序列,为每个查询生成查询和目标查询的部分相似度分数。 然后,查询分析系统基于查询的总体相似性得分和部分相似性得分将查询识别为与目标查询相似。

    WEB CONTENT MINING OF PAIR-BASED DATA
    58.
    发明申请
    WEB CONTENT MINING OF PAIR-BASED DATA 审中-公开
    基于对象数据的WEB内容采集

    公开(公告)号:US20110213763A1

    公开(公告)日:2011-09-01

    申请号:US13101061

    申请日:2011-05-04

    IPC分类号: G06F17/30

    CPC分类号: G06F16/951

    摘要: Described herein is technology for, among other things, mining pair-based data on the web. The technology involves an online pair-based data mining system as well as an offline SVM training system. By subjecting a pair-based input data to the systems, one may grow a pool of pair-based data which share characteristics of the pair-based input data in more efficient manner.

    摘要翻译: 这里描述的是用于在网络上挖掘基于对的数据的技术。 该技术涉及一个在线的基于对的数据挖掘系统以及离线SVM培训系统。 通过对基于对的输入数据进行系统的处理,可以以更有效的方式增加基于成对的输入数据的特征的基于对的数据池。

    Web content mining of pair-based data
    59.
    发明授权
    Web content mining of pair-based data 有权
    基于对数据的Web内容挖掘

    公开(公告)号:US07962507B2

    公开(公告)日:2011-06-14

    申请号:US11941968

    申请日:2007-11-19

    IPC分类号: G06F17/30 G06F7/00

    CPC分类号: G06F17/30864

    摘要: Described herein is technology for, among other things, mining pair-based data on the web. The technology involves an online pair-based data mining system as well as an offline SVM training system. By subjecting a pair-based input data to the systems, one may grow a pool of pair-based data which share characteristics of the pair-based input data in more efficient manner.

    摘要翻译: 这里描述的是用于在网络上挖掘基于对的数据的技术。 该技术涉及一个在线的基于对的数据挖掘系统以及离线SVM培训系统。 通过对基于对的输入数据进行系统的处理,可以以更有效的方式增加基于成对的输入数据的特征的基于对的数据池。

    Internet visualization system and related user interfaces
    60.
    发明授权
    Internet visualization system and related user interfaces 有权
    互联网可视化系统及相关用户界面

    公开(公告)号:US07873904B2

    公开(公告)日:2011-01-18

    申请号:US11972073

    申请日:2008-01-10

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30864 G06F2216/03

    摘要: Systems and methods are described for an Internet visualization system and related user interfaces. In one implementation, the system analyzes Internet search logs to determine most popular search queries across the world at a current time. A user interface displays a keyword of each of the most popular queries in a single visual display that relates each query to a geographical location of greatest popularity. The system can also filter queries according to demographics. In one implementation the user interface provides a 3-dimensional Internet visualization that adopts an ocean or seascape theme. The ocean floor displays a map of the world, and query bubbles rise from geographical locations on the map. The size and duration of each query bubble denotes the relative popularity of a given query.

    摘要翻译: 为互联网可视化系统和相关的用户界面描述了系统和方法。 在一个实现中,系统分析互联网搜索日志以确定当前世界上最流行的搜索查询。 用户界面在单个视觉显示中显示每个最流行的查询的关键字,其将每个查询与最受欢迎的地理位置相关联。 该系统还可以根据人口特征来过滤查询。 在一个实现中,用户界面提供采用海洋或海景主题的三维互联网可视化。 海底显示世界地图,查询气泡从地图上的地理位置上升。 每个查询气泡的大小和持续时间表示给定查询的相对受欢迎程度。