Method and apparatus for recognizing multiword expressions
    1.
    发明授权
    Method and apparatus for recognizing multiword expressions 有权
    用于识别多字表达的方法和装置

    公开(公告)号:US07346511B2

    公开(公告)日:2008-03-18

    申请号:US10248057

    申请日:2002-12-13

    IPC分类号: G10L15/04 G06F17/27

    CPC分类号: G06F17/271 G06F17/2755

    摘要: Words of an input string are morphologically analyzed to identify their alternative base forms and parts of speech. The analyzed words of the input string are used to compile the input string into a first finite-state network. The first finite-state network is matched with a second finite-state network of multiword expressions to identify all subpaths of the first finite-state network that match one or more complete paths in the second finite-state network. Each matching subpath of the first finite-state network and path of the second finite-state network identify a multiword expression in the input string. The morphological analysis is performed without disambiguating words and without segmenting the input string into sentences in the input string to compile the first finite-state network with at least one path that identifies alternative base forms or parts of speech of a word in the input string.

    摘要翻译: 输入字符串的词在形态上进行分析,以确定其替代基本形式和词性。 输入字符串的分析词用于将输入字符串编译成第一个有限状态网络。 第一有限状态网络与多字表达式的第二有限状态网络匹配,以识别与第二有限状态网络中的一个或多个完整路径匹配的第一有限状态网络的所有子路径。 第一有限状态网络的每个匹配子路径和第二有限状态网络的路径在输入字符串中标识多字表达式。 执行形态分析而不消除词汇,而不将输入字符串分割成输入字符串中的句子,以用至少一个路径识别第一有限状态网络,该路径识别输入字符串中单词的替代基本形式或词性。

    Method and apparatus for mapping multiword expressions to identifiers using finite-state networks
    2.
    发明授权
    Method and apparatus for mapping multiword expressions to identifiers using finite-state networks 有权
    使用有限状态网络将多字表达式映射到标识符的方法和装置

    公开(公告)号:US07552051B2

    公开(公告)日:2009-06-23

    申请号:US10248058

    申请日:2002-12-13

    IPC分类号: G10L15/04 G06F17/27

    CPC分类号: G06F17/2775

    摘要: Multiword expressions are mapped to identifiers using finite-state networks. Each of a plurality of multiword expressions is encoded into a regular expression. Each regular expression encodes a base form common to a plurality of derivative forms defined by ones of the multiword expressions. Each of the plurality of regular expressions is compiled with factorization into a set of finite-state networks. A union of the finite-state networks in the set of finite-state networks is performed to define a multiword finite-state network and a set of subnets. The multiword finite-state network and the set of subnets are traversed to identify a path corresponding to one of the plurality of multiword expressions, wherein only transitions originating from the multiword finite-state network are accounted for to ascertain a path number identifying a base form of the one of the plurality of multiword expressions.

    摘要翻译: 使用有限状态网络将多字表达式映射到标识符。 多个多词表达式中的每一个被编码成正则表达式。 每个正则表达式编码由多个词表达式中的一个定义的多个导数形式共同的基本形式。 多个正则表达式中的每一个被分解成一组有限状态网络。 执行有限状态网络集合中的有限状态网络的并集,以定义多字有限状态网络和一组子网。 遍历多字有限状态网络和子集合以识别与多个多词表达式中的一个对应的路径,其中仅考虑源自多字有限状态网络的转换以确定识别基本形式的路径号 的多个多词表达中的一个。

    Reversible user interface component
    3.
    发明授权
    Reversible user interface component 有权
    可逆的用户界面组件

    公开(公告)号:US08860763B2

    公开(公告)日:2014-10-14

    申请号:US13362212

    申请日:2012-01-31

    IPC分类号: G09G5/02 G06T11/60

    摘要: A method of configuring a widget and a tactile user interface which displays the widget are disclosed which enable menu-setting operations with minimal touch gestures and occupation of screen space. The interface includes a touch sensitive display device and memory which stores instructions for displaying the widget and a set of graphic objects together on the display device. The widget has two or more virtual sides, each of the sides being associated with a respective functionality. The widget is flipped, in response to a recognized touch gesture, from a first of the sides to a second of the sides, whereby the functionality of the widget is changed. The graphic objects are associated, in memory, with respective items having attributes. The graphic objects exhibit a response to the widget functionality of a currently displayed one of the sides of the widget based on the attributes of the respective items.

    摘要翻译: 公开了一种配置小部件的方法和显示小部件的触觉用户界面,其使得能够以最小的触摸手势和屏幕空间的占用来进行菜单设置操作。 该接口包括触敏显示设备和存储器,其存储用于在显示设备上一起显示窗口小部件和一组图形对象的指令。 小部件具有两个或更多个虚拟侧面,每个侧面与相应的功能相关联。 响应于识别的触摸手势,从第一侧到第二侧翻转该小部件,由此改变小部件的功能。 图形对象在存储器中与具有属性的各个项目相关联。 基于各个项目的属性,图形对象对窗口小部件的当前显示的一个边的widget功能作出响应。

    Apparatus and method for document collection and filtering
    4.
    发明授权
    Apparatus and method for document collection and filtering 有权
    文件收集和过滤的装置和方法

    公开(公告)号:US08386437B2

    公开(公告)日:2013-02-26

    申请号:US12417130

    申请日:2009-04-02

    IPC分类号: G06F7/00

    CPC分类号: G06F17/30011

    摘要: A system and method for document management are provided. The method relies on a logging system which automatically generates image logs for input documents for each job (print, copy, fax, scan, etc.) processed by the multifunction printing device(s) of an organization. The image logs are processed to identify keywords which are the basis of a search for similar documents among those which have been previously archived as well as documents in other accessible document repositories, including Web documents. The method identifies matching documents and optionally also revisions and related documents. A procedure is provided for ensuring that for each document processed by a multifunction device or other image output device of the organization, image data is archived (or identified as a public document without archiving). The method avoids duplication by using a digital matching document, where available, enabling the images of the image log for the input document to be discarded.

    摘要翻译: 提供了一种用于文档管理的系统和方法。 该方法依赖于记录系统,该系统自动生成由组织的多功能打印设备处理的每个作业(打印,复印,传真,扫描等)的输入文档的图像日志。 处理图像日志以识别作为先前存档的那些文档以及其他可访问文档存储库(包括Web文档)中的文档的搜索基础的关键字。 该方法识别匹配文档,还可选择修改和相关文档。 提供了一种程序,用于确保对于由组织的多功能设备或其他图像输出设备处理的每个文档,图像数据被归档(或被识别为公共文档而不进行归档)。 该方法通过使用数字匹配文档(如果可用)来避免重复,使得能够丢弃输入文档的图像日志的图像。

    System and method for assisted document review
    5.
    发明授权
    System and method for assisted document review 有权
    辅助文件审查的系统和方法

    公开(公告)号:US08165974B2

    公开(公告)日:2012-04-24

    申请号:US12479972

    申请日:2009-06-08

    IPC分类号: G06F15/18 G06F5/00 G06F17/00

    CPC分类号: G06N5/043 G06Q10/10 G06Q50/18

    摘要: A system and method for reviewing documents are provided. A collection of documents is portioned into sets of documents for review by a plurality of reviewers. For each set, documents in the set are displayed on a display device for review by a reviewer and temporarily organized through grouping and sorting. The reviewer's labels for the displayed documents are received. Based on the reviewer's labels, a class from a plurality of classes is assigned to each of the reviewed documents. A classifier model stored in computer memory is progressively trained, based on features extracted from the reviewed documents in the set and their assigned classes. Prior to review of all documents in the set, a calculated subset of documents for which the classifier model assigns a class different from the one assigned based on the reviewer's label is returned for a second review by a reviewer. Models generated from one or more other document sets can be used to assess the review of a first of the sets.

    摘要翻译: 提供了一种审查文件的系统和方法。 一组文件分为多组文件供多位评审员审阅。 对于每个集合,集合中的文档显示在显示设备上,供审阅者查看,并通过分组和排序进行临时组织。 接收到显示文件的审阅者标签。 根据审阅者的标签,将来自多个类的课程分配给每个经审查的文档。 存储在计算机存储器中的分类器模型基于从集合中的经审查的文档及其分配的类中提取的特征而逐渐训练。 在审查集合中的所有文档之前,返回分类器模型分配与基于审阅者标签分配的类别不同的​​类别的文档的计算子集,供审阅者进行第二次审阅。 可以使用从一个或多个其他文档集生成的模型来评估第一组的审查。

    PRINTER IMAGE LOG SYSTEM FOR DOCUMENT GATHERING AND RETENTION
    6.
    发明申请
    PRINTER IMAGE LOG SYSTEM FOR DOCUMENT GATHERING AND RETENTION 有权
    用于文件记录和保留的打印机图像记录系统

    公开(公告)号:US20100253967A1

    公开(公告)日:2010-10-07

    申请号:US12417110

    申请日:2009-04-02

    摘要: A system and method for document image acquisition and retrieval which find application in litigation for responding to discovery requests are disclosed. The method includes automatically acquiring image data and associated records for documents being processed by a plurality of image output devices within an organization and archiving the image data and associated records as image logs for the processed documents. When a request for document production is received by the organization, the image logs (and/or information extracted therefrom) are automatically filtered through at least one classifier trained to return documents responsive to the document request, and documents corresponding to the filtered out image logs are output. One of the filters may be configured for filtering privileged from non-privileged documents.

    摘要翻译: 公开了一种用于文件图像采集和检索的系统和方法,该系统和方法在诉讼中发现应用于响应发现请求。 该方法包括自动获取由组织内的多个图像输出设备正在处理的文档的图像数据和相关联的记录,并将图像数据和相关联的记录归档为处理的文档的图像日志。 当组织接收到文档生成请求时,通过经过训练以响应于文档请求返回文档的至少一个分类器自动过滤图像日志(和/或从其提取的信息),以及对应于过滤出的图像日志的文档 被输出。 其中一个过滤器可能被配置为过滤来自非特权文档的特权。

    Imaging system with haptic interface
    7.
    发明授权
    Imaging system with haptic interface 有权
    具有触觉界面的成像系统

    公开(公告)号:US07518745B2

    公开(公告)日:2009-04-14

    申请号:US11237321

    申请日:2005-09-28

    CPC分类号: G06F3/016

    摘要: An imaging system includes a processing component which receives images to be rendered and a rendering device, such as a marking engine, fax machine or email system, in communication with the processing component for rendering an image supplied by the processing component. A haptic interface is in communication with the processing component for inputting commands from the user to the processing component for rendering the image, and outputting feedback from the processing component to the user as a force feedback.

    摘要翻译: 成像系统包括接收要呈现的图像的处理部件和与处理部件通信的呈现装置,例如标记引擎,传真机或电子邮件系统,用于呈现由处理部件提供的图像。 触觉界面与处理组件通信,用于从用户输入命令到用于渲染图像的处理组件,并将作为力反馈的来自处理组件的反馈输出给用户。

    Hierarchical clustering with real-time updating
    8.
    发明申请
    Hierarchical clustering with real-time updating 有权
    分层聚类与实时更新

    公开(公告)号:US20070239745A1

    公开(公告)日:2007-10-11

    申请号:US11391864

    申请日:2006-03-29

    IPC分类号: G06F7/00

    摘要: A probabilistic clustering system is defined at least in part by probabilistic model parameters indicative of word counts, ratios, or frequencies characterizing classes of the clustering system. An association of one or more documents in the probabilistic clustering system is changed from one or more source classes to one or more destination classes. Probabilistic model parameters characterizing classes affected by the changed association are locally updated without updating probabilistic model parameters characterizing classes not affected by the changed association.

    摘要翻译: 概率聚类系统至少​​部分地由指示表征群集系统的类的字数,比率或频率的概率模型参数定义。 概率聚类系统中的一个或多个文档的关联从一个或多个源类改变为一个或多个目的地类。 表征受改变的关联影响的类的概率模型参数是本地更新的,而不更新表征不受改变的关联影响的类的概率模型参数。

    Imaging system with haptic interface
    9.
    发明申请
    Imaging system with haptic interface 有权
    具有触觉界面的成像系统

    公开(公告)号:US20070070033A1

    公开(公告)日:2007-03-29

    申请号:US11237321

    申请日:2005-09-28

    IPC分类号: G09G5/00

    CPC分类号: G06F3/016

    摘要: An imaging system includes a processing component which receives images to be rendered and a rendering device, such as a marking engine, fax machine or email system, in communication with the processing component for rendering an image supplied by the processing component. A haptic interface is in communication with the processing component for inputting commands from the user to the processing component for rendering the image, and outputting feedback from the processing component to the user as a force feedback.

    摘要翻译: 成像系统包括接收要呈现的图像的处理部件和与处理部件通信的呈现装置,例如标记引擎,传真机或电子邮件系统,用于呈现由处理部件提供的图像。 触觉界面与处理组件通信,用于从用户输入命令到用于渲染图像的处理组件,并将作为力反馈的来自处理组件的反馈输出给用户。

    PRINTER IMAGE LOG SYSTEM FOR DOCUMENT GATHERING AND RETENTION
    10.
    发明申请
    PRINTER IMAGE LOG SYSTEM FOR DOCUMENT GATHERING AND RETENTION 有权
    用于文件记录和保留的打印机图像记录系统

    公开(公告)号:US20130077857A1

    公开(公告)日:2013-03-28

    申请号:US13683143

    申请日:2012-11-21

    IPC分类号: G06K9/62

    摘要: A system and method for document image acquisition and retrieval find application in litigation for responding to discovery requests. The method includes receiving automatically acquired electronic image logs comprising image data and associated records for documents processed by a plurality of image output devices within an organization. When a request for document production is received, the image logs (and/or information extracted therefrom) are automatically filtered through at least one classifier trained to return documents responsive to the document request, and documents corresponding to the filtered out image logs are output. One of the filters may be configured for filtering out documents that include attorney-client exchanges.

    摘要翻译: 用于文件图像采集和检索的系统和方法在诉讼中找到应答以响应发现请求。 该方法包括接收自动获取的电子图像日志,其包括由组织内的多个图像输出设备处理的文档的图像数据和相关联的记录。 当接收到文档生成请求时,通过经过训练以响应于文档请求返回文档的至少一个分类器自动过滤图像日志(和/或从其提取的信息),并且输出与过滤出的图像日志相对应的文档。 其中一个过滤器可以被配置为过滤包括律师 - 客户端交换的文档。