Apparatus and method for multimedia object retrieval
    1.
    发明申请
    Apparatus and method for multimedia object retrieval 审中-公开
    多媒体对象检索的装置和方法

    公开(公告)号:US20050050086A1

    公开(公告)日:2005-03-03

    申请号:US10913514

    申请日:2004-08-09

    摘要: A multimedia object retrieval apparatus and method for retrieving multimedia objects from structured documents containing both a multimedia object and relevant explanation text. The apparatus and method parse an input structured document into a parsing result such as an HTML DOM tree; recognize a main block in the input parsing result and output a main block annotated structured document model; extract a pair of a multimedia object and corresponding explanation, and output a structured object index such as an XML format object index; and search through the structured object index to form a target object list. The apparatus and method can be applied to various kinds of structured documents, and can extract object explanations with a high precision. The apparatus and method may also identify the relationship between the object and the title of the input structured document.

    摘要翻译: 一种用于从包含多媒体对象和相关说明文本的结构化文档中检索多媒体对象的多媒体对象检索装置和方法。 该装置和方法将输入的结构化文档解析成诸如HTML DOM树的解析结果; 识别输入解析结果中的主块,并输出主块注释结构化文档模型; 提取一对多媒体对象和相应的解释,并输出结构化对象索引,如XML格式对象索引; 并搜索结构化对象索引以形成目标对象列表。 该装置和方法可以应用于各种结构化文档,可以高精度地提取对象解释。 设备和方法还可以标识对象和输入结构化文档的标题之间的关系。

    Annotation management program, device, method, and annotation editing program, device, method
    2.
    发明申请
    Annotation management program, device, method, and annotation editing program, device, method 审中-公开
    注释管理程序,设备,方法和注释编辑程序,设备,方法

    公开(公告)号:US20080147677A1

    公开(公告)日:2008-06-19

    申请号:US11983089

    申请日:2007-11-07

    IPC分类号: G06F17/30

    摘要: A management module of an annotation server receives annotation data, which contains location information of web page data, content information of an annotation, and position information of an object to which the annotation is linked, from a web client machine together with a registration request. Then, the management module issues an annotation ID. The module retrieves a text, which consists of an object to which the annotation is linked and an adjacent part that has a relationship satisfying a predetermined condition with the object, as context information from a source text of the web page. Then, the module registers the context information together with the annotation data that is received in advance into an annotation database.

    摘要翻译: 注释服务器的管理模块从Web客户端机器连同注册请求接收包含网页数据的位置信息,注释的内容信息和注释所链接的对象的位置信息的注释数据。 然后,管理模块发出注释ID。 模块从文本的源文本中检索文本,该文本由注释链接到的对象和与该对象具有满足预定条件的关系的相邻部分组成。 然后,模块将上下文信息与预先接收到的注释数据一起注册到注释数据库中。

    Apparatus for translating lingual morphemes as well as the typographical
morphemes attached thereto
    3.
    发明授权
    Apparatus for translating lingual morphemes as well as the typographical morphemes attached thereto 失效
    用于翻译语言语素的装置以及附在其上的排字语素

    公开(公告)号:US5361205A

    公开(公告)日:1994-11-01

    申请号:US923513

    申请日:1992-08-03

    摘要: An apparatus which translates a document (character string) having various kinds of typographical information, such as on a font size and on a font style, as character attributes, and reflects the typographical information attached to an original text of the document. The apparatus performs a morpheme analysis with typographical information saved, by regarding a piece of typographical information as a morpheme for a document having typographical information between characters. The apparatus judges typographical information attached to each character forming a morpheme after performing a morpheme analysis for a sentence having a piece of typographical information as a character attribute, and determines morpheme typographical information when characters forming a single morpheme carry different pieces of typographical information. The apparatus also separates a sentence whose morpheme is analyzed into a piece of typographical information and an original text translates the original document into some other language and converts to an appropriate one the piece of typographical information as necessary, anticipation of a case in which the piece of typographical information attached to the original document cannot be attached "as is" to its translation result.

    摘要翻译: 将具有各种印刷信息(例如字体大小和字体样式)的文档(字符串)翻译为字符属性的装置,并且反映附加到文档的原始文本的印刷信息。 该装置通过将一张印刷信息作为具有字符之间的印刷信息的文档的语素来执行对保存的印刷信息的语素分析。 该装置在对具有一排印刷信息的句子进行语素分析后,判断附加到每个字符上的字符信息,形成语素作为字符属性,并且当形成单个语素的字符携带不同的打印信息时,确定语素排版信息。 该设备还将其分析的语素分成一段印刷信息,原始文本将原始文档翻译成某种其他语言,并根据需要将其转换为适当的印刷信息,预期该片段 附于原始文件的印刷资料不能按原样附加到其翻译结果。

    Annotation management program, device, method and annotation display program, device, method
    4.
    发明申请
    Annotation management program, device, method and annotation display program, device, method 审中-公开
    注释管理程序,设备,方法和注释显示程序,设备,方法

    公开(公告)号:US20080133584A1

    公开(公告)日:2008-06-05

    申请号:US11983067

    申请日:2007-11-07

    IPC分类号: G06F17/30

    CPC分类号: G06F16/95

    摘要: Abstract of the Disclosure An annotation server stores annotation data sent from a web client into a first annotation database. The annotation server retrieves annotation data whose description information requires an execution result of a predetermined program from the first database, and incorporates the execution result of the predetermined program into the description information for the retrieved annotation data. Then, the computer transfers the data to the second database. Receiving a sending request for annotation data from a web client, the annotation server retrieves the annotation data from the second database and sends it to the web client that sent the request. Therefore, the web client displays the latest information as an annotation over a web page according to the annotation data received from the computer.

    摘要翻译: 公开摘要注释服务器将从Web客户端发送的注释数据存储到第一批注数据库中。 注释服务器检索其描述信息需要来自第一数据库的预定程序的执行结果的注释数据,并将预定程序的执行结果合并到检索到的注释数据的描述信息中。 然后,计算机将数据传输到第二个数据库。 从Web客户端接收对注释数据的发送请求,注释服务器从第二个数据库检索注释数据,并将其发送到发送请求的Web客户端。 因此,Web客户端根据从计算机接收到的注释数据,将最新信息显示为网页上的注释。

    Text data generation program, text data generation device, text data generation method, text-processing tool program, text-processing tool device; and text processing method
    5.
    发明申请
    Text data generation program, text data generation device, text data generation method, text-processing tool program, text-processing tool device; and text processing method 审中-公开
    文本数据生成程序,文本数据生成装置,文本数据生成方式,文本处理工具程序,文本处理工具装置; 和文字处理方法

    公开(公告)号:US20080082910A1

    公开(公告)日:2008-04-03

    申请号:US11894219

    申请日:2007-08-20

    IPC分类号: G06F3/14

    摘要: A text data generation program generates text data as a target of a text processing tool. The program controls a computer to receive location information of web page data from the text-processing tool, acquires web page data from a web server via a communication device in response to the location information, acquires annotation data, which is linked with the web page data, from an annotation server via the communication device, converts contents of the acquired annotation into a form that can be interpreted by the text-processing tool, embeds the converted contents at a position to which the annotation should link, and outputs the web page data to which the contents of the annotation are embedded to the text-processing tool.

    摘要翻译: 文本数据生成程序生成作为文本处理工具的目标的文本数据。 该程序控制计算机从文本处理工具接收网页数据的位置信息,响应于位置信息通过通信设备从网络服务器获取网页数据,获取与网页链接的注释数据 数据从注释服务器经由通信设备将所获取的注释的内容转换成可以由文本处理工具解释的形式,将转换的内容嵌入到注释应该链接到的位置,并输出网页 将批注内容嵌入文本处理工具的数据。

    Method and apparatus for recognizing specific type of information files
    6.
    发明申请
    Method and apparatus for recognizing specific type of information files 审中-公开
    用于识别特定类型的信息文件的方法和装置

    公开(公告)号:US20050267915A1

    公开(公告)日:2005-12-01

    申请号:US11135658

    申请日:2005-05-24

    IPC分类号: G06F17/30

    CPC分类号: G06F16/986

    摘要: The present invention provides a file recognition apparatus and method for recognizing specific information type with respect to a web page file group collected from the Internet or stored in other storage apparatus. The file recognition apparatus of the invention comprises: a file grouping section for classifying, from a predetermined viewpoint, the file group to be recognized by file type; a file type recognition section for recognizing the type of the files according to characteristics specific to the specific information type; and a file-type-recognition correction section for correcting the recognition result of each file in consideration of the recognition precision of all files in the group. The apparatus and method of the invention can recognize various types of information, and can obtain satisfying reorganization precision.

    摘要翻译: 本发明提供一种文件识别装置和方法,用于识别从因特网收集的或存储在其他存储装置中的网页文件组的特定信息类型。 本发明的文件识别装置包括:文件分组部件,用于从预定的视点分类要由文件类型识别的文件组; 文件类型识别部分,用于根据特定信息类型的特征来识别文件的类型; 以及文件类型识别校正部分,用于考虑到组中所有文件的识别精度来校正每个文件的识别结果。 本发明的装置和方法可以识别各种类型的信息,并且可以获得令人满意的重组精度。

    Apparatus and method for evaluating web pages
    7.
    发明授权
    Apparatus and method for evaluating web pages 失效
    用于评估网页的装置和方法

    公开(公告)号:US07395498B2

    公开(公告)日:2008-07-01

    申请号:US10327027

    申请日:2002-12-24

    IPC分类号: G06N3/00

    CPC分类号: G06F17/3061 G06F2216/03

    摘要: An evaluation apparatus learns the correspondence between domains and evaluation items from a Web page group in Internet, generates an evaluation set group, and generates a specified domain evaluation set by extracting evaluation items corresponding to the specified domain from the evaluation set group. Then, it evaluates a Web page to be evaluated based on the specified domain evaluation set.

    摘要翻译: 评估装置从因特网中的网页组中学习域和评价项之间的对应关系,生成评估集组,并通过从评估集组提取与指定域对应的评估项,生成指定的域评价集。 然后,它将根据指定的域评估集来评估要评估的网页。

    Queries-and-responses processing method, queries-and-responses processing program, queries-and-responses processing program recording medium, and queries-and-responses processing apparatus
    8.
    发明授权
    Queries-and-responses processing method, queries-and-responses processing program, queries-and-responses processing program recording medium, and queries-and-responses processing apparatus 失效
    查询和响应处理方法,查询和响应处理程序,查询和响应处理程序记录介质以及查询和响应处理设备

    公开(公告)号:US07343371B2

    公开(公告)日:2008-03-11

    申请号:US10028423

    申请日:2001-12-28

    IPC分类号: G06F7/00 G06F17/30 G06F17/27

    摘要: A query-and-response processing method for analyzing the intention of a query provided by a user reduces search result information to an amount manageable for the user, sorts out the result information, and presents it in an easily readable form to the user. A search request analyzer analyzes a search request provided from the user, a search criteria generator generates search criteria, then a search executor searches through a database. A query intention analyzer analyzes the intention of a query from the user, such as a query topic, and an output formatter, based on the result of the analysis, selects items to be presented to the user from the search results and determines the output format of the search results. A presentation module receives the results and presents the data to the user.

    摘要翻译: 用于分析用户提供的查询的意图的查询和响应处理方法将搜索结果信息减少到可管理用户的数量,对结果信息进行排序,并以易于读取的形式呈现给用户。 搜索请求分析器分析从用户提供的搜索请求,搜索条件生成器生成搜索条件,然后搜索执行器搜索数据库。 查询意图分析器基于分析结果分析来自用户的查询的意图,例如查询主题和输出格式化器,从搜索结果中选择要呈现给用户的项目,并确定输出格式 的搜索结果。 呈现模块接收结果并将数据呈现给用户。

    Information publication control method and apparatus, and information publication control instruction method, and apparatus
    9.
    发明申请
    Information publication control method and apparatus, and information publication control instruction method, and apparatus 审中-公开
    信息发布控制方法和装置以及信息发布控制指令方法和装置

    公开(公告)号:US20070198683A1

    公开(公告)日:2007-08-23

    申请号:US11443129

    申请日:2006-05-31

    IPC分类号: G06F15/173

    CPC分类号: G06F16/958

    摘要: An object of the present invention is to carry out publication control for a portion of contents according to its valid period. This invention includes: reading out publication data including first data whose publication should be controlled, publication control condition data relating to a valid period of the first data, and second data whose publication does not have to be controlled from a publication data storage storing the publication data to judge whether or not a condition defined in the publication control condition data is satisfied; and upon detecting that the condition defined in the publication control condition data is satisfied, generating current publication data including the first data corresponding to the publication control condition data whose condition is judged to be satisfied and the second data and outputting the generated current publication data. In this way, when the publication of the first data is controlled based on the publication control condition data concerning the valid period, it becomes possible to control not to open information whose validity has been lost such as the contact telephone number to inquire the event, to the public, for example, after the event ended or the like.

    摘要翻译: 本发明的目的是根据其有效期对一部分内容进行发布控制。 本发明包括:读出包括其出版物应受控制的第一数据的发布数据,与第一数据有效期有关的发布控制条件数据,以及不需要从出版物数据存储存储该出版物的第二数据 用于判断出版物控制条件数据中定义的条件是否满足的数据; 并且当检测到满足发布控制条件数据中定义的条件时,产生包括对应于其条件被判定为满足的发布控制条件数据的第一数据和第二数据的当前发布数据,并输出生成的当前发布数据。 以这种方式,当基于关于有效期的发布控制条件数据来控制第一数据的发布时,可以控制不打开有效性已经丢失的信息,诸如联系电话号码来查询事件, 例如,事件结束后等等。

    Document processing system and recording medium
    10.
    发明授权
    Document processing system and recording medium 失效
    文件处理系统和记录介质

    公开(公告)号:US06523025B1

    公开(公告)日:2003-02-18

    申请号:US09630553

    申请日:2000-08-01

    IPC分类号: G06F1730

    摘要: The accuracy of retrieving or clipping documents is improved. A document to be processed is input via a document input section. Event specifying means looks up knowledge information stored in knowledge information storing means to specify the type of an event described in the input document. Attribute value extracting means extracts, from the document, attribute values of attributes relating to the specified event. Correlating means performs a process of correlating the attribute values extracted by the attribute value extracting means with entities in the real world. Document storing means stores information (normalized information) generated by the correlating means and the document or information specifying a storage location thereof in a manner associated with each other. Document extracting means compares a query input from a user interface section with the normalized information and extracts, from the document storing means, matching documents or information specifying their storage locations.

    摘要翻译: 检索或裁剪文件的准确性得到改善。 要通过文档输入部分输入要处理的文档。 事件指定装置查找存储在知识信息存储装置中的知识信息,以指定在输入文档中描述的事件的类型。 属性值提取装置从文档中提取与指定事件相关的属性的属性值。 相关装置执行将由属性值提取装置提取的属性值与现实世界中的实体相关联的处理。 文档存储装置以相关联的方式存储由相关装置生成的信息(归一化信息)和指定其存储位置的文档或信息。 文档提取装置将来自用户界面部分的查询输入与归一化信息进行比较,并从文档存储装置提取指定其存储位置的匹配文档或信息。