Parsing extensible markup language (XML) data streams
    1.
    发明授权
    Parsing extensible markup language (XML) data streams 失效
    解析可扩展标记语言(XML)数据流

    公开(公告)号:US07073123B2

    公开(公告)日:2006-07-04

    申请号:US11271931

    申请日:2005-11-10

    IPC分类号: G06F17/00

    CPC分类号: G06F17/2247 G06F17/272

    摘要: In one embodiment, a method of parsing an XML data stream comprises receiving an XML data stream containing a namespace prefix and an associated element tag name. The element tag name is associated with an element tag. The namespace prefix and the element tag name are converted into a token that uniquely represents a namespace specification that is associated with the namespace prefix and the element tag. A stack is defined and is configured to receive one or more tokens during parsing of the XML data stream. Parsing of the XML data stream is performed without requiring an XML tree structure comprising an XML document embodied by the XML data stream, to be built.

    摘要翻译: 在一个实施例中,解析XML数据流的方法包括接收包含命名空间前缀和相关联的元素标签名称的XML数据流。 元素标签名称与元素标签相关联。 命名空间前缀和元素标签名称将转换为唯一表示与命名空间前缀和元素标记相关联的命名空间规范的令牌。 定义了一个堆栈,并将其配置为在解析XML数据流期间接收一个或多个令牌。 执行XML数据流的解析,而不需要构建包含由XML数据流体现的XML文档的XML树结构。

    Methods and apparatus for parsing extensible markup language (XML) data streams
    2.
    发明授权
    Methods and apparatus for parsing extensible markup language (XML) data streams 有权
    用于解析可扩展标记语言(XML)数据流的方法和装置

    公开(公告)号:US06763499B1

    公开(公告)日:2004-07-13

    申请号:US09361784

    申请日:1999-07-26

    IPC分类号: G06F1722

    CPC分类号: G06F17/2247 G06F17/272

    摘要: Various features enable an XML data stream to be parsed without the need to build a hierarchical tree structure for the XML document. In the described embodiment, the concept of an element or namespace stack is utilized as a way of organizing parsing activities and maintaining a definable place within the structure of the XML document. Various structures work together with the element or namespace stack to facilitate piecewise parsing of the XML data stream. One structure is a namespace hierarchy that is a collection of namespace objects that each represent a namespace specification that is encountered in the XML data stream. Each object includes a namespace prefix and an associated namespace specification. This structure creates a hierarchical organization that is used for mapping a particular encountered namespace specification into a unique value that represents both the namespace specification and an element tag in which the namespace specification occurs. Another structure is a dictionary collection that contains one or more dictionaries. Each dictionary is specifically associated with a namespace specification that is encountered in the XML data stream. The dictionaries contain entries for one or more tag names and each name's associated unique token. The token is returned and placed on the element stack along with another special value that enables the proper state to be maintained during processing of the XML data stream. The stack also includes a text accumulation buffer that can hold any text that is contained within an element (between the element tags). When an XML element is encountered, the element stack is used to organize parsing activities as the parser makes its way through the XML data stream.

    摘要翻译: 各种功能使得可以解析XML数据流,而无需为XML文档构建分层树结构。 在所描述的实施例中,元素或命名空间堆栈的概念被用作组织解析活动并在XML文档的结构内维护可定义的位置的方式。 各种结构与元素或命名空间堆栈一起工作,以便于对XML数据流进行分段解析。 一个结构是命名空间层次结构,它是命名空间对象的集合,每个对象都表示在XML数据流中遇到的命名空间规范。 每个对象包括命名空间前缀和关联的命名空间规范。 此结构创建一个分层组织,用于将特定遇到的命名空间规范映射到表示命名空间规范发生的命名空间规范和元素标记的唯一值。 另一个结构是包含一个或多个字典的字典集合。 每个字典与XML数据流中遇到的命名空间规范特别相关。 字典包含一个或多个标签名称和每个名称相关的唯一标记的条目。 令牌被返回并放置在元素堆栈上,以及另外一个特殊的值,可以在处理XML数据流期间保持正确的状态。 堆栈还包括一个文本累积缓冲区,可以容纳元素中包含的任何文本(元素标签之间)。 当遇到XML元素时,元素堆栈用于组织解析活动,因为解析器通过XML数据流。

    Methods and apparatus for parsing extensible markup language (XML) data streams
    3.
    发明授权
    Methods and apparatus for parsing extensible markup language (XML) data streams 失效
    用于解析可扩展标记语言(XML)数据流的方法和装置

    公开(公告)号:US07007230B2

    公开(公告)日:2006-02-28

    申请号:US11018208

    申请日:2004-12-20

    IPC分类号: G06F17/00

    CPC分类号: G06F17/2247 G06F17/272

    摘要: Systems for parsing an XML data stream are described. In one embodiment, the system is configured to receive an XML data stream comprising one or more element tags and determine whether an element tag contains a namespace declaration. The system creates one or more namespace objects if an element tag contains one or more respective namespace declarations, each namespace object corresponding to one namespace declaration. The system associates namespace objects with one another if more than one namespace object is created and associates each namespace object with a dictionary that contains one or more entries that are associated with an element tag.

    摘要翻译: 描述用于解析XML数据流的系统。 在一个实施例中,系统被配置为接收包括一个或多个元素标签的XML数据流,并且确定元素标签是否包含命名空间声明。 如果元素标签包含一个或多个相应的命名空间声明,则每个命名空间对象对应一个命名空间声明,系统将创建一个或多个命名空间对象。 如果创建了多个命名空间对象,则系统将命名空间对象相互关联,并将每个命名空间对象与包含与元素标记关联的一个或多个条目的字典相关联。

    Methods and apparatus for parsing extensible markup language (XML) data streams
    4.
    发明授权
    Methods and apparatus for parsing extensible markup language (XML) data streams 失效
    用于解析可扩展标记语言(XML)数据流的方法和装置

    公开(公告)号:US06996773B2

    公开(公告)日:2006-02-07

    申请号:US10841401

    申请日:2004-05-07

    IPC分类号: G06F17/00

    CPC分类号: G06F17/2247 G06F17/272

    摘要: In one embodiment, a method of parsing an XML data stream comprises receiving an XML data stream containing a namespace prefix and an associated element tag name. The element tag name is associated with an element tag. The namespace prefix and the element tag name are converted into a token that uniquely represents a namespace specification that is associated with the namespace prefix and the element tag. A stack is defined and is configured to receive one or more tokens during parsing of the XML data stream. Parsing of the XML data stream is performed without requiring an XML tree structure comprising an XML document embodied by the XML data stream, to be built.

    摘要翻译: 在一个实施例中,解析XML数据流的方法包括接收包含命名空间前缀和相关联的元素标签名称的XML数据流。 元素标签名称与元素标签相关联。 命名空间前缀和元素标签名称将转换为唯一表示与命名空间前缀和元素标记相关联的命名空间规范的令牌。 定义了一个堆栈,并将其配置为在解析XML数据流期间接收一个或多个令牌。 执行XML数据流的解析,而不需要构建包含由XML数据流体现的XML文档的XML树结构。

    Methods and systems for processing XML documents
    5.
    发明授权
    Methods and systems for processing XML documents 有权
    用于处理XML文档的方法和系统

    公开(公告)号:US07134072B1

    公开(公告)日:2006-11-07

    申请号:US09417990

    申请日:1999-10-13

    IPC分类号: G06F17/00

    CPC分类号: G06F17/227 G06F17/2247

    摘要: An architecture for processing an Extensible Markup Language (XML) document converts schema elements in the XML document to data type definition (DTD) objects that can be used to validate data elements in the XML document. The architecture utilizes a node factory design in which an XML parser calls one or more node factory interfaces to construct an in-memory tree representation of an XML document. One of the node factory interfaces is a schema node factory, which is a thin layer that receives calls from the parser to build nodes in the tree representation and translates those calls to calls to a schema builder. The schema builder is a table driven interface that converts the schema elements in the XML document into DTD objects. The DTD objects are then used to validate the data elements as belonging to the schema. If valid, the data elements are used to construct the tree representation.

    摘要翻译: 用于处理可扩展标记语言(XML)文档的架构将XML文档中的模式元素转换为可用于验证XML文档中的数据元素的数据类型定义(DTD)对象。 该架构使用节点工厂设计,其中XML解析器调用一个或多个节点工厂接口来构造XML文档的内存中的树表示。 节点工厂接口之一是一个模式节点工厂,它是从分析器接收调用以构建树表示中的节点的薄层,并将这些调用转换为对构建器的调用。 模式构建器是一个表驱动接口,将XML文档中的模式元素转换为DTD对象。 然后,DTD对象用于验证属于模式的数据元素。 如果有效,则使用数据元素构建树表示。

    XSD INFERENCE
    6.
    发明申请

    公开(公告)号:US20090030920A1

    公开(公告)日:2009-01-29

    申请号:US12239548

    申请日:2008-09-26

    IPC分类号: G06F17/30

    CPC分类号: G06F17/2247 G06F17/2725

    摘要: The automatic generation of schemas for XML documents is provided. In an illustrative implementation, a computer readable medium having computer readable instructions to instruct a computing environment to execute one or more inference algorithms is provided. In operation, an XML document is processed according to the computer readable instructions such that the content and tags of the XML document are identified. The XML document is processed according to an inference algorithm, which executes one or more processing rule, and uses the XML document information in conjunction with the rules and operations of the XML schema definition language, to automatically produce a schema for the XML document.

    摘要翻译: 提供了自动生成XML文档的模式。 在说明性实现中,提供了具有指令计算环境执行一个或多个推理算法的计算机可读指令的计算机可读介质。 在操作中,根据计算机可读指令处理XML文档,以便识别XML文档的内容和标签。 根据推理算法处理XML文档,该算法执行一个或多个处理规则,并结合XML模式定义语言的规则和操作使用XML文档信息,以自动生成XML文档的模式。

    SCALEABLE FAULT-TOLERANT METADATA SERVICE
    8.
    发明申请
    SCALEABLE FAULT-TOLERANT METADATA SERVICE 有权
    可扩展的容错元数据服务

    公开(公告)号:US20110289049A1

    公开(公告)日:2011-11-24

    申请号:US12782730

    申请日:2010-05-19

    IPC分类号: G06F17/30

    摘要: Metadata may be stored in, and retrieved from, a scalable, fault-tolerant metadata service. In one example, metadata is divided into partitions, and each partition is served by one or more nodes. For each partition, a first one of the nodes may handle read and write requests, and the other nodes may handle read requests in the event that the first node is down or is experiencing high load. When a request is made with respect to metadata, a metadata server may identify a node, in the partition to which the metadata is assigned, to which the request is to be made. The entity that is making the request then contacts that node, and requests the read or write on the metadata. In a partition, metadata may be replicated between the first node and the other nodes using a log-based replication protocol.

    摘要翻译: 元数据可以存储在可扩展的容错元数据服务中并从其检索。 在一个示例中,元数据被划分成分区,并且每个分区由一个或多个节点服务。 对于每个分区,第一个节点可以处理读取和写入请求,并且其他节点可以在第一个节点关闭或正在经历高负载的情况下处理读取请求。 当针对元数据提出请求时,元数据服务器可以识别要向其分配请求的分配中的元数据分配的节点。 然后,发出请求的实体联系该节点,并请求读取或写入元数据。 在分区中,可以使用基于日志的复制协议在第一节点和其他节点之间复制元数据。

    XSD inference
    9.
    发明授权
    XSD inference 有权
    XSD推理

    公开(公告)号:US07530015B2

    公开(公告)日:2009-05-05

    申请号:US10606340

    申请日:2003-06-25

    IPC分类号: G06F17/30

    CPC分类号: G06F17/2247 G06F17/2725

    摘要: The automatic generation of schemas for XML documents is provided. In an illustrative implementation, a computer readable medium having computer readable instructions to instruct a computing environment to execute one or more inference algorithms is provided. In operation, an XML document is processed according to the computer readable instructions such that the content and tags of the XML document are identified. The XML document is processed according to an inference algorithm, which executes one or more processing rule, and uses the XML document information in conjunction with the rules and operations of the XML schema definition language, to automatically produce a schema for the XML document.

    摘要翻译: 提供了自动生成XML文档的模式。 在说明性实现中,提供了具有指令计算环境执行一个或多个推理算法的计算机可读指令的计算机可读介质。 在操作中,根据计算机可读指令处理XML文档,以便识别XML文档的内容和标签。 根据推理算法处理XML文档,该算法执行一个或多个处理规则,并结合XML模式定义语言的规则和操作使用XML文档信息,以自动生成XML文档的模式。

    Scaleable fault-tolerant metadata service
    10.
    发明授权
    Scaleable fault-tolerant metadata service 有权
    可扩展的容错元数据服务

    公开(公告)号:US08595184B2

    公开(公告)日:2013-11-26

    申请号:US12782730

    申请日:2010-05-19

    IPC分类号: G06F7/00 G06F17/00 G06F17/30

    摘要: Metadata may be stored in, and retrieved from, a scalable, fault-tolerant metadata service. In one example, metadata is divided into partitions, and each partition is served by one or more nodes. For each partition, a first one of the nodes may handle read and write requests, and the other nodes may handle read requests in the event that the first node is down or is experiencing high load. When a request is made with respect to metadata, a metadata server may identify a node, in the partition to which the metadata is assigned, to which the request is to be made. The entity that is making the request then contacts that node, and requests the read or write on the metadata. In a partition, metadata may be replicated between the first node and the other nodes using a log-based replication protocol.

    摘要翻译: 元数据可以存储在可扩展的容错元数据服务中并从其检索。 在一个示例中,元数据被划分成分区,并且每个分区由一个或多个节点服务。 对于每个分区,第一个节点可以处理读取和写入请求,并且其他节点可以在第一个节点关闭或正在经历高负载的情况下处理读取请求。 当针对元数据提出请求时,元数据服务器可以识别要向其分配请求的分配中的元数据分配的节点。 然后,发出请求的实体联系该节点,并请求读取或写入元数据。 在分区中,可以使用基于日志的复制协议在第一节点和其他节点之间复制元数据。