Abstract:
Modular content framework and document format methods and systems are described. The described framework and format define a set of building blocks for composing, packaging, distributing, and rendering document-centered content. These building blocks define a platform-independent framework for document formats that enable software and hardware systems to generate, exchange, and display documents reliably and consistently. The framework and format have been designed in a flexible and extensible fashion. In addition to this general framework and format, a particular format, known as the reach package format, is defined using the general framework. The reach package format is a format for storing paginated documents. The contents of a reach package can be displayed or printed with full fidelity among devices and applications in a wide range of environments and across a wide range of scenarios.
Abstract:
Methods and systems permit rendering a text element formatted using unavailable or restricted fonts. In one embodiment, when producing a document that contains a restricted font, text portions formatted with the restricted font are processed into an alternate representation. A textual representation for the text portions and the additional representation(s) are embedded in a document or container. A unique representation for the restricted font file allows a consuming computing system to attempt to find the restricted font locally. Each textual representation is rendered using its corresponding restricted font, if the restricted font file is found. Otherwise, the additional representation(s) are used.
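The fallback logic described above can be sketched as follows. This is only an illustration under stated assumptions: the "unique representation" of the font file is taken to be a SHA-256 digest of its bytes, and the names `TextRun`, `font_digest`, and `render` are hypothetical, not taken from the abstract.

```python
import hashlib
from dataclasses import dataclass


@dataclass
class TextRun:
    text: str         # textual representation of the text portion
    font_digest: str  # assumed unique representation of the restricted font file
    fallback: str     # alternate representation (e.g. a reference to pre-rendered glyphs)


def font_digest(font_bytes: bytes) -> str:
    """Assumption: identify a font file by the SHA-256 digest of its contents."""
    return hashlib.sha256(font_bytes).hexdigest()


def render(run: TextRun, local_fonts: dict) -> str:
    """Use the restricted font if the consumer has it locally, else the fallback."""
    font_name = local_fonts.get(run.font_digest)
    if font_name is not None:
        return f"text:{run.text}@{font_name}"
    return f"fallback:{run.fallback}"
```

Here `local_fonts` stands in for whatever local font lookup the consuming system performs; it maps a digest to an installed font name.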
Abstract:
Systems, methods, and/or techniques (“tools”) for improved search techniques for page-based document layouts are described herein. The tools may analyze markup elements defined for pages within source documents, and may determine whether the markup elements for the page may include at least part of a search string.
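One way to picture the per-element check is the sketch below. It assumes each markup element on a page contributes a plain text fragment and that fragments are already in reading order; the function name `find_in_page` is hypothetical. It reports which elements contain at least part of the first match of the search string.

```python
def find_in_page(elements: list, query: str) -> list:
    """Return indices of elements that hold at least part of the first match.

    `elements` is the list of text fragments carried by the page's markup
    elements, assumed to be in reading order. Returns [] if there is no match.
    """
    if not query:
        return []
    joined = "".join(elements)
    start = joined.find(query)
    if start < 0:
        return []
    end = start + len(query)

    hits, offset = [], 0
    for i, text in enumerate(elements):
        lo, hi = offset, offset + len(text)
        # The element overlaps the match if their character ranges intersect.
        if lo < end and hi > start:
            hits.append(i)
        offset = hi
    return hits
```

For example, with fragments `["Hel", "lo wor", "ld", "!"]` and the query `"lo world"`, the match spans the second and third elements.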
Abstract:
Various technologies and techniques detect tables in vector graphics based documents and use them in meaningful ways. The system detects at least one table in a vector graphics based document using a set of rules. The rules include analyzing a set of content representing horizontal and vertical lines to find intersections and identifying table cells based on the intersections. Once identified, the table content is translated into a modified format. The content can be output to a destination application in the modified format that is more suitable for output or use by the destination application.
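The intersection rule can be illustrated with a minimal sketch. It assumes axis-aligned line segments, with horizontal lines given as `(y, x0, x1)` and vertical lines as `(x, y0, y1)`; these tuple layouts and the function names are assumptions for illustration. A cell is reported wherever four neighbouring intersections form a rectangle. Merged cells and partially drawn borders are not handled.

```python
def intersections(h_lines, v_lines):
    """Return the set of (x, y) points where a horizontal and a vertical line cross."""
    pts = set()
    for y, x0, x1 in h_lines:
        for x, y0, y1 in v_lines:
            if x0 <= x <= x1 and y0 <= y <= y1:
                pts.add((x, y))
    return pts


def cells(h_lines, v_lines):
    """Return cells as (left, top, right, bottom), in row-major order."""
    pts = intersections(h_lines, v_lines)
    xs = sorted({x for x, _ in pts})
    ys = sorted({y for _, y in pts})
    out = []
    for top, bottom in zip(ys, ys[1:]):
        for left, right in zip(xs, xs[1:]):
            corners = {(left, top), (right, top), (left, bottom), (right, bottom)}
            # Simplification: a cell exists if all four corners are intersections.
            if corners <= pts:
                out.append((left, top, right, bottom))
    return out
```

Three horizontal and three vertical lines forming a closed grid yield nine intersections and four cells.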
Abstract:
Methods and systems for preserving unknown markup in a strongly typed environment are described. In but one embodiment, XML-based markup that may contain XML-based elements that are both known and unknown is received. A strongly typed tree that is associated with the known XML-based elements is instantiated and a weakly typed tree that is associated with both the XML-based known and unknown elements is instantiated. The strongly and the weakly typed trees are then correlated in a manner that preserves the unknown XML-based elements.
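A rough sketch of the idea, not the patented method itself: Python's `xml.etree.ElementTree` plays the role of the weakly typed tree holding every element, a dataclass plays the role of a strongly typed node for a known element, and a stored reference correlates the two. The element names `para` and `future` and the helpers `load` and `save` are hypothetical.

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass


@dataclass
class Paragraph:
    """Strongly typed node for the known <para> element."""
    text: str
    source: ET.Element  # correlation link into the weakly typed tree


def load(markup: str):
    """Build the weak tree for all elements and strong nodes for known ones."""
    weak_root = ET.fromstring(markup)
    strong = [Paragraph(e.text or "", e) for e in weak_root.iter("para")]
    return weak_root, strong


def save(weak_root: ET.Element, strong: list) -> str:
    """Write strong-node edits back through the links; unknown elements are untouched."""
    for p in strong:
        p.source.text = p.text
    return ET.tostring(weak_root, encoding="unicode")
```

Because serialization goes through the weak tree, an element the strongly typed model does not know about survives a load, edit, and save round trip.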
Abstract:
Various technologies and techniques detect lists in vector graphics based documents and use them in meaningful ways. The system detects at least one list in a vector graphics based document using a set of rules. Pattern detection logic identifies characters, symbols, numbers, letters, and/or images that may start a list. Additional pattern detection logic determines if a list exists. The system can identify and parse bulleted lists, numbered or lettered lists, and nested lists that are any combination of both. Once identified, the content is translated into a modified format. The content can be output to a destination application in the modified format that is more suitable for output or use by the destination application.
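The two-stage detection can be sketched as below. The marker pattern and the rule that a list needs at least two consecutive items of the same kind are assumptions chosen for illustration; the abstract does not state the actual rules. Nested lists and image markers are left out to keep the sketch short.

```python
import re

# Stage 1: does a line start with something that may begin a list item?
MARKER = re.compile(
    r"^\s*(?:(?P<bullet>[•\-*])|(?P<num>\d+)[.)]|(?P<alpha>[A-Za-z])[.)])\s+(?P<body>.+)$"
)


def classify(line: str):
    """Return (kind, body) for a candidate list item, or None."""
    m = MARKER.match(line)
    if not m:
        return None
    kind = "bullet" if m.group("bullet") else "number" if m.group("num") else "letter"
    return kind, m.group("body")


def detect_lists(lines, min_items=2):
    """Stage 2: a run of same-kind candidates of length >= min_items is a list."""
    lists, current = [], []
    for line in [*lines, ""]:  # the empty sentinel flushes the final run
        item = classify(line)
        if item and (not current or current[-1][0] == item[0]):
            current.append(item)
            continue
        if len(current) >= min_items:
            lists.append((current[0][0], [body for _, body in current]))
        current = [item] if item else []
    return lists
```

The `min_items` threshold is what keeps an isolated line such as "A. Smith wrote this" from being reported as a one-item lettered list.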
Abstract:
Semantic objects are created that provide a structure for markup language representations of documents. The semantic objects include text runs that are produced from the markup language representation and that are placed into semantic blocks that group text runs according to how text is logically structured in the document being represented. The text runs of each semantic block are ordered to correspond to the logical order of the document being represented. The semantic blocks corresponding to each page of the document being represented are ordered to correspond to the logical order of the document being represented. The ordered semantic blocks, including the ordered text runs, are saved as a semantic object, which can then be used to exploit the logical structure of the document being represented by the markup language.
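A minimal sketch of building such a semantic object follows. The grouping rule is an assumption made for illustration: logical order is approximated by page, then vertical position, then horizontal position, and a vertical gap larger than `max_gap` starts a new block. The abstract does not specify how blocks are actually formed, and all names here are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class TextRun:
    page: int
    x: float
    y: float
    text: str


@dataclass
class SemanticBlock:
    page: int
    runs: list = field(default_factory=list)


def build_semantic_object(runs, max_gap=20.0):
    """Group runs into ordered blocks; the returned list is the semantic object."""
    blocks = []
    # Assumed logical order: page first, then top to bottom, then left to right.
    for run in sorted(runs, key=lambda r: (r.page, r.y, r.x)):
        last = blocks[-1] if blocks else None
        if last and last.page == run.page and run.y - last.runs[-1].y <= max_gap:
            last.runs.append(run)
        else:
            blocks.append(SemanticBlock(run.page, [run]))
    return blocks
```

Runs that arrive in arbitrary markup order come out grouped per page, with both the blocks and the runs inside each block in reading order.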
Abstract:
Various embodiments develop (and consume), along with rendered print job data, metadata that describes certain characteristics of the print job data. This metadata can be provided, along with the rendered data, from a client device to a print server and can allow the print server to ascertain the nature or context of the print job data. In some embodiments, the metadata can describe such things as page boundaries and state transition data. By ascertaining the nature or context of the print job data, the print server is able to intelligently act upon this information and, in at least some embodiments, implement additional print server features that would not be possible if only rendered data were sent to the print server.
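As an illustration of why page-boundary metadata helps, the sketch below lets a print server pull a page range out of an otherwise opaque rendered stream. The `PrintJob` layout, with byte offsets marking where each page starts, is an assumption. State transition data, which a real stream would need in order to replay printer state before the first extracted page, is deliberately left out.

```python
from dataclasses import dataclass


@dataclass
class PrintJob:
    rendered: bytes     # rendered print job data, opaque to the server
    page_offsets: list  # metadata: byte offset at which each page starts


def extract_pages(job: PrintJob, first: int, last: int) -> bytes:
    """Return the rendered bytes for pages first..last (0-based, inclusive)."""
    bounds = job.page_offsets + [len(job.rendered)]
    return job.rendered[bounds[first]:bounds[last + 1]]
```

Without the offsets, the server would have to parse the rendered data itself to offer a feature such as reprinting a single page.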