Abstract:
Script detection service techniques are described. In an implementation, values representing individual text characters in a string of one or more text characters are identified to determine which human writing system is associated with the individual text characters. The values are compared to a table that associates subsets of values with individual human writing systems. The values are determined to be within a particular subset of values in the table that correspond to a particular human writing system. A particular human writing system is designated as associated with the string based on the values associated with the individual text characters in the string being within the particular subset of values that corresponds with the particular human writing system.
Abstract:
Script detection service techniques are described. In an implementation, a determination is made as to which human writing system is associated with individual text characters in a string of one or more text characters based on values representing the individual text characters in the string. A particular human writing system is designated as associated with the string based on the values associated with the individual text characters in the string.
Abstract:
A system and method for converting a source data sequence to a target encoding is disclosed. A fallback technique, specifiable by a user program component, is applied to data in the source data sequence that cannot be mapped to the target encoding. A particular embodiment involves methods for encoding a Unicode string as a sequence of bytes in a target encoding, and decoded the encoded bytes back to Unicode.
Abstract:
In a computer operating system having a user interface, a resource loader, and a registry resource database, a method for dynamically verifying whether the resource content of a resource module is compatible with the most current version of the operating system. Upon obtaining a request for a resource, the resource loader verifies that the resource modules remain compatible with the operating system prior to loading a resource from an alternate resource module. In accordance with one embodiment of the present invention, the operating system has a multilingual user interface and utilizes resource modules that are language specific.
Abstract:
An application program interface (API) provides a set of functions, including a set of base classes and types that are used in substantially all applications accessing the API, for application developers who build Web applications on Microsoft Corporation's .NET™ platform.
Abstract:
A method and system for mapping a number of characters in a string, wherein the string comprises a combination of characters representing indexed expressions and a combination of characters representing non-indexed expressions. One embodiment produces a weight array that can be utilized to compare a first and second string having indexed and non-indexed expressions. In one embodiment, a method generates a set of special weights for characters that represent indexed and non-indexed expressions. The method then associates a weight value of an indexed expression with the specific group of characters representing a specific non-indexed expression, and generates a weight array by retrieving a plurality of special weights associated with the specific group of characters representing the specific non-indexed expression and the associated weight value of the indexed expression.
Abstract:
A JAVA-based virtual machine that supports input method editor (IME) functionality in three contexts: (1) using JAVA-based IMEs with a JAVA virtual machine; (2) writing JAVA-based IMEs; and (3) integrating a JAVA virtual machine to system IMEs. The virtual machine includes a windows manager, an input method manager, one or more IMEs and/or keyboard drivers (KBDs), and a component. The virtual machine may also include an IME bridge for integrating the virtual machine with a system IME. The windows manager serves as a gateway between the operating system and the virtual machine. The input method manager (IMM) serves as a router between the windows manager and a plurality of IMEs and keyboards driver (KBDs). The IME includes a look-up table of composed characters that are indexed by Unicode keyboard commands. The component typically renders a user interface that allows the user to manipulate the composed characters returned by the IME. The functionality of the IME and the component may vary from platform to platform, and from user to user. The virtual machine can implement a predefined set of interfaces that allow the IME and the component to communicate with each other and with the windows manager and the input method manager. These predefined interfaces allow users to modify the functionality of the IME and component and integrate these modified elements with the virtual machine.
Abstract:
Map service techniques are described. In an implementation, text is received from an application for processing by one or more linguistic services. Based on service properties of respective linguistic services that are relevant to the application, particular linguistic services are designated to be available for use by the application and one or more other linguistic services are obscured from the application. A communication is formed to communication the text to a designated linguistic service.
Abstract:
Script detection service techniques are described. In an implementation, values representing individual text characters in a string of one or more text characters are identified to determine which human writing system is associated with the individual text characters. The values are compared to a table that associates subsets of values with individual human writing systems. The values are determined to be within a particular subset of values in the table that correspond to a particular human writing system. A particular human writing system is designated as associated with the string based on the values associated with the individual text characters in the string being within the particular subset of values that corresponds with the particular human writing system.
Abstract:
Language detection techniques are described. In implementation, a method comprises determining which human writing system is associated with text characters in a string based on values representing the text characters. When the values are associated with more than one human language, the string is compared with a targeted dictionary to identify a corresponding human language associated with the string. Linguistic services are designated to be available based on service properties of the linguistic services and based on the corresponding human language associated with the string.