Abstract:
A method and system for mapping a number of characters in a string, wherein the string comprises a combination of characters representing indexed expressions and a combination of characters representing non-indexed expressions. One embodiment produces a weight array that can be utilized to compare a first and second string having indexed and non-indexed expressions. In one embodiment, a method generates a set of special weights for characters that represent indexed and non-indexed expressions. The method then associates a weight value of an indexed expression with the specific group of characters representing a specific non-indexed expression, and generates a weight array by retrieving a plurality of special weights associated with the specific group of characters representing the specific non-indexed expression and the associated weight value of the indexed expression.
Abstract:
A method and system for mapping a number of characters in a string, wherein the string comprises a combination of characters representing indexed expressions and a combination of characters representing non-indexed expressions. One embodiment produces a weight array that can be utilized to compare a first and second string having indexed and non-indexed expressions. In one embodiment, a method generates a set of special weights for characters that represent indexed and non-indexed expressions. The method then associates a weight value of an indexed expression with the specific group of characters representing a specific non-indexed expression, and generates a weight array by retrieving a plurality of special weights associated with the specific group of characters representing the specific non-indexed expression and the associated weight value of the indexed expression.
Abstract:
A system and method for converting a source data sequence to a target encoding is disclosed. A fallback technique, specifiable by a user program component, is applied to data in the source data sequence that cannot be mapped to the target encoding. A particular embodiment involves methods for encoding a Unicode string as a sequence of bytes in a target encoding, and decoded the encoded bytes back to Unicode.
Abstract:
Linguistic service platform techniques are described. In implementations, one or more computer-readable media comprise instructions that are executable by a computer to designate a linguistic service having a particular property responsive to an application program interface call specifying the property. Communication may be brokered between the linguistic service and the application so that communication occurs without the application directly communicating with the linguistic service.
Abstract:
An application program interface (API) provides a set of functions for application developers who build Web applications on Microsoft Corporation's .NET™ platform.
Abstract:
An application program interface (API) provides a set of functions, including a set of base classes and types that are used in substantially all applications accessing the API, for application developers who build Web applications on Microsoft Corporation's .NET™ platform.
Abstract:
A system and method for converting a source data sequence to a target encoding is disclosed. A fallback technique, specifiable by a user program component, is applied to data in the source data sequence that cannot be mapped to the target encoding. A particular embodiment involves methods for encoding a Unicode string as a sequence of bytes in a target encoding, and decoded the encoded bytes back to Unicode.
Abstract:
Map service techniques are described. In an implementation, text is received from an application for processing by one or more linguistic services. Based on service properties of respective linguistic services that are relevant to the application, particular linguistic services are designated to be available for use by the application and one or more other linguistic services are obscured from the application. A communication is formed to communication the text to a designated linguistic service.
Abstract:
Language detection techniques are described. In implementation, a method comprises determining which human writing system is associated with text characters in a string based on values representing the text characters. When the values are associated with more than one human language, the string is compared with a targeted dictionary to identify a corresponding human language associated with the string. Linguistic services are designated to be available based on service properties of the linguistic services and based on the corresponding human language associated with the string.
Abstract:
Linguistic service platform techniques are described. In implementations, one or more computer-readable media comprise instructions that are executable by a computer to designate a linguistic service having a particular property responsive to an application program interface call specifying the property. Communication may be brokered between the linguistic service and the application so that communication occurs without the application directly communicating with the linguistic service.