摘要:
An apparatus for providing adaptive language model scaling includes an adaptive scaling element and an interface element. The adaptive scaling element is configured to receive input speech comprising a sequence of spoken words and to determine a plurality of candidate sequences of text words in which each of the candidate sequences has a corresponding sentence score representing a probability that a candidate sequence matches the sequence of spoken words. Each corresponding sentence score is calculated using an adaptive scaling factor. The interface element is configured to receive a user input selecting one of the candidate sequences. The adaptive scaling element is further configured to estimate an objective function based on the user input and to modify the adaptive scaling factor based on the estimated objective function.
摘要:
This invention relates to a framework for converting a source speech signal associated with a source voice into a target speech signal that is a representation of the source speech signal associated with a target voice. The source speech signal is encoded into samples of encoding parameters, wherein the encoding comprises the step of segmenting the source speech signal into segments based on characteristics of the source speech signal. The samples of the encoding parameters, or a converted representation of the samples of the encoding parameters are then decoded to obtain the target speech signal. Therein, in the encoding, the decoding or in a separate step, samples of parameters related to the source speech signal are converted into samples of parameters related to the target speech signal. Therein, at least one of the encoding and the converting depends on the segments of the source speech signal.
摘要:
In the concatenative text-to-speech system, high compression rate of duration data in the prosodic template is achieved by extracting statistical parameters describing behavior of actual duration values of instances of each given syllable, phoneme, half-phoneme, diphone, triphone or any other basic speech unit employed, and storing only the extracted statistical parameters, instead of the original duration values. Entries of each given basic unit in the prosodic template is sorted and indexed in the order of increasing duration value. Consequently, the amount of duration data can be significantly reduced, while keeping the error statistically under acceptable range.
摘要:
The exemplary embodiments of the invention provide at least a method, apparatus and system to perform operations including receiving context data from an electronic device, causing, at least in part based on the received context data, an identification of at least one context model compatible with the electronic device, and causing, at least in part, provision of the electronic device with the at least one compatible context model. In addition, the exemplary embodiments of the invention further provide at least a method, apparatus and system to perform operations including causing, at least in part, a provision of context data associated with an electronic device to a context inference service, in response, receiving a context model from the context inference service, and causing adaptation of the received context model as a current context model of the electronic device.
摘要:
Methods and apparatuses are provided for user interest modeling. A method may include receiving an input from a user for specifying one or more topics from among a predetermined hierarchy of topics and subtopics. The method may additionally include retrieving one or more documents associated with the user and extracting language tokens from the documents based, at least in part, on the specified topics. Corresponding apparatuses are also provided.
摘要:
A method for personalized location privacy recommendation comprises: obtaining information of one or more locations for a user; collecting features of the one or more locations; and recommending respective privacy levels of the one or more locations automatically based at least in part on the information and the features.
摘要:
An approach is provided for providing group context sensing and inference. The group context platform determines at least one group of one or more devices that have one or more group contexts that are at least substantially similar, at least substantially correlated, or a combination thereof. Next, the group context platform causes, at least in part, a distribution of one or more context sensing tasks among the one or more devices of the at least one group. Then, the group context platform processes and/or facilitates a processing of one or more results of the one or more context sensing tasks to (a) modify the one or more group contexts; (b) enhance the one or more group contexts; (c) determine one or more other group contexts; or (d) a combination thereof.
摘要:
An approach is provided for determining significant places with greatly improved accuracy using universally available identifier information. A significant place platform causes, at least in part, a mapping of one or more communication coverage areas associated with one or more identifiers onto at least one geo-grid, wherein the one or more identifiers are associated with at least one device operating within at least one communication network. The significant place platform further processes the one or more identifiers to determine one or more significance scores associated with one or more grid units of the at least one geo-grid. The significant place platform also determines at least one significant place based, at least in part, on the one or more significance scores.
摘要:
An approach is provided for providing hub-based indexing and services. The hub-based platform causes, at least in part, an indexing of location-based content according to one or more location hubs of one or more transportation lines, Next, the hub-based platform determines a current proximity, a predicted proximity, or a combination thereof of one or more devices to the one or more location hubs, wherein the one or more devices are (a) traveling on the one or more transportation lines, (b) predicted to travel on the one or more transportation lines, or (c) a combination thereof. Then, the hub-based platform causes, at least in part, a presentation of at least a portion of the location-based content based, at least in part, on the current proximity, the predicted proximity, or a combination thereof.
摘要:
An approach for recommending location-based content items that account for locations with ease of access based on available transportation options is described. A content recommender platform determines one or more predicted locations of a user based, at least in part, on an ease of access from a location associated with the use. The content recommender platform determines one or more location-based content items associated with the one or more predicted locations, the location, or a combination thereof. The content recommender platform determines one or more recommended content items from among the one or more location-based content items. In this way, the recommended content items may be easily accessible and may accord with the user's preferences.