摘要:
A system to determine positions of mobile computing devices and provide direction information includes a first mobile computing device configured to broadcast a first chirp signal, a second mobile computing device configured to broadcast a second chirp signal indicating receipt of the first chirp signal and a first time information about when the first chirp signal is received, and a third mobile computing device configured broadcast a third chirp signal indicating (a) receipt of the first and second chirp signals and (b) a second time information about when the first and second chirp signals are received. The first mobile computing device is configured to use the first and second time information to determine a position of the second mobile computing device. The first mobile computing device is also configured to transmit text messages to the second mobile computing device to direct a user of the second mobile computing device to a position of a user of the first mobile computing device.
摘要:
Methods for implementing shared experiences using mobile computing devices comprise capturing audio waves associated with a media using a built-in microphone of a mobile computing device, the mobile computing device including a processor, a memory, a display screen, a built-in battery to power the mobile computing device, and a built-in communication module to enable wireless communication. A signature is generated from the audio waves captured by the microphone. Based on the signature being recognized as a known signature, the signature and positioning information are transmitted to an audio server using the wireless communication. The positioning information identifies a specific moment in the media that a user of the mobile computing device is listening, the audio server and the mobile computing device connected to a network. Activity information is received from the audio server. The activity information is related to the media and associated with a third party server connected to the network. The user of the mobile computing device is enabled to use the activity information to interact with the third party server.
摘要:
A language identification system that includes a universal phoneme decoder (UPD) is described. The UPD contains a universal phoneme set representing both 1) all phonemes occurring in the set of two or more spoken languages, and 2) captures phoneme correspondences across languages, such that a set of unique phoneme patterns and probabilities are calculated in order to identify a most likely phoneme occurring each time in the audio files in the set of two or more potential languages in which the UPD was trained on. Each statistical language models (SLM) uses the set of unique phoneme patterns created for each language in the set to distinguish between spoken human languages in the set of languages. The run-time language identifier module identifies a particular human language being spoken by utilizing the linguistic probabilities supplied by the one or more SLMs that are based on the set of unique phoneme patterns created for each language.
摘要:
Various methods and apparatus are described for a speech recognition system. In an embodiment, the statistical language model (SLM) provides probability estimates of how linguistically likely a sequence of linguistic items are to occur in that sequence based on an amount of times the sequence of linguistic items occurs in text and phrases in general use. The speech recognition decoder module requests a correction module for one or more corrected probability estimates P′(z|xy) of how likely a linguistic item z follows a given sequence of linguistic items x followed by y, where (x, y, and z) are three variable linguistic items supplied from the decoder module. The correction module is trained to linguistics of a specific domain, and is located in between the decoder module and the SLM in order to adapt the probability estimates supplied by the SLM to the specific domain when those probability estimates from the SLM significantly disagree with the linguistic probabilities in that domain.
摘要:
A method, apparatus, and system are described for a continuous speech recognition engine that includes a fine speech recognizer model, a coarse sound representation generator, and a coarse match generator. The fine speech recognizer model receives a time coded sequence of sound feature frames, applies a speech recognition process to the sound feature frames and determines at least a best guess at each recognizable word that corresponds to the sound feature frames. The coarse sound representation generator generates a coarse sound representation of the recognized word. The coarse match generator determines a likelihood of the coarse sound representation actually being the recognized word based on comparing the coarse sound representation of the recognized word to a database containing the known sound of that recognized word and assigns the likelihood as a robust confidence level parameter to that recognized word.
摘要:
A method, apparatus, and system are described for a continuous speech recognition engine that includes a fine speech recognizer model, a coarse sound representation generator, and a coarse match generator. The fine speech recognizer model receives a time coded sequence of sound feature frames, applies a speech recognition process to the sound feature frames and determines at least a best guess at each recognizable word that corresponds to the sound feature frames. The coarse sound representation generator generates a coarse sound representation of the recognized word. The coarse match generator determines a likelihood of the coarse sound representation actually being the recognized word based on comparing the coarse sound representation of the recognized word to a database containing the known sound of that recognized word and assigns the likelihood as a robust confidence level parameter to that recognized word.
摘要:
Methods for implementing shared experiences using mobile computing devices comprise capturing audio waves associated with a media using a built-in microphone of a mobile computing device, the mobile computing device including a processor, a memory, a display screen, a built-in battery to power the mobile computing device, and a built-in communication module to enable wireless communication. A signature is generated from the audio waves captured by the microphone. Based on the signature being recognized as a known signature, the signature and positioning information are transmitted to an audio server using the wireless communication. The positioning information identifies a specific moment in the media that a user of the mobile computing device is listening, the audio server and the mobile computing device connected to a network. Activity information is received from the audio server. The activity information is related to the media and associated with a third party server connected to the network. The user of the mobile computing device is enabled to use the activity information to interact with the third party server.
摘要:
A language identification system that includes a universal phoneme decoder (UPD) is described. The UPD contains a universal phoneme set representing both 1) all phonemes occurring in the set of two or more spoken languages, and 2) captures phoneme correspondences across languages, such that a set of unique phoneme patterns and probabilities are calculated in order to identify a most likely phoneme occurring each time in the audio files in the set of two or more potential languages in which the UPD was trained on. Each statistical language model (SLM) uses the set of unique phoneme patterns created for each language in the set to distinguish between spoken human languages in the set of languages. The run-time language identifier module identifies a particular human language being spoken by utilizing the linguistic probabilities supplied by the SLMs that are based on the set of unique phoneme patterns created for each language.
摘要:
A system to determine positions of mobile computing devices and provide direction information includes a first mobile computing device configured to broadcast a first chirp signal, a second mobile computing device configured to broadcast a second chirp signal indicating receipt of the first chirp signal and a first time information about when the first chirp signal is received, and a third mobile computing device configured broadcast a third chirp signal indicating (a) receipt of the first and second chirp signals and (b) a second time information about when the first and second chirp signals are received. The first mobile computing device is configured to use the first and second time information to determine a position of the second mobile computing device. The first mobile computing device is also configured to transmit text messages to the second mobile computing device to direct a user of the second mobile computing device to a position of a user of the first mobile computing device.
摘要:
A language identification system that includes a universal phoneme decoder (UPD) is described. The UPD contains a universal phoneme set representing both 1) all phonemes occurring in the set of two or more spoken languages, and 2) captures phoneme correspondences across languages, such that a set of unique phoneme patterns and probabilities are calculated in order to identify a most likely phoneme occurring each time in the audio files in the set of two or more potential languages in which the UPD was trained on. Each statistical language model (SLM) uses the set of unique phoneme patterns created for each language in the set to distinguish between spoken human languages in the set of languages. The run-time language identifier module identifies a particular human language being spoken by utilizing the linguistic probabilities supplied by the SLMs that are based on the set of unique phoneme patterns created for each language.