摘要:
As a step in finding the one most likely word sequence in a spoken language system, an N-best search is conducted to find the N most likely sentence hypotheses. During the search, word theories are distinguished based only on the one previous word. At each state within a word, the total probability is calculated for each of a few previous words. At the end of each word, the probability score is recorded for each previous word theory, together with the name of the previous word. At the end of the sentence, a recursive traceback is performed to derive the list of the N best sentences.
摘要:
The invention provides a method of large vocabulary speech recognition that employs a single tree-structured phonetic hidden Markov model (HMM) at each frame of a time-synchronous process. A grammar probability is utilized upon recognition of each phoneme of a word, before recognition of the entire word is complete. Thus, grammar probabilities are exploited as early as possible during recognition of a word. At each frame of the recognition process, a grammar probability is determined for the transition from the most likely preceding grammar state to a set of words that share at least one common phoneme. The grammar probability is combined with accumulating phonetic evidence to provide a measure of the likelihood that a state in the HMM will lead to the word most likely to have been spoken. In a preferred embodiment, phonetic context information is exploited, even before the complete context of a phoneme is known. Instead of an exact triphone model, wherein the phonemes previous and subsequent to a phoneme are considered, a composite triphone model is used that exploits partial phonetic context information to provide a phonetic model that is more accurate than aphonetic model that ignores context. In another preferred embodiment, the single phonetic tree method is used as the forward pass of a forward/backward recognition process, wherein the backward pass employs a recognition process other than the single phonetic tree method.
摘要:
The invention provides a method for both the positive and negative selection of at least one mammalian cell population from a mixture of cell populations utilizing a magnetically stabilized fluidized bed. One desirable application of this method is the separation and purification of mammalian hematopoietic cells. Target cell populations include human stem cells.
摘要:
Methods and systems for providing an improved IR system that performs information retrieval by using probabilities. When performing information retrieval, the improved IR system utilizes both the prior probability that a document is relevant independent of the query as well as the probability that the query was generated by a particular document given that the particular document is relevant. By using these probabilities, the improved IR system retrieves documents in a more accurate manner than conventional systems which are based on an ad hoc approach.
摘要:
Methods, compositions and devices are provided for the growth of human stem and/or hematopoietic cells in culture. Bioreactors are provided in which diverse cell types are simultaneously-cultured in the presence of appropriate levels of nutrients and growth factors substantially continuously maintained in the bioreactor while removing undesirable metabolic products. This simultaneous culture of multiple cell types successfully reconstructs hematopoietic tissue ex vivo. Optionally, at least one growth factor is provided through excretion by transfected stromal cells, particularly heterologous cells. The invention also allows for the separate maintenance of stromal and hematopoietic cells, and to allow for harvesting of both the adherent and non-adherent cells.
摘要:
A real-time or streaming speech processing system and method is disclosed with capabilities distributed between and client and a server where the server may be reached via the Internet. The speech processing entails digitizing and converting the utterances to features extracted to help the processing. The features are sent via a communications channel to the server where the recognition occurs. The features extracted allow low bandwidth channels to be used with still maintaining real-time response. The recognizer will determine the most likely text representing the utterances and return the text to the client. The system can be used to identify and/or verify who is speaking.
摘要:
The invention provides a method for both the positive and negative selection of at least one mammalian cell population from a mixture of cell populations utilizing a magnetically stabilized fluidized bed. One desirable application of this method is the separation and purification of hematopoietic cells. Target cell populations include human stem cells.
摘要:
Devices are provided for the growth of human stem and/or hematopoietic cells in culture. Bioreactors are provided in which diverse cell types are simultaneously-cultured in the presence of appropriate levels of nutrients and growth factors substantially continuously maintained in the bioreactor while removing undesirable metabolic products. This simultaneous culture of multiple cell types successfully reconstructs hematopoietic tissue ex vivo. Optionally, at least one growth factor is provided through excretion by transfected stromal cells, particularly heterologous cells. The stromal cells and hematopoietic cells are maintained separately and both the adherent and non-adherent cells are harvested.
摘要:
A language-independent and segment free OCR system and method comprises a unique feature extraction approach which represents two dimensional data relating to OCR as one independent variable (specifically the position within a line of text in the direction of the line) so that the same CSR technology based on HMMs can be adapted in a straightforward manner to recognize optical characters. After a line finding stage, followed by a simple feature-extraction stage, the system can utilize a commercially available CSR system, with little or no modification, to perform the recognition of text by and training of the system. The whole system, including the feature extraction, training, and recognition components, are designed to be independent of the script or language of the text being recognized. The language-dependent parts of the system are confined to the lexicon and training data. Furthermore, the method of recognition does not require pre-segmentation of the data at the character and/or word levels, neither for training nor for recognition. In addition, a language model can be used to enhance system performance as an integral part of the recognition process and not as a post-process, as is commonly done with spell checking, for example.