摘要:
An intelligent query system for processing voiced-based queries is disclosed, which uses a combination of both statistical and semantic based processing to identify the question posed by the user by understanding the meaning of the user's utterance. Based on identifying the meaning of the utterance, the system selects a single answer that best matches the user's query. The answer that is paired to this single question is then retrieved and presented to the user. The system, as implemented, accepts environmental variables selected by the user and is scalable to provide answers to a variety and quantity of user-initiated queries.
摘要:
Systems and methods for query-based searching using spoken input are disclosed. In systems and methods according to embodiments of the invention, continuous speech natural language queries are accepted from a user using a client device. Speech processing tasks are divided between the client device and one or more server systems. Once user speech is recognized, the system searches one or more data repositories containing queries for at least one query that matches the recognized speech and returns information related to the query.
摘要:
A speech-enabled WWW based computing system allows a user to interact with content associated with a web page and select items of interest using speech as a mode of input. Dynamic grammars can assist in the recognition operations to improve speed and comprehension.
摘要:
An intelligent query system for processing voiced-based queries is disclosed. This distributed client-server system, typically implemented on an intranet or over the Internet accepts a user's queries at his/her computer, PDA or workstation using a speech input interface. After converting the user's query from speech to text, a natural language engine, a database processor and a full-text SQL database is implemented to find a single answer that best matches the user's query. Both statistical and semantic decoding are used to assist and improve the performance of the query recognition.
摘要:
Sentence based queries from a user are analyzed using a natural language engine to determine appropriate answers from an electronic database. The system and methods are useful for Internet based search engines, as well as distributed speech recognition systems such as a client-server system. The latter are typically implemented on an intranet or over the Internet based on user queries at his/her computer, a PDA, or a workstation using a speech input interface.
摘要:
Speech data is processed with one or more dynamic grammars, to reduce latency and improve accuracy. Different speech grammars are used by a speech recognition process depending on a context experienced by a speaker, and sentence grammars are similarly varied during a natural language process. The methods are useful for distributed speech recognition systems such as a client-server system, typically implemented on an intranet or over the Internet based on user queries at his/her computer, a PDA, or a workstation using a speech input interface.
摘要:
A speech recognition system uses speech recognition models which are specifically trained and optimized for users residing in a particular geographic area or region. The speech models are trained with samples of word variants expected to be used in a natural language by representative members of a population associated with the geographic region or community of users. The speech recognition system is configured to have a real-time response that imitates a dialogue with a human operator.
摘要:
A speech-enabled internet based computing system includes a configurable speech recognition engine used for interacting with content on a web accessible page. The speech recognition engine is distributed across a client and server architecture, and is adaptive so that speech processing operations can be allocated as needed between the two. This allows for support for client devices having differing computing capabilities. Natural language operations can also be supported as desired. A user can thus interact with a web page and select items of interest using speech as a mode of input. Dynamic grammars can assist in the recognition operations to improve speed and comprehension.
摘要:
A speech recognition engine is configured to permit partial or full recognition of an utterance at a client device and/or a server device. This allows for support for client devices having differing computing capabilities. Natural language operations can also be supported as desired. A user can thus interact with content also accessible at a web page and select items of interest using speech as a mode of input. Dynamic grammars can assist in the recognition operations to improve speed and comprehension.
摘要:
A prosody analyzer enhances the interpretation of natural language utterances. The analyzer is distributed over a client/server architecture, so that the scope of emotion recognition processing tasks can be allocated on a dynamic basis based on processing resources, channel conditions, client loads etc. The partially processed prosodic data can be sent separately or combined with other speech data from the client device and streamed to a server for a real-time response. Training of the prosody analyzer with real world expected responses improves emotion modeling and the real-time identification of potential features such as emphasis, intent, attitude and semantic meaning in the speaker's utterances.