Abstract:
A system and method are disclosed that improve automatic speech recognition in a spoken dialog system. The method comprises partitioning speech recognizer output into self-contained clauses, identifying a dialog act in each of the self-contained clauses, qualifying dialog acts by identifying a current domain object and/or a current domain action, and determining whether further qualification is possible for the current domain object and/or current domain action. If further qualification is possible, the method comprises identifying another domain action and/or another domain object associated with the current domain object and/or current domain action, reassigning that other domain action and/or domain object as the current domain action and/or current domain object, and then recursively qualifying the new current domain action and/or current domain object. This process continues until nothing is left to qualify.
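The recursive qualification step can be illustrated with a minimal sketch. It assumes a hypothetical association table (`ASSOCIATIONS`) and a clause already reduced to a set of tokens; neither the table contents nor the function name `qualify` come from the abstract, and clause partitioning and dialog act identification are not shown.

```python
# Hypothetical association table: each domain action/object lists the
# items that may qualify it further. The entries are illustrative only.
ASSOCIATIONS = {
    "request": ["refund", "information"],
    "refund": ["charge"],
    "charge": ["phone_bill"],
}


def qualify(current, clause_tokens, chain=None):
    """Recursively qualify `current` using associated items found in the clause.

    Returns the chain of domain actions/objects, most general first.
    """
    chain = [current] if chain is None else chain
    for candidate in ASSOCIATIONS.get(current, []):
        # Further qualification is possible only if an associated item
        # appears in the clause and has not been used already.
        if candidate in clause_tokens and candidate not in chain:
            chain.append(candidate)
            # The associated item becomes the new current item.
            return qualify(candidate, clause_tokens, chain)
    # Nothing is left to qualify.
    return chain


if __name__ == "__main__":
    tokens = {"refund", "charge", "phone_bill"}
    print(qualify("request", tokens))
```

The recursion stops as soon as the current item has no associated item present in the clause, which corresponds to the point where nothing is left to qualify.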
Abstract:
Disclosed herein are systems and methods to incorporate human knowledge when developing and using statistical models for natural language understanding. The disclosed systems and methods embrace a data-driven approach to natural language understanding which progresses seamlessly along the continuum of availability of annotated collected data, from when there is no available annotated collected data to when there is any amount of annotated collected data.
Abstract:
An arrangement for allowing “hands-free” access to DTMF-controlled systems, such as a user's voice mail messaging system, utilizes a speech-to-DTMF tone application that monitors the communication between the user and the DTMF-controlled system. A speech recognition unit is utilized to recognize certain voice commands (e.g., “next”, “skip”, “repeat”, “forward”, etc.) when uttered by the user. The application then translates the received commands into the proper DTMF tone sequence used by the DTMF-controlled system and transmits the DTMF tones to the system. The application is particularly useful in the cell phone environment and spares the user from constantly switching between using the keypad and listening to messages/commands from the system.
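The translation from a recognized command to a DTMF tone sequence might look like the following sketch. The command-to-key table `COMMAND_TO_KEYS` is an assumption made for illustration, since key assignments vary between voice mail systems; the frequency pairs are the standard DTMF keypad frequencies. Speech recognition and tone transmission are not shown.

```python
# Hypothetical mapping from spoken commands to keypad presses; real
# DTMF-controlled systems define their own key assignments.
COMMAND_TO_KEYS = {
    "next": "6",
    "skip": "33",
    "repeat": "4",
    "forward": "8",
}

# Standard DTMF (low Hz, high Hz) frequency pair for each keypad key.
DTMF_FREQS = {
    "1": (697, 1209), "2": (697, 1336), "3": (697, 1477),
    "4": (770, 1209), "5": (770, 1336), "6": (770, 1477),
    "7": (852, 1209), "8": (852, 1336), "9": (852, 1477),
    "*": (941, 1209), "0": (941, 1336), "#": (941, 1477),
}


def translate(recognized_word):
    """Return the DTMF frequency pairs for a recognized command.

    Words that are not known commands produce an empty sequence, so
    ordinary speech is ignored.
    """
    keys = COMMAND_TO_KEYS.get(recognized_word.lower(), "")
    return [DTMF_FREQS[key] for key in keys]


if __name__ == "__main__":
    for word in ("next", "skip", "hello"):
        print(word, translate(word))
```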
Abstract:
A clausifier and method of extracting clauses for spoken language understanding are disclosed. The method relates to generating a set of clauses from speech utterance text and comprises inserting at least one boundary tag in speech utterance text related to sentence boundaries, inserting at least one edit tag indicating a portion of the speech utterance text to remove, and inserting at least one conjunction tag within the speech utterance text. The result is a set of clauses that may be identified within the speech utterance text according to the inserted at least one boundary tag, at least one edit tag and at least one conjunction tag. The disclosed clausifier comprises a sentence boundary classifier, an edit detector classifier, and a conjunction detector classifier. The clausifier may comprise a single classifier or a plurality of classifiers to perform the steps of identifying sentence boundaries, editing text, and identifying conjunctions within the text.
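As a rough illustration of the final step, the sketch below recovers clauses from text that has already been tagged. The tag spellings (`<s>` for a sentence boundary, `<e>...</e>` around text to remove, `<c>` before a conjunction) and the function name `extract_clauses` are assumptions, not the notation of the disclosure, and the classifiers that insert the tags are not shown.

```python
import re


def extract_clauses(tagged_text):
    """Split tagged utterance text into clauses.

    Assumed tags: <e>...</e> marks a span to delete, <s> marks a sentence
    boundary, and <c> marks a conjunction that starts a new clause.
    """
    # Remove the spans marked by edit tags.
    text = re.sub(r"<e>.*?</e>", " ", tagged_text)
    # Break the text at boundary tags and conjunction tags.
    parts = re.split(r"<s>|<c>", text)
    # Normalize whitespace and drop empty pieces.
    return [" ".join(part.split()) for part in parts if part.strip()]


if __name__ == "__main__":
    tagged = ("i want to <e>uh i want to</e> pay my bill "
              "<c> and check my balance <s> thank you")
    for clause in extract_clauses(tagged):
        print(clause)
```

In this sketch the conjunction word stays at the head of the clause it introduces; dropping it instead would be an equally plausible design choice.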
Abstract:
A system and method to provide content and call attributes for a destination phone number using a click-to-dial connection include accepting a query, retrieving links to a document, and searching through cached data using the query to identify relevant or destination number information. If no match is found, the method continues with accessing the document identified by the link to identify relevant number information and creating a click-to-dial icon that links to the relevant numbers included in the phone number information identified by the query. Next, a popup box for content relevant to the click-to-dial icon and a click feature on the popup box are created to retrieve call destination attributes for viewing by a user. Further, computer instructions create at least one communication connection between two communication devices after viewing call destination attributes.
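The cache-first lookup with a fallback to the linked document can be sketched as follows. The names `find_numbers`, `click_to_dial_link`, and the injected `fetch` callable are hypothetical, the phone number pattern is a deliberately loose assumption, and the `tel:` link merely stands in for the click-to-dial icon; the popup box and call setup are not shown.

```python
import re

# Loose, illustrative pattern for phone-number-like strings.
PHONE_RE = re.compile(r"\+?\d[\d\-\s().]{6,}\d")


def find_numbers(query, link, cache, fetch):
    """Return phone numbers for `query`, consulting the cache first.

    `cache` maps queries to lists of numbers; `fetch` is any callable that
    returns the text of the document behind `link` (an assumption here).
    """
    if query in cache:
        return cache[query]
    # No match in the cache: access the linked document instead.
    document = fetch(link)
    numbers = [match.group().strip() for match in PHONE_RE.finditer(document)]
    cache[query] = numbers
    return numbers


def click_to_dial_link(number):
    """Build a tel: hyperlink as a stand-in for a click-to-dial icon."""
    digits = re.sub(r"[^\d+]", "", number)
    return f'<a href="tel:{digits}">{number}</a>'


if __name__ == "__main__":
    cache = {}
    fake_fetch = lambda link: "Call us at 555-123-4567 today."
    for number in find_numbers("pizza", "http://example.invalid/doc", cache, fake_fetch):
        print(click_to_dial_link(number))
```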