摘要:
Systems and methods provide for indexing audio content by fusing the indexes derived from a keyword stream and a large vocabulary stream search. For example, systems and methods provide for two stream searching of Spoken Web VoiceSites, wherein metadata is extracted from the VoiceSite and is used to determine a set of keywords for high precision search while a traditional standard vocabulary set is used to perform a high results, low precision search. The results of the keyword search and the standard vocabulary search are fused together to form a comprehensive, ranked list of results.
摘要:
Systems and methods provide for indexing audio content by fusing the indexes derived from a keyword stream and a large vocabulary stream search. For example, systems and methods provide for two stream searching of Spoken Web VoiceSites, wherein metadata is extracted from the VoiceSite and is used to determine a set of keywords for high precision search while a traditional standard vocabulary set is used to perform a high results, low precision search. The results of the keyword search and the standard vocabulary search are fused together to form a comprehensive, ranked list of results.
摘要:
A method comprises crawling and indexing voice sites and storing results in an index; receiving a search request in voice from a user via a telephone; performing speech recognition on the voice search request and converting the request from voice to text; parsing the query; and performing a search on the index and ranking the search results. Search results may be filtered based on attributes such as location and context. Filtered search results may be presented to the user in categories to enable easy voice browsing of the search results by the user. Computer program code and systems are also provided.
摘要:
A method includes crawling and indexing voice sites and storing results in an index; receiving a search request in voice from a user via a telephone; performing speech recognition on the voice search request and converting the request from voice to text; parsing the query; and performing a search on the index and ranking the search results. Search results may be filtered based on attributes such as location and context. Filtered search results may be presented to the user in categories to enable easy voice browsing of the search results by the user. Computer program code and systems are also provided.
摘要:
Methods and arrangements for creating and building subcategories in a user-based voice application. User posts are appended to a category in a voice application and subdivision of the category is prompted upon satisfaction of a first condition. The category is designated as a parent category, and at least one subcategory nomination are solicited from users. User input is solicited on appending at least one nominated subcategory to the parent category, and a nominated subcategory is appended to the parent category upon satisfaction of a second condition. User input is solicited to reassign a user post from the parent category to at least one subcategory.
摘要:
Computer-implemented methods, computer program products and computer systems that render display of a video on a first page, that has been occluded from view by the subsequent viewing of a second page, to viewing on such subsequently viewed second page. Source location of the video is identified from source code of the first page, the identified video source is retrieved, and then it is input into source code of the second page to transfer the video to the second page. The video may then me displayed on the second page. The computer program products and computer systems both include a number of program instructions for effecting the transfer of the video from the first page to the second page for the continuous unobstructed or non-occluded viewing thereof.
摘要:
Methods and arrangements for visually representing audio content in a voice application. A display is connected to a voice application, and an image is displayed on the display, the image comprising a main portion and at least one subsidiary portion, the main portion representing a contextual entity of the audio content and the at least one subsidiary portion representing at least one participatory entity of the audio content. The at least one subsidiary portion is displayed without text, and the image is changed responsive to changes in audio content in the voice application.
摘要:
Methods and arrangements for visually representing audio content in a voice application. A display is connected to a voice application, and an image is displayed on the display, the image comprising a main portion and at least one subsidiary portion, the main portion representing a contextual entity of the audio content and the at least one subsidiary portion representing at least one participatory entity of the audio content. The at least one subsidiary portion is displayed without text, and the image is changed responsive to changes in audio content in the voice application.
摘要:
A phonetic vocabulary for a speech recognition system is adapted to a particular speaker's pronunciation. A speaker can be attributed specific pronunciation styles, which can be identified from specific pronunciation examples. Consequently, a phonetic vocabulary can be reduced in size, which can improve recognition accuracy and recognition speed.
摘要:
Systems and associated methods configured to provide user-driven audio content navigation for the spoken web are described. Embodiments allow users to skim audio for content that seems to be of relevance to the user, similar to visual skimming of standard web pages, and mark point of interest within the audio. Embodiments provide techniques for navigating audio content while interacting with information systems in a client-server environment, where the client device can be a simple, standard telephone.