摘要:
A non-transitory processor-readable medium stores code representing instructions to be executed by a processor. The code causes the processor to receive a request from a user of a client device to initiate a speech recognition engine for a web page displayed at the client device. In response to the request, the code causes the processor to (1) download, from a server associated with a first party, the speech recognition engine into the client device; and then (2) analyze, using the speech recognition engine, content of the web page including text in an identified language to produce analyzed content based on the identified language, where the content of the web page is received from a server associated with a second party. The code further causes the processor to send a signal to cause the client device to present the analyzed content to the user at the client device.
摘要:
In some embodiments, a method includes measuring a disparity between two speech samples by segmenting both a reference speech sample and a student speech sample into speech units. A duration disparity can be determined for units that are not adjacent to each other in the reference speech sample. A duration disparity can also be determined for the corresponding units in the student speech sample. A difference can then be calculated between the student speech sample duration disparity and the reference speech sample duration disparity.
摘要:
The content of an instructor-student interaction set in an automated teaching system is represented in a graph-based format. In a graph-based representation, not only can variations branch away from each other at a node (branching point), as in the tree-based representation, but they can also merge back together. Not only does this make the -structure more compact, but it increases the number of variations that can be represented in the content while simultaneously eliminating the need to individually author each variation.
摘要:
A modularized computer-aided language learning system utilizing a unique user interface and modularized presentation modules to assist users to learn a language. The system presents a presentation module including a first description of a presentation subject and a placeholder indicating that a second description of the presentation subject is missing from the presentation module. Each of the first description of the presentation subject and the second description of the presentation subject is one of a textual type, a visual type and an audio type. Separated from the presentation module, the system presents the second description of the presentation subject, and receives a user input indicating an association of the presented second description to presentation module related to the presentation subject. Feedback is provided indicating the correctness of the association. The disclosure also describes a unique program design approach to form training programs using the presentation modules.
摘要:
A student providing a multi-word response in a computerized language teaching system provides a manual input concurrently with each responsive word. For example, he might enter a keystroke correspondent to the first letter of each word. When using the teaching computer silently, a student will typically “speak” each word mentally as he enters a keystroke, so the limited experience is almost as effective as speaking out loud. When a student types one or more keystrokes concurrently with each word that he speaks, the computer will be able to detect when a student is responding with a correct word, but merely mispronouncing it. Also, since the computer will receive a keystroke as the student starts each new word, it is better able to distinguish the boundaries between words and recognize them more reliably.
摘要:
A system and method for language instruction is provided. In an embodiment, a method of language instruction is provided which comprises presenting a first description of an event responsive to a first perspective and presenting a second description of the event responsive to a second perspective, wherein the first description of the event and the second description of the event are in a common language. The first and second descriptions of the event can be provided in a variety of formats, such as in audio format or as text.