-
Publication No.: US11790227B1
Publication date: 2023-10-17
Application No.: US17148742
Filing date: 2021-01-14
IPC classification: G06N3/08, G06F40/284, G06F40/232
CPC classification: G06N3/08, G06F40/232, G06F40/284
Abstract: Systems and methods are disclosed for automatically scoring a constructed response using a neural network. In embodiments, a constructed response received by a processing system may be processed to divide the constructed response into multiple series of word tokens, wherein each word token includes a sequence of characters. The constructed response may be further processed to correct one or more spelling errors. The word tokens may be encoded to generate representation vectors for the constructed response. A set of nonlinear operations may be applied to the plurality of representation vectors in a neural network to generate a single vector output. A set of predetermined network weights may be applied to the vector output of the neural network to generate a scalar output for scoring the constructed response.
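The data flow in this abstract (tokens, representation vectors, a single pooled vector, a scalar score) can be illustrated with a minimal sketch. It is not the patented network: `encode` is a hypothetical stand-in that derives a fixed pseudo-random vector from each token, the nonlinear step is reduced to `tanh` followed by averaging, the spelling-correction step is omitted, and the weights passed to `score` are arbitrary rather than trained.

```python
import math
import random


def tokenize(text):
    """Split a response into lowercase word tokens."""
    return text.lower().split()


def encode(token, dim):
    """Hypothetical encoder: a stable pseudo-random vector per token."""
    rng = random.Random(token)  # seeding with the token keeps the vector reproducible
    return [rng.uniform(-1.0, 1.0) for _ in range(dim)]


def pool(vectors):
    """Apply tanh to every component, then average the vectors into one."""
    dim = len(vectors[0])
    return [sum(math.tanh(v[i]) for v in vectors) / len(vectors) for i in range(dim)]


def score(text, weights, bias=0.0):
    """Map a response to a scalar: weighted sum of the pooled vector plus a bias."""
    tokens = tokenize(text)
    if not tokens:
        return bias  # nothing to encode, so only the bias remains
    pooled = pool([encode(tok, len(weights)) for tok in tokens])
    return sum(w * x for w, x in zip(weights, pooled)) + bias


if __name__ == "__main__":
    print(score("The cell divides by mitosis", [0.5] * 8))
```

In a real system the encoder and the weights would come from training on scored responses; here they only show where each stage sits in the pipeline.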
-
Publication No.: US11455999B1
Publication date: 2022-09-27
Application No.: US16844439
Filing date: 2020-04-09
Inventors: Xinhao Wang, Su-Youn Yoon, Keelan Evanini, Klaus Zechner, Yao Qian
Abstract: Data is received that encapsulates a spoken response to a prompt text comprising a string of words. Thereafter, the received data is transcribed into a string of words. The string of words is then compared with a prompt so that a similarity grid representation of the comparison can be generated that characterizes a level of similarity between the string of words in the spoken response and the string of words in the prompt text. The grid representation is then scored using at least one machine learning model. The score indicates a likelihood of the spoken response having been off-topic. Data providing the encapsulated score can then be provided. Related apparatus, systems, techniques and articles are also described.
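A similarity grid of the kind described here can be sketched as a matrix with one row per response word and one column per prompt word. The sketch below makes two loud assumptions: similarity is exact word match (1.0 or 0.0), and the machine learning model is replaced by a simple coverage heuristic, so `off_topic_score` is an illustrative stand-in rather than the scoring model in the patent.

```python
def similarity_grid(response_words, prompt_words):
    """Rows are response words, columns are prompt words; 1.0 marks an exact match."""
    return [[1.0 if r == p else 0.0 for p in prompt_words] for r in response_words]


def off_topic_score(grid):
    """Heuristic stand-in for a model: share of response words with no match in the prompt."""
    if not grid:
        return 1.0  # an empty response shares nothing with the prompt
    matched = sum(1 for row in grid if any(cell > 0 for cell in row))
    return 1.0 - matched / len(grid)


if __name__ == "__main__":
    prompt = "describe your favorite city and why".split()
    on_topic = "describe your favorite city".split()
    off_topic = "i like pizza".split()
    print(off_topic_score(similarity_grid(on_topic, prompt)))
    print(off_topic_score(similarity_grid(off_topic, prompt)))
```

A softer similarity, such as one based on word embeddings, would fill the grid with values between 0 and 1 and give a learned model more signal to work with.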
-
Publication No.: US11417339B1
Publication date: 2022-08-16
Application No.: US16695348
Filing date: 2019-11-26
Inventors: Xinhao Wang, Keelan Evanini, Yao Qian, Klaus Zechner
IPC classification: G10L15/26, G10L15/197, G10L25/51, G10L15/16
Abstract: Data is received that encapsulates a spoken response to a test question. Thereafter, the received data is transcribed into a string of words. The string of words is then compared with at least one source string so that a similarity grid representation of the comparison can be generated that characterizes a level of similarity between the string of words and the at least one source string. The grid representation is then scored using at least one machine learning model. The score indicates a likelihood of the spoken response having been plagiarized. Data providing the encapsulated score can then be provided. Related apparatus, systems, techniques and articles are also described.
-
Publication No.: US11049409B1
Publication date: 2021-06-29
Application No.: US14974721
Filing date: 2015-12-18
Abstract: Systems and methods are provided for automatically scoring a response and statistically revaluating whether it can be considered as aberrant. In one embodiment, a constructed response is evaluated via a pre-screening stage and a post-hoc screening stage. The pre-screening stage attempts to determine whether the constructed response is aberrant based on a variety of aberration metrics and criteria. If the constructed response is deemed not to be aberrant, then the post-hoc screening stage attempts to predict a discrepancy between what score an automated scoring system would assign and what score a human rater would assign to the response. If the discrepancy is sufficiently low, then the constructed response may be scored by an automated scoring engine. On the other hand, if the constructed response fails to pass either of the two stages, then a flag may be raised to indicate that additional human review may be needed.
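The two-stage routing described in this abstract can be sketched as a pair of checks. Everything concrete below is an assumption for illustration: the aberration criteria (a minimum word count and a minimum ratio of unique words), the threshold values, and the fact that the predicted human-machine discrepancy is passed in as a number instead of being produced by a model.

```python
def is_aberrant(text, min_words=5, min_unique_ratio=0.3):
    """Pre-screening: flag responses that are very short or highly repetitive."""
    words = text.lower().split()
    if len(words) < min_words:
        return True
    return len(set(words)) / len(words) < min_unique_ratio


def route_response(text, predicted_discrepancy, max_discrepancy=0.5):
    """Send a response to automated scoring only if it passes both stages."""
    if is_aberrant(text):
        return "human_review"  # failed the pre-screening stage
    if predicted_discrepancy > max_discrepancy:
        return "human_review"  # failed the post-hoc screening stage
    return "automated_scoring"


if __name__ == "__main__":
    essay = "Photosynthesis converts light energy into chemical energy stored in glucose"
    print(route_response(essay, predicted_discrepancy=0.1))
    print(route_response("too short", predicted_discrepancy=0.1))
```

Running the cheap aberration checks first means the discrepancy prediction is only needed for responses that already look well formed.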
-
Publication No.: US10885274B1
Publication date: 2021-01-05
Application No.: US16014021
Filing date: 2018-06-21
Inventors: Swapna Somasundaran, Michael Flor, Martin Chodorow, Binod Gyawali, Hillary Molloy, Laura McCulla
IPC classification: G06F40/279, G06F16/34, G06F16/31, G06F40/30
Abstract: Systems and methods are provided for processing a response to essay prompts that request a narrative response. A data structure associated with a narrative essay is accessed. The essay is analyzed to generate an organization subscore, where the organization subscore is generated using a graph metric by identifying content words in each sentence of the essay and populating a data structure with links between related content words in neighboring sentences, wherein the organization subscore is determined based on the links. The essay is analyzed to generate a development subscore, where the development subscore is generated using a transition metric by accessing a transition cue data store and identifying transition words in the essay, wherein the development subscore is based on a number of words in the essay that match words in the transition cue data store. A narrative quality metric is determined based on the organization subscore and the development subscore.
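The two subscores can be sketched in a few lines. This is a simplified illustration under stated assumptions: "related" content words are taken to be identical words, the stopword list and `TRANSITION_CUES` are tiny hypothetical stand-ins for real resources, and the two subscores are combined by plain addition, which the abstract does not specify.

```python
import re

# Hypothetical, deliberately small word lists for illustration only.
STOPWORDS = {"the", "a", "an", "and", "to", "of", "in", "was", "it"}
TRANSITION_CUES = {"then", "later", "finally", "suddenly", "afterwards", "meanwhile"}


def split_sentences(text):
    """Split on sentence-ending punctuation and drop empty pieces."""
    return [s for s in re.split(r"[.!?]+", text) if s.strip()]


def split_words(text):
    """Lowercase word tokens, keeping apostrophes inside words."""
    return re.findall(r"[a-z']+", text.lower())


def organization_subscore(text):
    """Average number of shared content words between neighboring sentences."""
    content = [set(split_words(s)) - STOPWORDS for s in split_sentences(text)]
    links = [(i, w) for i in range(len(content) - 1) for w in content[i] & content[i + 1]]
    return len(links) / max(len(content) - 1, 1)


def development_subscore(text):
    """Number of words in the essay that match the transition cue list."""
    return sum(1 for w in split_words(text) if w in TRANSITION_CUES)


def narrative_quality(text):
    """Assumed combination: sum of the two subscores."""
    return organization_subscore(text) + development_subscore(text)


if __name__ == "__main__":
    story = "The dog ran to the park. Then the dog found a ball. Finally the ball rolled away."
    print(narrative_quality(story))
```

The `links` list plays the role of the populated data structure: each entry records which neighboring sentence pair is connected and by which word.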
-
Publication No.: US10783873B1
Publication date: 2020-09-22
Application No.: US16221980
Filing date: 2018-12-17
Inventors: Yao Qian, Keelan Evanini, Patrick Lange, Robert A. Pugh, Rutuja Ubale
Abstract: Systems and methods for identifying a person's native language are presented. A native language identification system, comprising a plurality of artificial neural networks, such as time delay deep neural networks, is provided. Respective artificial neural networks of the plurality of artificial neural networks are trained as universal background models, using separate native language and non-native language corpora. The artificial neural networks may be used to perform voice activity detection and to extract sufficient statistics from the respective language corpora. The artificial neural networks may use the sufficient statistics to estimate respective T-matrices, which may in turn be used to extract respective i-vectors. The artificial neural networks may use i-vectors to generate a multilayer perceptron model, which may be used to identify a person's native language, based on an utterance by the person in his or her non-native language.
-
Publication No.: US10585985B1
Publication date: 2020-03-10
Application No.: US15841568
Filing date: 2017-12-14
Abstract: Methods and systems for scoring written text based on use of idiomatic expressions, including reading pre-selected idiomatic expressions in a canonical form into memory, expanding idiomatic expressions from the canonical form, reading a written response into the memory, pre-processing the written response, searching the pre-processed written response for idiomatic expressions, and assigning a score to the written response. The score may be based at least in part on the number of idiomatic expressions in the written response. Corresponding apparatuses, systems, and methods are also disclosed.
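The steps listed in this abstract (expand canonical idioms, pre-process the response, search, count) can be sketched as follows. The expansion rule is a naive assumption that only inflects the first word with `-s`, `-ed`, and `-ing`; real expansion would need proper morphology and is not described in the abstract. All function names are hypothetical.

```python
import re


def expand(canonical):
    """Naive expansion: inflect the first word of the idiom (assumed to be a regular verb)."""
    head, _, rest = canonical.partition(" ")
    forms = [head, head + "s", head + "ed", head + "ing"]
    return [f"{form} {rest}".strip() for form in forms]


def preprocess(text):
    """Lowercase the response and strip punctuation, leaving single-spaced words."""
    return " ".join(re.findall(r"[a-z']+", text.lower()))


def count_idioms(text, canonical_idioms):
    """Count whole-phrase occurrences of any expanded idiom variant in the response."""
    padded = f" {preprocess(text)} "  # padding makes matches respect word boundaries
    return sum(
        padded.count(f" {variant} ")
        for idiom in canonical_idioms
        for variant in expand(idiom)
    )


if __name__ == "__main__":
    response = "He spilled the beans, and then she kicked the bucket."
    print(count_idioms(response, ["spill the beans", "kick the bucket"]))
```

The resulting count could then feed a score, for example as one feature among several, which matches the abstract's wording that the score is based "at least in part" on the number of idioms.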
-
Publication No.: US10515314B2
Publication date: 2019-12-24
Application No.: US14957769
Filing date: 2015-12-03
Abstract: Systems and methods are provided for a computer-implemented method for identifying pairs of cohesive words within a text. A supervised model is trained to detect cohesive words within a text to be scored. Training the supervised model includes identifying a plurality of pairs of candidate cohesive words in a training essay and an order associated with the pairs of candidate cohesive words based on an order of words in the training essay. The pairs of candidate cohesive words are filtered to form a set of evaluation pairs. The evaluation pairs are provided via a graphical user interface based on the order associated with the pairs of candidate cohesive words. An indication of cohesion or no cohesion is received for the evaluation pairs via the graphical user interface. The supervised model is trained based on the evaluation pairs and the received indications.
-
Publication No.: US10446044B2
Publication date: 2019-10-15
Application No.: US14306753
Filing date: 2014-06-17
IPC classification: G09B7/02
Abstract: A computer-implemented method of calibrating an assessment model for assessing responses includes accessing a plurality of training responses with a processing system for training an assessment model. The processing system analyzes the plurality of training responses to derive values of multiple features of the training responses. The processing system trains the assessment model based on the values of the multiple features of the training responses and a portfolio score for each individual associated with the plurality of training responses utilized in the training. The portfolio score for each individual corresponds to a measure of proficiency based on multiple writing samples constructed by the individual. The processing system determines, based on said training, a weight for each of the multiple features. The processing system calibrates the assessment model to include the weights for at least some of the features such that the assessment model is configured to generate scores for responses.
-
Publication No.: US10424217B1
Publication date: 2019-09-24
Application No.: US15378591
Filing date: 2016-12-14
Inventors: Kathleen M. Sheehan
IPC classification: G09B7/02, G09B17/00, G09B7/04, G09B7/08, G09B7/00, G09B7/07, G09B7/06, A63F13/67, G06F11/34
Abstract: Systems and methods are provided for generating texts appropriate for a reading level of an individual. An existing exam unit is accessed, wherein the existing exam unit includes a reading passage and a plurality of questions related to the reading passage. The plurality of questions are filtered based on a criterion to form a subset of questions. A first difficulty score is determined based on the reading passage. A second difficulty score is determined based on the subset of questions. A correlation between the first difficulty score and the second difficulty score is determined, and a text is generated that is appropriate for a reading level of an individual based on performance of the individual on the exam unit.