Systems and methods for neural content scoring

    公开(公告)号:US11790227B1

    公开(公告)日:2023-10-17

    申请号:US17148742

    申请日:2021-01-14

    摘要: Systems and methods are disclosed for automatically scoring a constructed response using a neural network. In embodiments, a constructed response received by a processing system may be processed to divide the constructed response into multiple series of word tokens, wherein each word token includes a sequence of characters. The constructed response may be further processed to correct one or more spelling errors. The word tokens may be encoded to generate representation vectors for the constructed response. A set of nonlinear operations may be applied to the plurality of representation vectors in a neural network to generate a single vector output. A set of predetermined network weights may be applied to the vector output of the neural network to generate a scalar output for scoring the constructed response.

    Detection of off-topic spoken responses using machine learning

    公开(公告)号:US11455999B1

    公开(公告)日:2022-09-27

    申请号:US16844439

    申请日:2020-04-09

    IPC分类号: G10L15/26 G10L15/16 G06N3/08

    摘要: Data is received that encapsulates a spoken response to a prompt text comprising a string of words. Thereafter, the received data is transcribed into a string of words. The string of words is then compared with a prompt so that a similarity grid representation of the comparison can be generated that characterizes a level of similarity between the string of words in the spoken response and the string of words in the prompt text. The grid representation is then scored using at least one machine learning model. The score indicates a likelihood of the spoken response having been off-topic. Data providing the encapsulated score can then be provided. Related apparatus, systems, techniques and articles are also described.

    Detection of plagiarized spoken responses using machine learning

    公开(公告)号:US11417339B1

    公开(公告)日:2022-08-16

    申请号:US16695348

    申请日:2019-11-26

    摘要: Data is received that encapsulates a spoken response to a test question. Thereafter, the received data is transcribed into a string of words. The string of words is then compared with at least one source string so that a similarity grid representation of the comparison can be generated that characterizes a level of similarity between the string of words and the at least one source string. The grid representation is then scored using at least one machine learning model. The score indicates a likelihood of the spoken response having been plagiarized. Data providing the encapsulated score can then be provided. Related apparatus, systems, techniques and articles are also described.

    Systems and methods for treatment of aberrant responses

    公开(公告)号:US11049409B1

    公开(公告)日:2021-06-29

    申请号:US14974721

    申请日:2015-12-18

    IPC分类号: G09B7/02 G06F40/20

    摘要: Systems and methods are provided for automatically scoring a response and statistically revaluating whether it can be considered as aberrant. In one embodiment, a constructed response is evaluated via a pre-screening stage and a post-hoc screening stage. The pre-screening stage attempts to determine whether the constructed response is aberrant based on a variety of aberration metrics and criteria. If the constructed response is deemed not to be aberrant, then the post-hoc screening stage attempts to predict a discrepancy between what score an automated scoring system would assigned and what score a human rater would assign to the response. If the discrepancy is sufficiently low, then the constructed response may be scored by an automated scoring engine. On the other hand, if the constructed response failed to pass either of the two stages, then a flag may be raised to indicate that additional human review may be needed.

    Platform for administering and evaluating narrative essay examinations

    公开(公告)号:US10885274B1

    公开(公告)日:2021-01-05

    申请号:US16014021

    申请日:2018-06-21

    摘要: Systems and methods are provided for processing a response to essay prompts that request a narrative response. A data structure associated with a narrative essay is accessed. The essay is analyzed to generate an organization subscore, where the organization subscore is generated using a graph metric by identifying content words in each sentence of the essay and populating a data structure with links between related content words in neighboring sentences, wherein the organization subscore is determined based on the links. The essay is analyzed to generate a development subscore, where the development subscore is generated using a transition metric by accessing a transition cue data store and identifying transition words in the essay, wherein the development subscore is based on a number of words in the essay that match words in the transition cue data store. A narrative quality metric is determined based on the organization subscore and the development subscore.

    Computer-implemented systems and methods for generating a supervised model for lexical cohesion detection

    公开(公告)号:US10515314B2

    公开(公告)日:2019-12-24

    申请号:US14957769

    申请日:2015-12-03

    摘要: Systems and methods are provided for a computer-implemented method for identifying pairs of cohesive words within a text. A supervised model is trained to detect cohesive words within a text to be scored. Training the supervised model includes identifying a plurality of pairs of candidate cohesive words in a training essay and an order associated with the pairs of candidate cohesive words based on an order of words in the training essay. The pairs of candidate cohesive words are filtered to form a set of evaluation pairs. The evaluation pairs are provided via a graphical user interface based on the order associated with the pairs of candidate cohesive words. An indication of cohesion or no cohesion is received for the evaluation pairs via the graphical user interface. The supervised model is trained based on the evaluation pairs and the received indications.

    Systems and methods for generating automated evaluation models

    公开(公告)号:US10446044B2

    公开(公告)日:2019-10-15

    申请号:US14306753

    申请日:2014-06-17

    IPC分类号: G09B7/02

    摘要: A computer-implemented method of calibrating an assessment model for assessing responses includes accessing a plurality of training responses with a processing system for training an assessment model. The processing system analyzes the plurality of training responses to derive values of multiple features of the training responses. The processing system trains the assessment model based on the values of the multiple features of the training responses and a portfolio score for each individual associated with the plurality of training responses utilized in the training. The portfolio score for each individual corresponds to a measure of proficiency based on multiple writing samples constructed by the individual. The processing system determines, based on said training, a weight for each of the multiple features. The processing system calibrates the assessment model to include the weights for at least some of the features such that the assessment model is configured to generate scores for responses.

    Systems and methods for ability-appropriate text generation

    公开(公告)号:US10424217B1

    公开(公告)日:2019-09-24

    申请号:US15378591

    申请日:2016-12-14

    摘要: Systems and methods are provided for generating texts appropriate for a reading level of an individual. An existing exam unit is accessed, wherein the existing exam unit includes a reading passage and a plurality of questions related to the reading passage. The plurality of questions are filtered based on a criterion to form a subset of questions. A first difficulty score is determined based on the reading passage. A second difficulty score is determined based on the subset of questions. A correlation between the first difficulty score and the second difficulty score is determined, and a text is generated that is appropriate for a reading level of an individual based on performance of the individual on the exam unit.