-
公开(公告)号:US10559225B1
公开(公告)日:2020-02-11
申请号:US15988232
申请日:2018-05-24
Applicant: Educational Testing Service
Inventor: Jidong Tao , Lei Chen , Chong Min Lee
IPC: G10L15/00 , G09B19/04 , G06N3/04 , G10L15/183 , G10L15/16
Abstract: Provide automatic assessment of oral recitations during computer based language assessments using a trained neural network to automate the scoring and feedback processes without human transcription and scoring input by automatically generating a score of a language assessment. Providing an automatic speech recognition (“ASR”) scoring system. Training multiple scoring reference vectors associated with multiple possible scores of an assessment, and receiving an acoustic language assessment response to an assessment item. Based on the acoustic language assessment automatically generating a transcription, and generating an individual word vector from the transcription. Generating an input vector by concatenating an individual word vector with a transcription feature vector, and supplying an input vector as input to a neural network. Generating an output vector based on weights of a neural network; and generating a score by comparing an output vector with scoring vectors.
-
公开(公告)号:US10937444B1
公开(公告)日:2021-03-02
申请号:US16196716
申请日:2018-11-20
Applicant: Educational Testing Service
Inventor: David Suendermann-Oeft , Lei Chen , Jidong Tao , Shabnam Ghaffarzadegan , Yao Qian
Abstract: A system for end-to-end automated scoring is disclosed. The system includes a word embedding layer for converting a plurality of ASR outputs into input tensors; a neural network lexical model encoder receiving the input tensors; a neural network acoustic model encoder implementing AM posterior probability, word duration, mean value of pitch and mean value of intensity based on a plurality of cues; and a linear regression module, for receiving concatenated encoded features from the neural network lexical model encoder and the neural network acoustic model encoder.
-
公开(公告)号:US09984682B1
公开(公告)日:2018-05-29
申请号:US15474440
申请日:2017-03-30
Applicant: Educational Testing Service
Inventor: Jidong Tao , Lei Chen , Chong Min Lee
CPC classification number: G09B19/04 , G06N3/04 , G10L15/16 , G10L15/183
Abstract: Provide automatic assessment of oral recitations during computer based language assessments using a trained neural network to automate the scoring and feedback processes without human transcription and scoring input by automatically generating a score of a language assessment. Providing an automatic speech recognition (“ASR”) scoring system. Training multiple scoring reference vectors associated with multiple possible scores of an assessment, and receiving an acoustic language assessment response to an assessment item. Based on the acoustic language assessment automatically generating a transcription, and generating an individual word vector from the transcription. Generating an input vector by concatenating an individual word vector with a transcription feature vector, and supplying an input vector as input to a neural network. Generating an output vector based on weights of a neural network; and generating a score by comparing an output vector with scoring vectors.
-
公开(公告)号:US10008209B1
公开(公告)日:2018-06-26
申请号:US15273830
申请日:2016-09-23
Applicant: Educational Testing Service
Inventor: Yao Qian , Jidong Tao , David Suendermann-Oeft , Keelan Evanini , Alexei V. Ivanov , Vikram Ramanarayanan
Abstract: Systems and methods are provided for providing voice authentication of a candidate speaker. Training data sets are accessed, where each training data set comprises data associated with a training speech sample of a speaker and a plurality of speaker metrics, where the plurality of speaker metrics include a native language of the speaker. The training data sets are used to train a neural network, where the data associated with each training speech sample is a training input to the neural network, and each of the plurality of speaker metrics is a training output to the neural network. Data associated with a speech sample is provided to the neural network to generate a vector that contains values for the plurality of speaker metrics, and the values contained in the vector are compared to values contained in a reference vector associated with a known person to determine whether the candidate speaker is the known person.
-
公开(公告)号:US10283142B1
公开(公告)日:2019-05-07
申请号:US15215649
申请日:2016-07-21
Applicant: Educational Testing Service
Inventor: Zhou Yu , Vikram Ramanarayanan , David Suendermann-Oeft , Xinhao Wang , Klaus Zechner , Lei Chen , Jidong Tao , Yao Qian
Abstract: Systems and methods are provided for a processor-implemented method of analyzing quality of sound acquired via a microphone. An input metric is extracted from a sound recording at each of a plurality of time intervals. The input metric is provided at each of the time intervals to a neural network that includes a memory component, where the neural network provides an output metric at each of the time intervals, where the output metric at a particular time interval is based on the input metric at a plurality of time intervals other than the particular time interval using the memory component of the neural network. The output metric is aggregated from each of the time intervals to generate a score indicative of the quality of the sound acquired via the microphone.
-
-
-
-