Patent search ap:("Educational Testing Service") AND inv:"Jidong Tao" Page 1

1.

发明授权
Computer-implemented systems and methods for automatically generating an assessment of oral recitations of assessment items 有权

公开(公告)号：US10559225B1

公开(公告)日：2020-02-11

申请号：US15988232

申请日：2018-05-24

Applicant: Educational Testing Service

Inventor： Jidong Tao , Lei Chen , Chong Min Lee

IPC: G10L15/00 , G09B19/04 , G06N3/04 , G10L15/183 , G10L15/16

Abstract: Provide automatic assessment of oral recitations during computer based language assessments using a trained neural network to automate the scoring and feedback processes without human transcription and scoring input by automatically generating a score of a language assessment. Providing an automatic speech recognition (“ASR”) scoring system. Training multiple scoring reference vectors associated with multiple possible scores of an assessment, and receiving an acoustic language assessment response to an assessment item. Based on the acoustic language assessment automatically generating a transcription, and generating an individual word vector from the transcription. Generating an input vector by concatenating an individual word vector with a transcription feature vector, and supplying an input vector as input to a neural network. Generating an output vector based on weights of a neural network; and generating a score by comparing an output vector with scoring vectors.

2.

发明授权
End-to-end neural network based automated speech scoring 有权

公开(公告)号：US10937444B1

公开(公告)日：2021-03-02

申请号：US16196716

申请日：2018-11-20

Applicant: Educational Testing Service

Inventor： David Suendermann-Oeft , Lei Chen , Jidong Tao , Shabnam Ghaffarzadegan , Yao Qian

IPC: G10L25/30 , G10L15/16 , G06F17/18 , G06N3/04 , G06F40/284 , G10L25/60

Abstract: A system for end-to-end automated scoring is disclosed. The system includes a word embedding layer for converting a plurality of ASR outputs into input tensors; a neural network lexical model encoder receiving the input tensors; a neural network acoustic model encoder implementing AM posterior probability, word duration, mean value of pitch and mean value of intensity based on a plurality of cues; and a linear regression module, for receiving concatenated encoded features from the neural network lexical model encoder and the neural network acoustic model encoder.

3.

发明授权
Computer-implemented systems and methods for automatically generating an assessment of oral recitations of assessment items 有权

公开(公告)号：US09984682B1

公开(公告)日：2018-05-29

申请号：US15474440

申请日：2017-03-30

Applicant: Educational Testing Service

Inventor： Jidong Tao , Lei Chen , Chong Min Lee

IPC: G10L15/00 , G10L15/10 , G10L15/16 , G10L15/06 , G10L15/02 , G10L15/183 , G09B19/04

CPC classification number: G09B19/04 , G06N3/04 , G10L15/16 , G10L15/183

Abstract: Provide automatic assessment of oral recitations during computer based language assessments using a trained neural network to automate the scoring and feedback processes without human transcription and scoring input by automatically generating a score of a language assessment. Providing an automatic speech recognition (“ASR”) scoring system. Training multiple scoring reference vectors associated with multiple possible scores of an assessment, and receiving an acoustic language assessment response to an assessment item. Based on the acoustic language assessment automatically generating a transcription, and generating an individual word vector from the transcription. Generating an input vector by concatenating an individual word vector with a transcription feature vector, and supplying an input vector as input to a neural network. Generating an output vector based on weights of a neural network; and generating a score by comparing an output vector with scoring vectors.

4.

发明授权
Computer-implemented systems and methods for speaker recognition using a neural network 有权

公开(公告)号：US10008209B1

公开(公告)日：2018-06-26

申请号：US15273830

申请日：2016-09-23

Applicant: Educational Testing Service

Inventor： Yao Qian , Jidong Tao , David Suendermann-Oeft , Keelan Evanini , Alexei V. Ivanov , Vikram Ramanarayanan

IPC: G10L15/00 , G10L17/18 , G10L17/08 , G10L17/20 , G10L15/16

CPC classification number: G10L17/18 , G10L15/005 , G10L15/16 , G10L17/04 , G10L17/08 , G10L17/20

Abstract: Systems and methods are provided for providing voice authentication of a candidate speaker. Training data sets are accessed, where each training data set comprises data associated with a training speech sample of a speaker and a plurality of speaker metrics, where the plurality of speaker metrics include a native language of the speaker. The training data sets are used to train a neural network, where the data associated with each training speech sample is a training input to the neural network, and each of the plurality of speaker metrics is a training output to the neural network. Data associated with a speech sample is provided to the neural network to generate a vector that contains values for the plurality of speaker metrics, and the values contained in the vector are compared to values contained in a reference vector associated with a known person to determine whether the candidate speaker is the known person.

5.

发明授权
Processor-implemented systems and methods for determining sound quality 有权

公开(公告)号：US10283142B1

公开(公告)日：2019-05-07

申请号：US15215649

申请日：2016-07-21

Applicant: Educational Testing Service

Inventor： Zhou Yu , Vikram Ramanarayanan , David Suendermann-Oeft , Xinhao Wang , Klaus Zechner , Lei Chen , Jidong Tao , Yao Qian

IPC: G10L25/30 , G10L25/60 , G10L25/24 , G10L25/93

Abstract: Systems and methods are provided for a processor-implemented method of analyzing quality of sound acquired via a microphone. An input metric is extracted from a sound recording at each of a plurality of time intervals. The input metric is provided at each of the time intervals to a neural network that includes a memory component, where the neural network provides an output metric at each of the time intervals, where the output metric at a particular time interval is based on the input metric at a plurality of time intervals other than the particular time interval using the memory component of the neural network. The output metric is aggregated from each of the time intervals to generate a score indicative of the quality of the sound acquired via the microphone.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification