-
Publication No.: US09548052B2
Publication date: 2017-01-17
Application No.: US14108764
Filing date: 2013-12-17
Applicant: Google Inc.
Inventor: Virgil Scott King, Julian Paul Grady
IPC: G10L15/22, G06F3/0483, G06F3/16, G06F17/21, G06F3/0485, G06F3/0484, G06F15/02
CPC classification number: G10L15/22, G06F3/0483, G06F3/04842, G06F3/0485, G06F3/167, G06F15/0291, G06F17/21, G09G2380/14, G10L2015/223
Abstract: A user device receives audio data of a user reading aloud a displayed portion of an ebook, and converts a portion of the audio data to spoken-text data. The user device determines one or more similarity scores based on a comparison between the spoken-text data and text data associated with the displayed portion of the ebook, and ranks the one or more similarity scores to determine a reading location in the text data that corresponds to the spoken-text data. The user device determines a pronunciation score using the spoken-text data and pronunciation data associated with the text data at the reading location. The user device performs an action based in part on the reading location and the pronunciation score.
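The flow in the abstract (score candidate locations, rank them, score pronunciation, pick an action) can be illustrated with a minimal Python sketch. This is not the patented implementation: the function names, the sliding-window comparison, the use of `difflib.SequenceMatcher` as the similarity measure, the phoneme-list form of the pronunciation data, and the 0.8 threshold are all assumptions made for illustration only.

```python
from difflib import SequenceMatcher


def rank_locations(spoken_words, page_words):
    """Score each window of the displayed text against the spoken words.

    Returns (similarity, start_index) pairs sorted best first.
    The sliding window and SequenceMatcher are illustrative assumptions.
    """
    n = len(spoken_words)
    scored = []
    for start in range(max(1, len(page_words) - n + 1)):
        window = page_words[start:start + n]
        score = SequenceMatcher(None, spoken_words, window).ratio()
        scored.append((score, start))
    # Highest similarity first; ties go to the earliest position on the page.
    scored.sort(key=lambda item: (-item[0], item[1]))
    return scored


def pronunciation_score(spoken_phonemes, expected_phonemes):
    """Similarity in [0, 1] between heard and expected phoneme sequences."""
    return SequenceMatcher(None, spoken_phonemes, expected_phonemes).ratio()


def choose_action(location, score, threshold=0.8):
    """Hypothetical policy: highlight the passage, or offer a hint if the score is low."""
    kind = "highlight" if score >= threshold else "show_pronunciation_hint"
    return (kind, location)


# Example: the reader says "brown fox jumps" while this sentence is displayed.
page = "the quick brown fox jumps over the lazy dog".split()
spoken = "brown fox jumps".split()
best_similarity, location = rank_locations(spoken, page)[0]

# Assumed phoneme lists for the word at the reading location.
heard = ["B", "R", "OW", "N"]
expected = ["B", "R", "AW", "N"]
action = choose_action(location, pronunciation_score(heard, expected))
```

In this sketch the top-ranked window gives the reading location, and the action depends on both that location and the pronunciation score, mirroring the last sentence of the abstract.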
-
Publication No.: US20150170648A1
Publication date: 2015-06-18
Application No.: US14108764
Filing date: 2013-12-17
Applicant: Google Inc.
Inventor: Virgil Scott King, Julian Paul Grady
IPC: G10L15/26
CPC classification number: G10L15/22, G06F3/0483, G06F3/04842, G06F3/0485, G06F3/167, G06F15/0291, G06F17/21, G09G2380/14, G10L2015/223
Abstract: A user device receives audio data of a user reading aloud a displayed portion of an ebook, and converts a portion of the audio data to spoken-text data. The user device determines one or more similarity scores based on a comparison between the spoken-text data and text data associated with the displayed portion of the ebook, and ranks the one or more similarity scores to determine a reading location in the text data that corresponds to the spoken-text data. The user device determines a pronunciation score using the spoken-text data and pronunciation data associated with the text data at the reading location. The user device performs an action based in part on the reading location and the pronunciation score.
-