Abstract:
A system for providing navigational information to a vehicle driver. An on-board system is disposed on a vehicle and processes and transmits to a data center, via a wireless link, spoken requests from a vehicle driver requesting navigational information. The data center performs automated voice recognition on the received spoken requests to attempt recognition of destination components of the spoken requests, generates a list of possible destination components corresponding to the spoken requests, assigns a confidence score to each of the possible destination components on the list, determines whether the possible destination component with the highest confidence score has a confidence score above a selected threshold, and, if that highest confidence score is above the selected threshold, computer-generates a representation of the possible destination components for transmission to the on-board system via the wireless link for confirmation by the vehicle driver.
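The confidence-thresholding step described above can be sketched as follows. This is a minimal illustration only; the function name, the `(text, score)` candidate representation, and the default threshold are assumptions, not the patent's actual implementation:

```python
def best_destination(candidates, threshold=0.8):
    """Pick the destination candidate with the highest confidence score.

    candidates: list of (text, score) pairs produced by the recognizer.
    Returns the top candidate for driver confirmation if its score
    clears the threshold, otherwise None (signalling fallback handling,
    e.g. re-prompting the driver).
    """
    if not candidates:
        return None
    text, score = max(candidates, key=lambda c: c[1])
    return text if score >= threshold else None
```

A candidate list such as `[("Main St", 0.9), ("Maine St", 0.4)]` would yield `"Main St"`, while a list whose best score falls below the threshold yields `None`.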
Abstract:
An information presentation device includes an audio signal input unit configured to input an audio signal, an image signal input unit configured to input an image signal, an image display unit configured to display an image indicated by the image signal, a sound source localization unit configured to estimate direction information for each sound source based on the audio signal, a sound source separation unit configured to separate the audio signal into sound-source-classified audio signals for each sound source, an operation input unit configured to receive an operation input and to generate coordinate designation information indicating a part of a region of the image, and a sound source selection unit configured to select a sound-source-classified audio signal of a sound source associated with a coordinate which is included in a region indicated by the coordinate designation information, and which corresponds to the direction information.
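The sound-source-selection step above hinges on associating each source's estimated direction with image coordinates. A minimal sketch, assuming a simple linear mapping from azimuth to horizontal pixel position (the function names, the field-of-view parameter, and the one-dimensional region are illustrative assumptions):

```python
def select_sources(sources, region, image_width, fov_deg=90.0):
    """Select sound sources whose estimated direction falls inside the
    image region designated by the operation input.

    sources: list of (source_id, azimuth_deg) pairs from localization.
    region:  (x_min, x_max) pixel span from the coordinate designation.
    Azimuths are mapped linearly onto the camera field of view, with
    0 degrees at the image centre.
    """
    x_min, x_max = region

    def azimuth_to_x(azimuth_deg):
        return (azimuth_deg / fov_deg + 0.5) * image_width

    return [sid for sid, az in sources if x_min <= azimuth_to_x(az) <= x_max]
```

The selected source IDs would then index into the separated, sound-source-classified audio signals.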
Abstract:
Techniques to provide automatic speech recognition at a local device are described. An apparatus may include an audio input to receive audio data indicating a task. The apparatus may further include a local recognizer component to receive the audio data, to pass the audio data to a remote recognizer while receiving the audio data, and to recognize speech from the audio data. The apparatus may further include a federation component operative to receive one or more recognition results from the local recognizer and/or the remote recognizer, and to federate a plurality of recognition results to produce a most likely result. The apparatus may further include an application to perform the task indicated by the most likely result. Other embodiments are described and claimed.
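The federation component's merging of local and remote results can be sketched as below. This assumes scored `(text, score)` hypotheses and an illustrative weighting that favours the remote recognizer; the patent does not specify this scheme:

```python
def federate(local_results, remote_results, remote_weight=1.2):
    """Federate scored hypotheses from the local and remote recognizers
    and return the most likely result.

    Duplicate hypotheses keep their best score; remote scores are
    up-weighted here as one illustrative policy (a larger remote model
    is often more accurate).
    """
    pooled = {}
    for text, score in local_results:
        pooled[text] = max(pooled.get(text, 0.0), score)
    for text, score in remote_results:
        pooled[text] = max(pooled.get(text, 0.0), score * remote_weight)
    return max(pooled, key=pooled.get) if pooled else None
```

With a local hypothesis `("call mom", 0.6)` and a remote hypothesis `("call tom", 0.55)`, the weighted remote score (0.66) wins, so `"call tom"` is passed to the application.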
Abstract:
A voice processing apparatus includes: a voice receptor configured to collect a user voice, to convert the user voice into a first voice signal, and to output the first voice signal; an audio processor configured to process a sound output through a speaker to output an audio signal; a memory unit configured to store the first voice signal output from the voice receptor and the audio signal output from the audio processor; an echo canceler configured to remove an echo from the first voice signal to generate a second voice signal; and a first controller configured to control the echo canceler to generate the second voice signal based on the first voice signal and the audio signal stored in the memory unit.
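The echo canceler's role — subtracting an estimate of the speaker's audio signal from the microphone signal — is commonly realized with an adaptive filter. A minimal normalized-LMS (NLMS) sketch, not the patent's specific canceler; the tap count, step size, and sample-list representation are illustrative assumptions:

```python
def nlms_echo_cancel(mic, ref, taps=4, mu=0.5, eps=1e-8):
    """Adaptive NLMS echo canceller sketch.

    mic: microphone samples (first voice signal, containing echo).
    ref: speaker reference samples (stored audio signal).
    Returns the echo-reduced samples (the "second voice signal").
    """
    w = [0.0] * taps                 # adaptive filter weights
    padded = [0.0] * (taps - 1) + list(ref)
    out = []
    for n, m in enumerate(mic):
        x = padded[n:n + taps][::-1]              # recent ref samples
        est = sum(wi * xi for wi, xi in zip(w, x))  # estimated echo
        e = m - est                                 # echo-cancelled sample
        norm = sum(xi * xi for xi in x) + eps
        w = [wi + mu * e * xi / norm for wi, xi in zip(w, x)]
        out.append(e)
    return out
```

Because the filter adapts online, the residual echo shrinks over time as the weights converge toward the actual echo path.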
Abstract:
A speech processing method executed by a computer, the speech processing method including: extracting, based on speech recognition for input speech data, a plurality of word candidates including a first word candidate and a second word candidate from a memory, the plurality of word candidates being candidates for a word corresponding to the input speech data; determining at least one different part between the first word candidate and the second word candidate based on a comparison between the first word candidate and the second word candidate; and outputting the first word candidate with emphasis on the at least one different part.
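The compare-and-emphasize step can be sketched with a standard sequence comparison; bracketing the differing characters stands in for whatever emphasis (bold, highlight) the output device applies. The function name and bracket notation are assumptions:

```python
import difflib

def emphasize_difference(first, second):
    """Return `first` with the parts that differ from `second` marked.

    Differing character runs are wrapped in brackets as a stand-in for
    visual emphasis, helping a user spot where two candidates diverge.
    """
    sm = difflib.SequenceMatcher(None, first, second)
    out = []
    for op, i1, i2, _, _ in sm.get_opcodes():
        seg = first[i1:i2]
        if seg:
            out.append(seg if op == "equal" else f"[{seg}]")
    return "".join(out)
```

For candidates "recognise" and "recognize", the output emphasizes only the single differing character.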
Abstract:
Systems and methods to provide a set of dictionaries and highlighting lists for speech recognition and highlighting, where the speech recognition focuses only on the limited vocabulary present in a document. The systems and methods allow rapid and accurate matching of the utterance with the available text, and appropriately indicate the location in the text or signal any errors made during reading. Described herein is a system and method to create speech recognition systems focused on reading a fixed text and providing readers feedback on what they read, to improve literacy, aid those with disabilities, and make the reading experience more efficient and fun.
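The core matching loop — tracking a reader's position in a fixed text and flagging misread words — can be sketched as follows. This is a simplified word-level illustration; the function name and error representation are assumptions:

```python
def follow_reading(text_words, heard_words):
    """Track a reader through a fixed text.

    text_words:  the fixed text, as a list of words.
    heard_words: words recognized from the reader's speech, in order.
    Advances a cursor for each correctly read word; records each
    misread word with the position where it occurred.
    Returns (final_position, errors).
    """
    pos, errors = 0, []
    for heard in heard_words:
        if pos < len(text_words) and heard.lower() == text_words[pos].lower():
            pos += 1
        else:
            errors.append((pos, heard))
    return pos, errors
```

Restricting the recognizer's vocabulary to `text_words` is what makes the matching both rapid and accurate, since only in-document words can be hypothesized.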
Abstract:
Predicting and learning users' intended actions on an electronic device based on free-form speech input. Users' actions can be monitored to develop a list of carrier phrases having one or more actions that correspond to the carrier phrases. A user can speak a command into a device to initiate an action. The spoken command can be parsed and compared to a list of carrier phrases. If the spoken command matches one of the known carrier phrases, the corresponding action(s) can be presented to the user for selection. If the spoken command does not match one of the known carrier phrases, search results (e.g., Internet search results) corresponding to the spoken command can be presented to the user. The actions of the user in response to the presented action(s) and/or the search results can be monitored to update the list of carrier phrases.
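The parse-and-compare step against the carrier-phrase list can be sketched as a prefix match, with the remainder of the command becoming the action's argument. The dictionary shape and the `None` fallback (triggering a web search) are illustrative assumptions:

```python
def match_carrier_phrase(command, carrier_phrases):
    """Parse a spoken command against known carrier phrases.

    carrier_phrases maps a phrase (e.g. "navigate to") to an action
    name. Returns (action, argument) on a match; returns None when no
    carrier phrase matches, signalling that search results for the raw
    command should be presented instead.
    """
    words = command.lower().split()
    for phrase, action in carrier_phrases.items():
        p = phrase.lower().split()
        if words[:len(p)] == p:
            return action, " ".join(words[len(p):])
    return None
```

A learning step would then update `carrier_phrases` based on which presented action or search result the user actually selects.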
Abstract:
A method is described for user correction of speech recognition results. A speech recognition result for a given unknown speech input is displayed to a user. A user selection is received of a portion of the recognition result needing to be corrected. For each of multiple different recognition data sources, a ranked list of alternate recognition choices is determined which correspond to the selected portion. The alternate recognition choices are concatenated or interleaved together, and duplicate choices are removed, to form a single ranked output list of alternate recognition choices, which is displayed to the user. The method may be adaptive over time to derive preferences that can then be leveraged in the ordering of one choice list or across choice lists.
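The interleave-and-deduplicate step can be sketched as a round-robin merge that preserves each source's ranking; the function name and list-of-strings representation are assumptions:

```python
def merge_choice_lists(*ranked_lists):
    """Interleave ranked alternate-recognition lists from multiple
    sources into a single ranked output list, dropping duplicates.

    Choices are taken round-robin by rank, so each source's top
    choices surface early in the merged list.
    """
    merged, seen = [], set()
    for rank in range(max((len(l) for l in ranked_lists), default=0)):
        for lst in ranked_lists:
            if rank < len(lst) and lst[rank] not in seen:
                seen.add(lst[rank])
                merged.append(lst[rank])
    return merged
```

Learned user preferences could later reorder the merged list, per the adaptive variant the abstract mentions.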
Abstract:
The present invention provides an interface device for processing a user's voice, and a method thereof, which efficiently outputs various information so as to allow the user to contribute to voice recognition or automatic interpretation. For this purpose, the interface device includes an utterance input unit configured to input an utterance of a user, an utterance end recognizing unit configured to recognize the end of the input utterance, and an utterance result output unit configured to output at least one of a voice recognition result, a translation result, and an interpretation result of the ended utterance.
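The utterance-end recognizing unit's job is commonly implemented as energy-based endpointing: once speech has been observed, a sufficiently long run of low-energy frames marks the end of the utterance. A minimal sketch; the threshold, silence-run length, and frame-energy representation are illustrative assumptions:

```python
def detect_utterance_end(frame_energies, threshold=0.01, silence_frames=30):
    """Energy-based endpointing sketch.

    Returns the index of the first frame of the trailing silence run
    once speech has been heard and energy stays below `threshold` for
    `silence_frames` consecutive frames; returns None if the utterance
    has not yet ended.
    """
    spoke, quiet = False, 0
    for i, energy in enumerate(frame_energies):
        if energy >= threshold:
            spoke, quiet = True, 0
        elif spoke:
            quiet += 1
            if quiet >= silence_frames:
                return i - silence_frames + 1
    return None
```

The detected endpoint is what allows recognition, translation, or interpretation results to be output promptly for the completed utterance.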
Abstract:
Disclosed are a display apparatus, a voice acquiring apparatus and a voice recognition method thereof, the display apparatus including: a display unit which displays an image; a communication unit which communicates with a plurality of external apparatuses; and a controller which includes a voice recognition engine to recognize a user's voice, receives a voice signal from a voice acquiring unit, and controls the communication unit to receive candidate instruction words from at least one of the plurality of external apparatuses to recognize the received voice signal.
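Matching the received voice signal against candidate instruction words gathered from the external apparatuses can be sketched with a simple similarity score; the function name, the dictionary shape, and the use of a character-level ratio are assumptions, not the disclosed engine:

```python
import difflib

def recognize_with_candidates(voice_text, candidates_by_apparatus):
    """Match a recognized utterance against candidate instruction words
    collected from external apparatuses.

    candidates_by_apparatus: {apparatus_name: [instruction words]}.
    Returns the (apparatus, word) pair whose word best matches the
    utterance, or None if no candidates were provided.
    """
    best, best_score = None, 0.0
    for apparatus, words in candidates_by_apparatus.items():
        for word in words:
            score = difflib.SequenceMatcher(
                None, voice_text.lower(), word.lower()).ratio()
            if score > best_score:
                best, best_score = (apparatus, word), score
    return best
```

Restricting recognition to each apparatus's own candidate instruction words is what lets the controller resolve which external apparatus the utterance targets.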