ELECTRONIC APPARATUS AND CONTROL METHOD THEREOF

    Publication number: WO2019112332A1

    Publication date: 2019-06-13

    Application number: PCT/KR2018/015397

    Application date: 2018-12-06

    Inventor: JIN, Jangho

    CPC classification number: G10L15/22 G10L15/30 G10L17/22 G10L2015/228

    Abstract: An electronic apparatus is provided. The electronic apparatus includes a communicator comprising communication circuitry configured to communicate with a voice recognition server; and a processor configured to control the communicator to establish a session with the voice recognition server, based on a voice input start signal being received from a first external apparatus, to maintain the established session based on the voice input start signal being received from a second external apparatus in a state where the session is established, and to process voice recognition on audio data received from the second external apparatus using the maintained session.
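
    The session handling described in this abstract can be sketched as follows. This is a minimal illustration, not the claimed implementation: `FakeRecognitionServer`, `Apparatus`, and the method names are hypothetical, and the choice to ignore audio from the apparatus that is no longer active is an assumption the abstract does not state.

```python
from typing import Optional


class FakeRecognitionServer:
    """Hypothetical stand-in for the voice recognition server."""

    def __init__(self) -> None:
        self.sessions_opened = 0

    def open_session(self) -> int:
        # Each call represents one session establishment with the server.
        self.sessions_opened += 1
        return self.sessions_opened

    def recognize(self, session_id: int, audio: bytes) -> str:
        # Placeholder result; a real server would return recognized text.
        return f"session {session_id}: {len(audio)} bytes"


class Apparatus:
    """Keeps one server session alive while the audio source changes."""

    def __init__(self, server: FakeRecognitionServer) -> None:
        self.server = server
        self.session_id: Optional[int] = None
        self.active_source: Optional[str] = None

    def on_voice_input_start(self, source: str) -> None:
        # Establish a session only if none exists yet.
        if self.session_id is None:
            self.session_id = self.server.open_session()
        # Otherwise the existing session is maintained; only the source switches.
        self.active_source = source

    def on_audio(self, source: str, audio: bytes) -> Optional[str]:
        # Assumption: audio from a source that is not active is dropped.
        if self.session_id is None or source != self.active_source:
            return None
        return self.server.recognize(self.session_id, audio)
```

    In this sketch, a start signal from a remote control followed by one from a phone opens a single session, and the phone's audio is then recognized over that same session.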

    NEURAL NETWORKS FOR SPEAKER VERIFICATION
    Invention application

    Publication number: WO2019027531A1

    Publication date: 2019-02-07

    Application number: PCT/US2018/032681

    Application date: 2018-05-15

    Applicant: GOOGLE LLC

    Abstract: Systems, methods, devices, and other techniques for training and using a speaker verification neural network. A computing device may receive data that characterizes a first utterance. The computing device provides the data that characterizes the utterance to a speaker verification neural network. Subsequently, the computing device obtains, from the speaker verification neural network, a speaker representation that indicates speaking characteristics of a speaker of the first utterance. The computing device determines whether the first utterance is classified as an utterance of a registered user of the computing device. In response to determining that the first utterance is classified as an utterance of the registered user of the computing device, the device may perform an action for the registered user of the computing device.
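
    The decision step in this abstract can be illustrated with a short sketch. The abstract does not say how the speaker representation is compared against the registered user, so cosine similarity against an enrolled vector and the `threshold` value are assumptions; the neural network itself is out of scope and its output is represented here as a plain list of floats.

```python
import math
from typing import Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # a zero vector carries no speaker information
    return dot / (norm_a * norm_b)


def is_registered_user(
    utterance_repr: Sequence[float],
    enrolled_repr: Sequence[float],
    threshold: float = 0.8,  # hypothetical operating point
) -> bool:
    """Classify the utterance as the registered user's if it is close enough."""
    return cosine_similarity(utterance_repr, enrolled_repr) >= threshold
```

    A device following the abstract would perform the requested action only when `is_registered_user` returns `True`.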

    VOICE USER INTERFACE
    Invention application (under examination, published)

    Publication number: WO2017212235A1

    Publication date: 2017-12-14

    Application number: PCT/GB2017/051621

    Application date: 2017-06-06

    Abstract: A method of speaker authentication comprises: receiving a speech signal; dividing the speech signal into segments; and, following each segment, obtaining an authentication score based on said segment and previously received segments, wherein the authentication score represents a probability that the speech signal comes from a specific registered speaker. In response to an authentication request, an authentication result is output based on the authentication score.
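
    The running score described in this abstract can be sketched as below. The abstract does not specify how segments are combined, so this example assumes each segment yields a log-likelihood ratio (target speaker versus impostor), sums those ratios, and maps the sum to a probability under equal priors. `RunningAuthenticator` and its `threshold` are hypothetical names and values.

```python
import math


def _sigmoid(x: float) -> float:
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


class RunningAuthenticator:
    """Updates an authentication score after every speech segment."""

    def __init__(self) -> None:
        self.total_llr = 0.0  # evidence from all segments received so far

    def add_segment(self, segment_llr: float) -> float:
        # The new score depends on this segment and all previous segments.
        self.total_llr += segment_llr
        return self.score()

    def score(self) -> float:
        # Probability that the speech comes from the registered speaker,
        # assuming equal prior odds (an assumption of this sketch).
        return _sigmoid(self.total_llr)

    def authenticate(self, threshold: float = 0.9) -> bool:
        # Called when an authentication request arrives.
        return self.score() >= threshold
```

    Because the score is available after every segment, an authentication request can be answered at any time from the evidence accumulated so far.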

    A METHOD FOR FACILITATING A TRANSACTION USING A HUMANOID ROBOT
    Invention application (under examination, published)

    Publication number: WO2017131582A1

    Publication date: 2017-08-03

    Application number: PCT/SG2017/050008

    Application date: 2017-01-09

    Abstract: The present disclosure relates to methods for facilitating a transaction. The methods involve interaction with a humanoid robot and include using the humanoid robot to receive a session initiation instruction from a user. During the session one or more articles are identified for purchase, each article being a product or service. A checkout sequence is then initiated to purchase the one or more articles. At some stage during performance of the above steps, beforehand or afterwards, the user is authenticated via interaction with the humanoid robot. [FIG. 1]

    SYSTEM AND METHOD FOR PROVIDING WORDS OR PHRASES TO BE UTTERED BY MEMBERS OF A CROWD AND PROCESSING THE UTTERANCES IN CROWD-SOURCED CAMPAIGNS TO FACILITATE SPEECH ANALYSIS
    Invention application (under examination, published)

    Publication number: WO2017044370A1

    Publication date: 2017-03-16

    Application number: PCT/US2016/049856

    Application date: 2016-09-01

    Abstract: Systems and methods of providing text related to utterances, and gathering voice data in response to the text are provided herein. In various implementations, an identification token that identifies a first file for a voice data collection campaign, and a second file for a session script may be received from a natural language processing training device. The first file and the second file may be used to configure the mobile application to display a sequence of screens, each of the sequence of screens containing text of at least one utterance specified in the voice data collection campaign. Voice data may be received from the natural language processing training device in response to user interaction with the text of the at least one utterance. The voice data and the text may be stored in a transcription library.

    MAPPING INPUT TO FORM FIELDS
    Invention application (under examination, published)

    Publication number: WO2016164251A1

    Publication date: 2016-10-13

    Application number: PCT/US2016/025276

    Application date: 2016-03-31

    Applicant: GOOGLE INC.

    CPC classification number: G10L15/26 G06F17/243 G10L15/193 G10L17/22 G10L25/48

    Abstract: In some implementations, user input is received while a form that includes text entry fields is being accessed. In one aspect, a process may include mapping user input to fields of a form and populating the fields of the form with the appropriate information. This process may allow a user to fill out a form using speech input, by generating a transcription of input speech, determining a field that best corresponds to each portion of the speech, and populating each field with the appropriate information. In some examples, the processes described herein may reduce the load on user input components, may reduce overall power consumption and may reduce a cognitive burden on the user.
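
    The mapping step in this abstract can be illustrated with a toy example. It assumes the transcription has already been split into portions, and it scores each portion against each field with simple regular expressions; the abstract does not describe the scoring method, so `FIELD_SCORERS`, the field names, and the scores are all hypothetical.

```python
import re
from typing import Callable, Dict, List

# Hypothetical per-field scorers: higher means the portion fits the field better.
FIELD_SCORERS: Dict[str, Callable[[str], float]] = {
    "email": lambda t: 1.0 if re.fullmatch(r"[^@\s]+@[^@\s]+\.[^@\s]+", t) else 0.0,
    "phone": lambda t: 1.0 if re.fullmatch(r"[\d\s\-+()]{7,}", t) else 0.0,
    "name": lambda t: 0.5 if re.fullmatch(r"[A-Za-z]+(?: [A-Za-z]+)+", t) else 0.0,
}


def populate_form(portions: List[str]) -> Dict[str, str]:
    """Assign each transcribed portion to the field it best corresponds to."""
    form: Dict[str, str] = {}
    for portion in portions:
        text = portion.strip()
        # Pick the field with the highest score for this portion.
        best_field, best_score = max(
            ((name, scorer(text)) for name, scorer in FIELD_SCORERS.items()),
            key=lambda pair: pair[1],
        )
        # Skip portions that match no field, and never overwrite a filled field.
        if best_score > 0.0 and best_field not in form:
            form[best_field] = text
    return form
```

    A production system would likely use a learned model or grammar rather than regular expressions; the sketch only shows the overall shape of scoring portions against fields and populating the best match.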
