Method for combining a query and a communication command in a natural language computer system
    41.
    发明授权
    Method for combining a query and a communication command in a natural language computer system 有权
    在自然语言计算机系统中组合查询和通信命令的方法

    公开(公告)号:US09507849B2

    公开(公告)日:2016-11-29

    申请号:US14092966

    申请日:2013-11-28

    Abstract: A method for processing a natural language input to a computerized system. The method parses the input to identify a query portion and a communication portion of the input. The system then determines an answer to the query portion, including identifying communication parameters from the communication portion. Upon determining the answer, the system prepares an answer to the communication and transmits that answer. If the answer requires information from a remote source, the system creates a subsidiary query to obtain that information and then submits the subsidiary query to the remote source. A response to the query is used to compose the answer to the query from the answer to the subsidiary query. If the system concludes that the query portion does not require information from a remote source, analyzing and answering the query locally.

    Abstract translation: 一种用于处理对计算机化系统的自然语言输入的方法。 该方法解析输入以识别输入的查询部分和通信部分。 系统然后确定对查询部分的答案,包括从通信部分识别通信参数。 在确定答案后,系统准备通信的答案并发送答案。 如果答案需要来自远程源的信息,系统将创建一个辅助查询以获取该信息,然后将该子查询提交给远程源。 对查询的响应用于从辅助查询的答案中构成查询的答案。 如果系统断定查询部分不需要来自远程源的信息,则在本地分析和回答查询。

    System and Method for Performing Dual Mode Speech Recognition
    42.
    发明申请
    System and Method for Performing Dual Mode Speech Recognition 有权
    用于执行双模式语音识别的系统和方法

    公开(公告)号:US20160217788A1

    公开(公告)日:2016-07-28

    申请号:US15085944

    申请日:2016-03-30

    Abstract: A system and method is presented for performing dual mode speech recognition, employing a local recognition module on a mobile device and a remote recognition engine on a server device. The system accepts a spoken query from a user, and both the local recognition module and the remote recognition engine perform speech recognition operations on the query, returning a transcription and confidence score, subject to a latency cutoff time. If both sources successfully transcribe the query, then the system accepts the result having the higher confidence score. If only one source succeeds, then that result is accepted. In either case, if the remote recognition engine does succeed in transcribing the query, then a client vocabulary is updated if the remote system result includes information not present in the client vocabulary.

    Abstract translation: 提出了一种用于执行双模式语音识别的系统和方法,在移动设备上使用本地识别模块和在服务器设备上使用远程识别引擎。 该系统接受来自用户的口语查询,并且本地识别模块和远程识别引擎都对查询执行语音识别操作,返回转录和置信度得分,并受到延迟截止时间的限制。 如果两个来源成功地转录查询,则系统接受具有较高置信度得分的结果。 如果只有一个源成功,则该结果被接受。 在任一情况下,如果远程识别引擎确实成功地转录查询,则如果远程系统结果包括客户端词汇中不存在的信息,则更新客户词汇。

    Training a device specific acoustic model

    公开(公告)号:US11830472B2

    公开(公告)日:2023-11-28

    申请号:US17573551

    申请日:2022-01-11

    CPC classification number: G10L15/22 G06F3/167 G10L15/18

    Abstract: Developers can configure custom acoustic models by providing audio files with custom recordings. The custom acoustic model is trained by tuning a baseline model using the audio files. Audio files may contain custom noise to apply to clean speech for training. The custom acoustic model is provided as an alternative to a standard acoustic model. Device developers can select an acoustic model by a user interface. Speech recognition is performed on speech audio using one or more acoustic models. The result can be provided to developers through the user interface, and an error rate can be computed and also provided.

    Conditional responses to application commands in a client-server system

    公开(公告)号:US11250217B1

    公开(公告)日:2022-02-15

    申请号:US16791421

    申请日:2020-02-14

    Abstract: A client device receives a user request (e.g., in natural language form) to execute a command of an application. The client device delegates interpretation of the request to a response-processing server. Using domain knowledge previously provided by a developer of the application, the response-processing server determines the various possible responses that client devices could make in response to the request based on circumstances such as the capabilities of the client devices and the state of the application data. The response-processing server accordingly generates a response package that describes a number of different conditional responses that client devices could have to the request and provides the response package to the client device. The client device selects the appropriate response from the response package based on the circumstances as determined by the client device, executes the command (if possible), and provides the user with some representation of the response.

    System and method for interpreting natural language commands with compound criteria

    公开(公告)号:US11238101B1

    公开(公告)日:2022-02-01

    申请号:US17081996

    申请日:2020-10-27

    Inventor: Keyvan Mohajer

    Abstract: A command-processing server receives a natural language command from a user. The command-processing server has a set of domain command interpreters corresponding to different domains in which commands can be expressed, such as the domain of entertainment, or the domain of travel. Some or all of the domain command interpreters recognize user commands having a verbal prefix, an optional pre-filter, an object, and an optional post-filter; the pre- and post-filters may be compounded expressions involving multiple atomic filters. Different developers may independently specify the domain command interpreters and the sub-structure interpreters on which they are based.

    Modular virtual assistant platform
    46.
    发明授权

    公开(公告)号:US11144731B2

    公开(公告)日:2021-10-12

    申请号:US16128227

    申请日:2018-09-11

    Abstract: A platform provides for developers of applications, such as devices, with natural language interfaces to configure the availability of vertical domain modules in applications. Modules can include grammars for parsing natural language expressions and interfaces to data sources. Third party developers can create modules with pricing models for their usage or access to their data. Device developers can browse or search available modules and test their performance for specific queries. The platform provides for devices users to access the chosen modules as configured by device developers and for charging and payment between users, application developers, and module developers.

    Framework for identifying distinct questions in a composite natural language query

    公开(公告)号:US11138205B1

    公开(公告)日:2021-10-05

    申请号:US16292190

    申请日:2019-03-04

    Abstract: A query-processing server provides natural language services to applications. More specifically, the query-processing server receives and stores domain knowledge information from application developers, the domain knowledge information comprising a linguistic description of the natural language user queries that application developers wish their applications to support. A first portion of the domain knowledge information is applied to transform a natural language query received from an application to an ordered sequence of question elements. A second portion of the domain knowledge information is applied to group the ordered sequence of question elements into a plurality of distinct structured questions posed by the natural language query. The distinct structured questions may then be provided to the application, which may then execute them and obtain the corresponding data referenced by the questions.

    Custom acoustic models
    49.
    发明授权

    公开(公告)号:US11011162B2

    公开(公告)日:2021-05-18

    申请号:US15996393

    申请日:2018-06-01

    Abstract: The technology disclosed relates to performing speech recognition for a plurality of different devices or devices in a plurality of conditions. This includes storing a plurality of acoustic models associated with different devices or device conditions, receiving speech audio including natural language utterances, receiving metadata indicative of a device type or device condition, selecting an acoustic model from the plurality in dependence upon the received metadata, and employing the selected acoustic model to recognize speech from the natural language utterances included in the received speech audio. Each of speech recognition and the storage of acoustic models can be performed locally by devices or on a network-connected server. Also provided is a platform and interface, used by device developers to select, configure, and/or train acoustic models for particular devices and/or conditions.

Patent Agency Ranking