-
Publication number: US11854552B2
Publication date: 2023-12-26
Application number: US17030459
Filing date: 2020-09-24
Applicant: Amazon Technologies, Inc.
IPC: G10L15/26 , H04M3/44 , H04M3/00 , H04M1/2757 , H04M1/27453 , H04M1/27457
CPC classification number: G10L15/26 , H04M1/2757 , H04M1/27453 , H04M3/007 , H04M3/44 , H04M1/27457 , H04M2201/22
Abstract: Techniques for using validated communications identifiers of a user's communications profile to resolve entries in another user's contact list are described. When a user imports a contact list, the contact list may include multiple entries related to the same person. The system may identify one of the entries in the contact list that corresponds to a validated communications identifier stored in another user's communications profile. The system may identify other validated communications identifiers in the other user's communications profile and cross-reference them against the entries of the contact list. If the system determines the contact list includes entries for the different validated communications identifiers of the other user, the system may consolidate the entries into a single entry associated with the other user.
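The consolidation step described in this abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the names `ContactEntry`, `CommunicationsProfile`, and `consolidate` are hypothetical, and the sketch assumes identifiers can be compared as exact strings.

```python
from dataclasses import dataclass, field


@dataclass
class ContactEntry:
    name: str
    identifier: str  # hypothetical: a phone number or email address


@dataclass
class CommunicationsProfile:
    user_id: str
    # Identifiers the other user has validated (assumed to be plain strings).
    validated_identifiers: set = field(default_factory=set)


def consolidate(entries, profiles):
    """Merge contact entries whose identifiers are validated in one profile."""
    # Map every validated identifier to the user who owns it.
    owner = {
        ident: profile.user_id
        for profile in profiles
        for ident in profile.validated_identifiers
    }
    merged = {}  # user_id -> consolidated entry
    result = []
    for entry in entries:
        user_id = owner.get(entry.identifier)
        if user_id is None:
            # No matching profile: keep the entry as it was imported.
            result.append({"name": entry.name, "identifiers": [entry.identifier]})
            continue
        if user_id not in merged:
            # First entry seen for this user becomes the consolidated entry.
            merged[user_id] = {"name": entry.name, "identifiers": [], "user_id": user_id}
            result.append(merged[user_id])
        merged[user_id]["identifiers"].append(entry.identifier)
    return result
```

In this sketch the first matching entry supplies the display name of the consolidated entry; the abstract does not say how a name is chosen, so that choice is an assumption.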
-
Publication number: US11636851B2
Publication date: 2023-04-25
Application number: US17387157
Filing date: 2021-07-28
Applicant: Amazon Technologies, Inc.
Inventor: Munir Mahmood , Leopold Bushkin , Alexander Thomas Loeb , Michael Schwartz , Mohammed Arif , Rongzhou Shen , Vikram Kumar Gundeti , Shemyla Anwar , Yaser Khan , Edward Page Foyle , Bo Li
Abstract: Techniques for a natural language processing (NLP) system to implement more than one assistant are described. The NLP system may receive a natural language input corresponding to more than one user command. The NLP system may respond to a first command, of the natural language input, using a TTS voice of a first NLP system assistant. The NLP system may respond to a second command, of the natural language input, using a TTS voice of a second NLP system assistant.
-
Publication number: US11468889B1
Publication date: 2022-10-11
Application number: US16806516
Filing date: 2020-03-02
Applicant: Amazon Technologies, Inc.
Inventor: Gregory Michael Hart , Peter Paul Henri Carbon , John Daniel Thimsen , Vikram Kumar Gundeti , Scott Ian Blanksteen , Allan Timothy Lindsay , Frederic Johan Georges Deramat
Abstract: A speech recognition platform configured to receive an audio signal that includes speech from a user and perform automatic speech recognition (ASR) on the audio signal to identify ASR results. The platform may identify: (i) a domain of a voice command within the speech based on the ASR results and based on context information associated with the speech or the user, and (ii) an intent of the voice command. In response to identifying the intent, the platform may perform a corresponding action, such as streaming audio to the device, setting a reminder for the user, purchasing an item on behalf of the user, making a reservation for the user or launching an application for the user. The speech recognition platform, in combination with the device, may therefore facilitate efficient interactions between the user and a voice-controlled device.
-
Publication number: US20210377702A1
Publication date: 2021-12-02
Application number: US17345409
Filing date: 2021-06-11
Applicant: Amazon Technologies, Inc.
Abstract: A system that determines that devices are co-located in an acoustic region and selects a single device to which to send incoming notifications for the acoustic region. The system may group devices into separate acoustic regions based on selection data that selects between similar audio data received from multiple devices. The system may select the best device for each acoustic region based on a frequency that the device was selected previously, input/output capabilities of the device, a proximity to a user, or the like. The system may send a notification to a single device in each of the acoustic regions so that a user receives a single notification instead of multiple unsynchronized notifications. The system may also determine that acoustic regions are associated with different locations and select acoustic regions to which to send a notification based on location.
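As a rough illustration of the grouping and selection described in this abstract, the Python sketch below treats each past arbitration event as a list of devices that captured similar audio plus the device that was selected. The function names and the event format are hypothetical, and only selection frequency is used to rank devices; input/output capabilities, user proximity, and location are left out.

```python
from collections import Counter


def group_regions(events):
    """Group devices into regions from (devices_that_heard, selected) events."""
    parent = {}

    def find(device):
        # Union-find lookup with path halving.
        parent.setdefault(device, device)
        while parent[device] != device:
            parent[device] = parent[parent[device]]
            device = parent[device]
        return device

    wins = Counter()
    for heard, selected in events:
        wins[selected] += 1
        # Devices that heard the same utterance are assumed to be co-located.
        for device in heard:
            parent[find(heard[0])] = find(device)

    regions = {}
    for device in parent:
        regions.setdefault(find(device), []).append(device)
    return list(regions.values()), wins


def pick_targets(events):
    """Return one notification target per region: the most often selected device."""
    regions, wins = group_regions(events)
    # Ties are broken by device name so the result is deterministic.
    return sorted(max(region, key=lambda d: (wins[d], d)) for region in regions)
```

A notification would then be sent once to each device returned by `pick_targets`, rather than to every device in the household.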
-
Publication number: US20210090575A1
Publication date: 2021-03-25
Application number: US16580307
Filing date: 2019-09-24
Applicant: Amazon Technologies, Inc.
Inventor: Munir Mahmood , Leopold Bushkin , Alexander Thomas Loeb , Michael Schwartz , Mohammed Arif , Rongzhou Shen , Vikram Kumar Gundeti , Shemyla Anwar , Yaser Khan , Edward Page Foyle , Bo Li
IPC: G10L15/32 , G10L13/047 , G10L15/26 , G06F16/9032 , G10L17/10 , G10L15/18
Abstract: Techniques for a natural language processing (NLP) system to implement more than one assistant during a dialog between one or more users and the NLP system are described. The NLP system may receive a first natural language input and associate same with a dialog identifier. The NLP system may output audio, responsive to the first natural language input, in a first NLP system assistant's voice. Thereafter, the NLP system may receive a second natural language input and associate same with the dialog identifier. The NLP system may output audio, responsive to the second natural language input, in a second NLP system assistant's voice.
-
Publication number: US20200168240A1
Publication date: 2020-05-28
Application number: US16775246
Filing date: 2020-01-28
Applicant: Amazon Technologies, Inc.
Inventor: Fred Torok , Rohan Mutagi , Vikram Kumar Gundeti , Frederic Johan Georges Deramat
IPC: G10L21/06
Abstract: A speech-based system includes a local device in a user premises and a network-based control service that directs the local device to perform actions for a user. The control service may specify a first action that is to be performed upon detection by the local device of a stimulus. In some cases, performing the first action may rely on the availability of network communications with the control service or with another service. In these cases, the control service also specifies a second, fallback action that does not rely upon network communications. Upon detecting the stimulus, the local device performs the first action if network communications are available. If network communications are not available, the local device performs the second, fallback action.
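The primary/fallback behavior in this abstract can be sketched in a few lines of Python. The `Directive` and `LocalDevice` names, and the idea of passing the network check in as a callable, are assumptions made for illustration; the abstract does not specify how availability is detected.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class Directive:
    stimulus: str                # hypothetical stimulus name, e.g. "timer_expired"
    primary: Callable[[], str]   # action that needs network communications
    fallback: Callable[[], str]  # action that works without the network


class LocalDevice:
    def __init__(self, network_available: Callable[[], bool]):
        # The connectivity check is injected so it can be swapped or tested.
        self._network_available = network_available
        self._directives = {}

    def register(self, directive: Directive) -> None:
        """Store a directive received earlier from the control service."""
        self._directives[directive.stimulus] = directive

    def on_stimulus(self, stimulus: str):
        """Run the primary action if the network is up, else the fallback."""
        directive = self._directives.get(stimulus)
        if directive is None:
            return None
        if self._network_available():
            return directive.primary()
        return directive.fallback()
```

Because both actions are registered ahead of time, the device can still respond to the stimulus after connectivity is lost.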
-
Publication number: US20200092687A1
Publication date: 2020-03-19
Application number: US16569779
Filing date: 2019-09-13
Applicant: Amazon Technologies, Inc.
Abstract: A system that determines that devices are co-located in an acoustic region and selects a single device to which to send incoming notifications for the acoustic region. The system may group devices into separate acoustic regions based on selection data that selects between similar audio data received from multiple devices. The system may select the best device for each acoustic region based on a frequency that the device was selected previously, input/output capabilities of the device, a proximity to a user, or the like. The system may send a notification to a single device in each of the acoustic regions so that a user receives a single notification instead of multiple unsynchronized notifications. The system may also determine that acoustic regions are associated with different locations and select acoustic regions to which to send a notification based on location.
-
Publication number: US10055190B2
Publication date: 2018-08-21
Application number: US14107931
Filing date: 2013-12-16
Applicant: Amazon Technologies, Inc.
Inventor: Vikram Kumar Gundeti , Fred Torok , Peter Spalding VanLund , Frederic Johan Georges Deramat
CPC classification number: G06F3/165
Abstract: A speech-based system includes a local device in a user premises and a remote service that uses the local device to conduct speech dialogs with a user. The local device may also be directed to play audio such as music, audio books, etc. When designating audio for playing by the local device, the remote service may specify that the audio is either background audio or foreground audio. For background audio, the service indicates whether the background audio is mixable. For foreground audio, the service indicates an interrupt behavior. When the local device is playing background audio and receives foreground audio, the background audio is paused, attenuated, or not changed based on the indicated interrupt behavior of the foreground audio and whether the background audio has been designated as being mixable.
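The decision described in this abstract depends on two inputs: the interrupt behavior of the incoming foreground audio and whether the playing background audio is mixable. The abstract does not give the exact mapping, so the Python sketch below uses a hypothetical one; the enum members `EXCLUSIVE`, `DUCK`, and `OVERLAY` are invented names, not terms from the patent.

```python
from enum import Enum


class InterruptBehavior(Enum):
    # Hypothetical interrupt behaviors for foreground audio.
    EXCLUSIVE = "exclusive"  # foreground must play alone
    DUCK = "duck"            # foreground prefers quieter background
    OVERLAY = "overlay"      # foreground can play over the background


class BackgroundAction(Enum):
    PAUSE = "pause"
    ATTENUATE = "attenuate"
    NO_CHANGE = "no_change"


def resolve_background(interrupt: InterruptBehavior, mixable: bool) -> BackgroundAction:
    """Decide what happens to background audio when foreground audio arrives."""
    # Assumed rule: non-mixable background can never play under foreground audio.
    if not mixable or interrupt is InterruptBehavior.EXCLUSIVE:
        return BackgroundAction.PAUSE
    if interrupt is InterruptBehavior.DUCK:
        return BackgroundAction.ATTENUATE
    return BackgroundAction.NO_CHANGE
```

Under these assumed rules, music flagged as mixable would be attenuated while a `DUCK` speech prompt plays, whereas a non-mixable audio book would be paused.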
-
Publication number: US20140180697A1
Publication date: 2014-06-26
Application number: US13723026
Filing date: 2012-12-20
Applicant: Amazon Technologies, Inc.
Inventor: Fred Torok , Frédéric Johan Georges Deramat , Vikram Kumar Gundeti
IPC: G10L15/22
CPC classification number: G10L15/26 , G06F17/30684 , G06F17/3074 , G06F17/30746 , G06F17/30778 , G10L15/08 , G10L15/222 , G10L15/30
Abstract: Features are disclosed for generating markers for elements or other portions of an audio presentation so that a speech processing system may determine which portion of the audio presentation a user utterance refers to. For example, an utterance may include a pronoun with no explicit antecedent. The marker may be used to associate the utterance with the corresponding content portion for processing. The markers can be provided to a client device with a text-to-speech (“TTS”) presentation. The markers may then be provided to a speech processing system along with a user utterance captured by the client device. The speech processing system, which may include automatic speech recognition (“ASR”) modules and/or natural language understanding (“NLU”) modules, can generate hints based on the marker. The hints can be provided to the ASR and/or NLU modules in order to aid in processing the meaning or intent of a user utterance.
-
Publication number: US11922925B1
Publication date: 2024-03-05
Application number: US16035977
Filing date: 2018-07-16
Applicant: Amazon Technologies, Inc.
Inventor: Peter Paul Henri Carbon , Vikram Kumar Gundeti , Frederic Johan Georges Deramat , Ajay Gopalakrishnan , John Daniel Thimsen
CPC classification number: G10L15/00 , G10L15/22 , G10L21/06 , G10L15/1815 , G10L2015/223
Abstract: A speech recognition platform configured to receive an audio signal that includes speech from a user and perform automatic speech recognition (ASR) on the audio signal to identify ASR results. The platform may identify: (i) a domain of a voice command within the speech based on the ASR results and based on context information associated with the speech or the user, and (ii) an intent of the voice command. In response to identifying the intent, the platform may perform a corresponding action, such as streaming audio to the device, setting a reminder for the user, purchasing an item on behalf of the user, making a reservation for the user or launching an application for the user. In some instances, the speech recognition platform engages in a back-and-forth dialog with the user in order to properly fulfill the user's request.