INDICATOR FOR VOICE-BASED COMMUNICATIONS
    21.
    发明申请

    公开(公告)号:US20180061403A1

    公开(公告)日:2018-03-01

    申请号:US15254458

    申请日:2016-09-01

    Abstract: Systems, methods, and devices for outputting visual indications regarding voice-based interactions are described. A first speech-controlled device detects spoken audio corresponding to a voice message to a second speech-controlled device. The first device captures the audio and sends audio data corresponding to the captured audio to a server. The server performs speech processing on the audio data to determine a recipient and message content. The server then determines a second speech-controlled device associated with the recipient and sends the message content to the recipient's second speech-controlled device. Thereafter, the server receives an indication from the recipient's speech-controlled device that the second device is detecting speech, presumably in response to the original message. The server then causes a visual indication to be output by the first speech-controlled device, with the visual indication representing the recipient-speech controlled device is detecting speech.

    System for autonomous mobile device assisted communication

    公开(公告)号:US11368497B1

    公开(公告)日:2022-06-21

    申请号:US16134643

    申请日:2018-09-18

    Abstract: An autonomous mobile device (AMD) may be used in an environment as a communication endpoint for voice or video communications. An incoming request for communication may initiate a process in which the AMD finds a user within the environment. Information obtained from sensors onboard the AMD or in the environment may be used to determine the whereabouts of the user. If an existing communication endpoint is not available to the user or cannot support a requested communication modality, the AMD may travel to permitted areas within the environment to find the user, while avoiding areas designated as private. Once found, communication may be established with the user. If the incoming request expires, the AMD may present information indicative of the request to the user.

    COMMUNICATION WITH USER PRESENCE
    24.
    发明申请

    公开(公告)号:US20220059090A1

    公开(公告)日:2022-02-24

    申请号:US17464754

    申请日:2021-09-02

    Abstract: A system and method establishes a communication connection between a first device of a first user and a second device of a second user. Request data corresponding to a request to establish a communication connection with a second user is received, and a user profile associated with the second user is determined. One or more sensors of the second device receive input data corresponding to the environment of the second device, and an identity of the second user is determined based thereon. The communication connection is established and, based on the identity, the second device tracks movement of the second user in the environment.

    Ending communications session based on presence data

    公开(公告)号:US11240331B2

    公开(公告)日:2022-02-01

    申请号:US16458932

    申请日:2019-07-01

    Abstract: Methods and devices for causing a communications session between a first device and a second device to end based on lack of speech activity are described herein. In some embodiments, non-speech may be detected for both the first device and the second device. If the non-speech associated with the first device is determined to occur at a substantially same time as the non-speech associated with the second device, then this may indicate that no individuals are talking within earshot of their respective devices. Furthermore, the non-speech detected by the first device and the non-speech detected by the second device may both be of an amount of time that is greater than a predefined temporal threshold. If so, then the communications session may be caused to end because speech activity has not been detected by either device for more than the predefined temporal threshold.

    Adjusting speed of human speech playback

    公开(公告)号:US11232808B2

    公开(公告)日:2022-01-25

    申请号:US16394717

    申请日:2019-04-25

    Abstract: A system configured to vary a speech speed of speech represented in input audio data without changing a pitch of the speech. The system may vary the speech speed based on a number of different inputs, including non-audio data, data associated with a command, or data associated with the voice message itself. The non-audio data may correspond to information about an account, device or user, such as user preferences, calendar entries, location information, etc. The system may analyze audio data associated with the command to determine command speech speed, identity of person listening, etc. The system may analyze the input audio data to determine a message speech speed, background noise level, identity of the person speaking, etc. Using all of these inputs, the system may dynamically determine a target speech speed and may generate output audio data having the target speech speed.

    ENDING COMMUNICATIONS SESSION BASED ON PRESENCE DATA

    公开(公告)号:US20190325892A1

    公开(公告)日:2019-10-24

    申请号:US16458932

    申请日:2019-07-01

    Abstract: Methods and devices for causing a communications session between a first device and a second device to end based on lack of speech activity are described herein. In some embodiments, non-speech may be detected for both the first device and the second device. If the non-speech associated with the first device is determined to occur at a substantially same time as the non-speech associated with the second device, then this may indicate that no individuals are talking within earshot of their respective devices. Furthermore, the non-speech detected by the first device and the non-speech detected by the second device may both be of an amount of time that is greater than a predefined temporal threshold. If so, then the communications session may be caused to end because speech activity has not been detected by either device for more than the predefined temporal threshold.

    ADJUSTING SPEED OF HUMAN SPEECH PLAYBACK
    28.
    发明申请

    公开(公告)号:US20190318758A1

    公开(公告)日:2019-10-17

    申请号:US16394717

    申请日:2019-04-25

    Abstract: A system configured to vary a speech speed of speech represented in input audio data without changing a pitch of the speech. The system may vary the speech speed based on a number of different inputs, including non-audio data, data associated with a command, or data associated with the voice message itself. The non-audio data may correspond to information about an account, device or user, such as user preferences, calendar entries, location information, etc. The system may analyze audio data associated with the command to determine command speech speed, identity of person listening, etc. The system may analyze the input audio data to determine a message speech speed, background noise level, identity of the person speaking, etc. Using all of these inputs, the system may dynamically determine a target speech speed and may generate output audio data having the target speech speed.

    Ending communications session based on presence data

    公开(公告)号:US10339957B1

    公开(公告)日:2019-07-02

    申请号:US15385265

    申请日:2016-12-20

    Abstract: Methods and devices for causing a communications session between a first device and a second device to end based on lack of speech activity are described herein. In some embodiments, non-speech may be detected for both the first device and the second device. If the non-speech associated with the first device is determined to occur at a substantially same time as the non-speech associated with the second device, then this may indicate that no individuals are talking within earshot of their respective devices. Furthermore, the non-speech detected by the first device and the non-speech detected by the second device may both be of an amount of time that is greater than a predefined temporal threshold. If so, then the communications session may be caused to end because speech activity has not been detected by either device for more than the predefined temporal threshold.

    Message response routing
    30.
    发明授权

    公开(公告)号:US10325599B1

    公开(公告)日:2019-06-18

    申请号:US15392271

    申请日:2016-12-28

    Abstract: Systems and methods for extracting contact information from a message are described. A system can receive a message for a recipient, where the message originates from a message source having a first contact identifier (i.e., phone number, text address, etc.). The system can determine text data associated with the content of that message and process the text data to determine that the message refers to a second contact identifier that is different from the first contact identifier. The system may output the message to a recipient device (such as using text-to-speech, etc.) and may store an association between the message source and the second contact identifier. When the recipient speaks a command to reply to the first message or contact the message source, the system may determine the reply is intended for the message source and may route the reply using the second contact identifier included in the first message.

Patent Agency Ranking