Sensor enhanced speech recognition
    Granted invention patent

    Publication No.: US10083350B2

    Publication date: 2018-09-25

    Application No.: US15868546

    Filing date: 2018-01-11

    Abstract: A system for sensor enhanced speech recognition is disclosed. The system may obtain visual content or other content associated with a user and an environment of the user. Additionally, the system may obtain, from the visual content, metadata associated with the user and the environment of the user. The system may also include determining, based on the visual content and metadata, if the user is speaking. If the user is determined to be speaking, the system may obtain audio content associated with the user and the environment. The system may then adapt, based on the visual content, audio content, and metadata, one or more acoustic models that match the user and the environment. Once the one or more acoustic models are adapted and loaded, the system may enhance a speech recognition process or other process associated with the user.

    Exploiting Visual Information For Enhancing Audio Signals Via Source Separation And Beamforming
    Invention patent application (legal status: in force)

    Publication No.: US20150365759A1

    Publication date: 2015-12-17

    Application No.: US14302110

    Filing date: 2014-06-11

    Abstract: A system for exploiting visual information for enhancing audio signals via source separation and beamforming is disclosed. The system may obtain visual content associated with an environment of a user, and may extract, from the visual content, metadata associated with the environment. The system may determine a location of the user based on the extracted metadata. Additionally, the system may load, based on the location, an audio profile corresponding to the location of the user. The system may also load a user profile of the user that includes audio data associated with the user. Furthermore, the system may cancel, based on the audio profile and user profile, noise from the environment of the user. Moreover, the system may include adjusting, based on the audio profile and user profile, an audio signal generated by the user so as to enhance the audio signal during a communications session of the user.

    EXPLOITING VISUAL INFORMATION FOR ENHANCING AUDIO SIGNALS VIA SOURCE SEPARATION AND BEAMFORMING

    Publication No.: US20220180632A1

    Publication date: 2022-06-09

    Application No.: US17652497

    Filing date: 2022-02-25

    Abstract: A system for exploiting visual information for enhancing audio signals via source separation and beamforming is disclosed. The system may obtain visual content associated with an environment of a user, and may extract, from the visual content, metadata associated with the environment. The system may determine a location of the user based on the extracted metadata. Additionally, the system may load, based on the location, an audio profile corresponding to the location of the user. The system may also load a user profile of the user that includes audio data associated with the user. Furthermore, the system may cancel, based on the audio profile and user profile, noise from the environment of the user. Moreover, the system may include adjusting, based on the audio profile and user profile, an audio signal generated by the user so as to enhance the audio signal during a communications session of the user.

    Pre-distortion system for cancellation of nonlinear distortion in mobile devices

    Publication No.: US11206332B2

    Publication date: 2021-12-21

    Application No.: US16586269

    Filing date: 2019-09-27

    Abstract: A pre-distortion system for improved mobile device communications via cancellation of nonlinear distortion is disclosed. The pre-distortion system may transmit an acoustic signal from a network to a device, wherein the acoustic signal includes a linear signal and a nonlinear cancellation signal that cancels at least a portion of nonlinear distortions created once a loudspeaker in the device emits the linear signal. Thus, when a loudspeaker of a mobile device is operating and nonlinear distortions are generated by the loudspeaker or adjacent components of the mobile device in close proximity to the loudspeaker, the pre-distortion system may create one or more nonlinear cancellation signals in the network. The nonlinear cancellation signal may be combined with the linear signal sent to the loudspeaker to cancel the nonlinear distortion signal created by the loudspeaker emitting acoustic sounds from the linear signal. Thus, the nonlinear cancellation signal becomes a pre-distortion signal.
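
The cancellation idea in this abstract can be illustrated with a minimal sketch. It is not the patented method: it assumes a hypothetical memoryless loudspeaker model with a single cubic distortion term, and the names `A3`, `loudspeaker`, `predistort`, and `max_error` are invented for illustration. The "nonlinear cancellation signal" is the term `-A3 * x**3` added to the linear signal before it reaches the loudspeaker.

```python
import math

# Assumption: the loudspeaker adds a cubic distortion term a3 * u**3.
# Real loudspeakers have memory and frequency-dependent nonlinearity.
A3 = 0.1  # hypothetical distortion coefficient


def loudspeaker(u, a3=A3):
    """Hypothetical loudspeaker: linear output plus cubic distortion."""
    return u + a3 * u ** 3


def predistort(x, a3=A3):
    """Combine the linear signal x with a nonlinear cancellation signal."""
    cancellation = -a3 * x ** 3  # first-order inverse of the cubic term
    return x + cancellation


def max_error(signal, drive):
    """Largest deviation between emitted sound and the intended signal."""
    return max(abs(loudspeaker(d) - s) for s, d in zip(signal, drive))


# Intended (linear) signal: one period of a sine wave at amplitude 0.8.
linear = [0.8 * math.sin(2 * math.pi * n / 64) for n in range(64)]

# Error when the linear signal drives the loudspeaker directly.
plain_error = max_error(linear, linear)

# Error when the pre-distorted signal drives the loudspeaker instead.
predistorted_error = max_error(linear, [predistort(x) for x in linear])

print(f"without pre-distortion: {plain_error:.4f}")
print(f"with pre-distortion:    {predistorted_error:.4f}")
```

Under this toy model the cancellation is only approximate: the cubic term is removed, and a smaller residual of order `A3**2` remains.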

    Exploiting visual information for enhancing audio signals via source separation and beamforming

    Publication No.: US10402651B2

    Publication date: 2019-09-03

    Application No.: US15905442

    Filing date: 2018-02-26

    Abstract: A system for exploiting visual information for enhancing audio signals via source separation and beamforming is disclosed. The system may obtain visual content associated with an environment of a user, and may extract, from the visual content, metadata associated with the environment. The system may determine a location of the user based on the extracted metadata. Additionally, the system may load, based on the location, an audio profile corresponding to the location of the user. The system may also load a user profile of the user that includes audio data associated with the user. Furthermore, the system may cancel, based on the audio profile and user profile, noise from the environment of the user. Moreover, the system may include adjusting, based on the audio profile and user profile, an audio signal generated by the user so as to enhance the audio signal during a communications session of the user.

    SENSOR ENHANCED SPEECH RECOGNITION
    Invention patent application

    Publication No.: US20180137348A1

    Publication date: 2018-05-17

    Application No.: US15868546

    Filing date: 2018-01-11

    Abstract: A system for sensor enhanced speech recognition is disclosed. The system may obtain visual content or other content associated with a user and an environment of the user. Additionally, the system may obtain, from the visual content, metadata associated with the user and the environment of the user. The system may also include determining, based on the visual content and metadata, if the user is speaking. If the user is determined to be speaking, the system may obtain audio content associated with the user and the environment. The system may then adapt, based on the visual content, audio content, and metadata, one or more acoustic models that match the user and the environment. Once the one or more acoustic models are adapted and loaded, the system may enhance a speech recognition process or other process associated with the user.

    Pre-distortion system for cancellation of nonlinear distortion in mobile devices

    Publication No.: US09973633B2

    Publication date: 2018-05-15

    Application No.: US14543261

    Filing date: 2014-11-17

    CPC classification number: H04M9/082 G10L2021/02082

    Abstract: A pre-distortion system for improved mobile device communications via cancellation of nonlinear distortion is disclosed. The pre-distortion system may transmit an acoustic signal from a network to a device, wherein the acoustic signal includes a linear signal and a nonlinear cancellation signal that cancels at least a portion of nonlinear distortions created once a loudspeaker in the device emits the linear signal. Thus, when a loudspeaker of a mobile device is operating and nonlinear distortions are generated by the loudspeaker or adjacent components of the mobile device in close proximity to the loudspeaker, the pre-distortion system may create one or more nonlinear cancellation signals in the network. The nonlinear cancellation signal may be combined with the linear signal sent to the loudspeaker to cancel the nonlinear distortion signal created by the loudspeaker emitting acoustic sounds from the linear signal. Thus, the nonlinear cancellation signal becomes a pre-distortion signal.
