PSEUDOTELEPATHY HEADSET
    Published patent application

    Publication No.: US20240347036A1

    Publication date: 2024-10-17

    Application No.: US18638155

    Filing date: 2024-04-17

    IPC classification: G10L13/04 G06F3/01

    CPC classification: G10L13/04 G06F3/012

    Abstract: A system for enabling conversion of speech pantomimes of a user into synthesized speech includes a headset connected to an artificial intelligence network hosted on a computing device. The headset can include an array of distance measurement devices distributed adjacent facial regions of the user associated with speech. The system can further include a microphone and a speaker. Sensor data captured by the distance measurement devices and audio data captured by the microphone are used to train the artificial intelligence network to correlate speech pantomimes of the user with phonemes. The system can output synthesized speech generated from the phonemes through the speaker.

    SYSTEMS AND METHODS FOR AUTOMATIC GENERATION OF LISTENING TEST AUDIO FROM AUDIOSCRIPT

    Publication No.: US20240312450A1

    Publication date: 2024-09-19

    Application No.: US18606270

    Filing date: 2024-03-15

    Inventor: Xiaoyi ZHANG

    Abstract: A system, computer-readable storage medium, and computer-implemented method for automatically generating listening test audio from an audioscript. The audioscript is managed by sections; each section is parsed and a section-configuration is generated accordingly. All the section-configurations are composed into a configuration of the audioscript (the configuration). The configuration can be transmitted to a client device such that the configuration is viewable and the parameter values in the configuration can be set or reviewed through a graphical user interface. Responsive to receiving the completed configuration, each section-configuration in the configuration can be applied to the corresponding section to generate the audio and/or audio-generation-script of that section. The complete audio and/or audio-generation-script of the audioscript is generated by concatenating the audio and/or audio-generation-script of each section.
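
The per-section flow in this abstract (parse each section, generate a section-configuration, have the parameter values filled in, apply each section-configuration, concatenate the results) can be sketched roughly as follows. This is an illustrative sketch only, not the patent's implementation: the `Speaker: utterance` line format, the `voices` and `pause_after_s` parameters, and the `say`/`pause` script commands are all hypothetical, and the graphical review step is reduced to filling values in a plain dictionary.

```python
def parse_section(section_id, text):
    """Generate a section-configuration with unset parameter values.

    Assumption: each line of a section looks like 'Speaker: utterance'.
    """
    voices = {}
    for line in text.splitlines():
        if ":" in line:
            speaker = line.split(":", 1)[0].strip()
            voices.setdefault(speaker, None)  # value to be set by a reviewer
    return {"section": section_id, "voices": voices, "pause_after_s": None}


def build_configuration(sections):
    """Compose all section-configurations into one configuration."""
    return [parse_section(i, text) for i, text in enumerate(sections)]


def apply_section(config, text):
    """Apply a completed section-configuration to its section.

    Returns a hypothetical audio-generation-script as a list of commands.
    """
    commands = []
    for line in text.splitlines():
        if ":" not in line:
            continue
        speaker, utterance = (part.strip() for part in line.split(":", 1))
        commands.append(f'say voice={config["voices"][speaker]} "{utterance}"')
    commands.append(f'pause {config["pause_after_s"]}')
    return commands


def generate_script(sections, configuration):
    """Concatenate the per-section scripts into the complete script."""
    script = []
    for config, text in zip(configuration, sections):
        script.extend(apply_section(config, text))
    return script
```

Keeping the configuration as plain data, separate from the generation step, mirrors the abstract's point that the configuration can be sent to a client, edited there, and returned before any audio is produced.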

    SYNCHRONIZATION METHOD AND APPARATUS FOR AUDIO AND TEXT, DEVICE, AND MEDIUM

    Publication No.: US20240169972A1

    Publication date: 2024-05-23

    Application No.: US18283433

    Filing date: 2022-02-15

    IPC classification: G10L13/04 G10L21/055

    CPC classification: G10L13/04 G10L21/055

    Abstract: Provided are a synchronization method and apparatus for audio and text, a device, and a medium. The method includes: determining a plurality of first text segments for audio conversion and a second text for reading display, in which the plurality of first text segments and the second text are from an initial text; converting the plurality of first text segments into audio segments, to obtain a first mapping relationship between the first text segments and the audio segments; performing matching on the first text segments and the second text, to obtain a second mapping relationship between the first text segments and second text segments in the second text; and determining the second text segment synchronized with each of the audio segments based on the first mapping relationship and the second mapping relationship.
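
The two mapping relationships in this abstract compose naturally: the first links each first text segment to its audio segment, the second links each first text segment to a segment of the display text, and joining them on the first text segment yields the display segment synchronized with each audio segment. The sketch below illustrates that composition under stated assumptions. It is not the patent's method: `fake_tts` is a hypothetical stand-in for a real text-to-speech step, the display text is assumed to be already split into segments, and matching uses the standard-library `difflib.SequenceMatcher` similarity ratio as one possible matching rule.

```python
import difflib
from dataclasses import dataclass


@dataclass(frozen=True)
class AudioSegment:
    index: int
    duration_s: float


def fake_tts(segment, index):
    # Hypothetical stand-in for text-to-speech: 0.1 s per character.
    return AudioSegment(index=index, duration_s=round(0.1 * len(segment), 2))


def match_segments(first_segments, second_segments):
    """Second mapping: first-segment index -> most similar display-segment index."""
    mapping = {}
    for i, seg in enumerate(first_segments):
        scores = [
            difflib.SequenceMatcher(None, seg.lower(), cand.lower()).ratio()
            for cand in second_segments
        ]
        mapping[i] = max(range(len(second_segments)), key=scores.__getitem__)
    return mapping


def synchronize(first_segments, second_segments):
    """Pair each audio segment with the display segment it is synchronized to."""
    # First mapping: first-segment index -> audio segment.
    first_mapping = {i: fake_tts(seg, i) for i, seg in enumerate(first_segments)}
    second_mapping = match_segments(first_segments, second_segments)
    # Compose the two mappings through the shared first-segment index.
    return [
        (first_mapping[i], second_segments[second_mapping[i]])
        for i in range(len(first_segments))
    ]
```

For example, first text segments normalized for speech such as `"chapter one"` and `"it cost twenty dollars"` can be aligned with display segments `"Chapter 1"` and `"It cost $20."`, even though the two versions differ in wording.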

    Systems and methods to alter voice interactions

    Publication No.: US11984112B2

    Publication date: 2024-05-14

    Application No.: US17244659

    Filing date: 2021-04-29

    Applicant: Rovi Guides, Inc.

    CPC classification: G10L13/027 G10L13/04

    Abstract: Systems and methods are disclosed for providing voice interactions based on user context. Data is received that causes a voice interaction to be generated for output at a user device. In response, current user contextual data of the user device is retrieved. A user availability level for consuming the voice interaction is determined based on the current user contextual data. The voice interaction is altered based on the user availability level. Content of the voice interaction may be altered to be suitable for consumption. The altered voice interaction is outputted at the user device.

    Systems and methods for communicating notifications and textual data associated with applications

    Publication No.: US11954403B1

    Publication date: 2024-04-09

    Application No.: US18162735

    Filing date: 2023-02-01

    Abstract: Embodiments are provided for communicating notifications and other textual data associated with applications installed on an electronic device. According to certain aspects, a user can interface with an input device to send (218) a wake up trigger to the electronic device. The electronic device retrieves (222) application notifications and converts (288) the application notifications to audio data. The electronic device also sends (230) the audio data to an audio output device for annunciation (232). The user may also use the input device to send (242) a request to the electronic device to activate the display screen. The electronic device identifies (248) an application corresponding to an annunciated notification, and activates (254) the display screen and initiates the application.