RECOGNITION OR SYNTHESIS OF HUMAN-UTTERED HARMONIC SOUNDS

    Publication No.: US20240363105A1

    Publication date: 2024-10-31

    Application No.: US18290574

    Filing date: 2022-05-13

    Abstract: Within each harmonic spectrum of a sequence of spectra derived from analysis of a waveform representing human speech are identified two or more fundamental or harmonic components that have frequencies that are separated by integer multiples of a fundamental acoustic frequency. The highest harmonic frequency that is also greater than 410 Hz is a primary cap frequency, which is used to select a primary phonetic note that corresponds to a subset of phonetic chords from a set of phonetic chords for which acoustic spectral data is available. The spectral data can also include frequencies for primary band, secondary band (or secondary note), basal band, or reduced basal band acoustic components, which can be used to select a phonetic chord from the subset of phonetic chords corresponding to the selected primary note.
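The primary cap frequency selection described in this abstract can be illustrated with a minimal sketch. It assumes the spectral peaks and the fundamental frequency have already been estimated. The function names, the tolerance parameter, and the sample peak values are hypothetical and do not come from the patent. Only the 410 Hz floor is taken from the abstract.

```python
from typing import List, Optional, Sequence

CAP_FLOOR_HZ = 410.0  # floor stated in the abstract


def harmonic_components(peaks_hz: Sequence[float], f0_hz: float,
                        tol: float = 0.03) -> List[float]:
    """Keep peaks lying within tol * f0 of an integer multiple of f0.

    The tolerance is an assumption made for this sketch.
    """
    kept = []
    for f in peaks_hz:
        n = round(f / f0_hz)  # nearest harmonic number
        if n >= 1 and abs(f - n * f0_hz) <= tol * f0_hz:
            kept.append(f)
    return sorted(kept)


def primary_cap_frequency(peaks_hz: Sequence[float], f0_hz: float,
                          tol: float = 0.03) -> Optional[float]:
    """Return the highest harmonic if it exceeds 410 Hz, else None."""
    harmonics = harmonic_components(peaks_hz, f0_hz, tol)
    if len(harmonics) < 2:  # the abstract requires two or more components
        return None
    top = harmonics[-1]
    return top if top > CAP_FLOOR_HZ else None


# Hypothetical spectrum: a 120 Hz voice with one non-harmonic peak at 530 Hz.
cap = primary_cap_frequency([120.0, 241.0, 359.0, 480.5, 530.0], f0_hz=120.0)
```

In this sketch the 530 Hz peak is discarded because it is not close to any multiple of 120 Hz, so the cap is taken from the fourth harmonic. Mapping the cap frequency to a phonetic note would require the reference spectral data mentioned in the abstract, which is not reproduced here.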

    INFORMATION PROVIDING METHOD
    Invention publication

    Publication No.: US20240249744A1

    Publication date: 2024-07-25

    Application No.: US18625507

    Filing date: 2024-04-03

    IPC classification: G10L25/78 G10L15/20 G10L15/22

    Abstract: An information providing method includes: generating first information indicating that a friendly gathering is occurring in a home when (i) a threshold amount of time or longer has elapsed from a start time of food preparation by a user and (ii) the volume of sound in a dining space is a first threshold volume or greater; obtaining, from a second information processing apparatus connected to a first information processing apparatus, information indicating first request content over a network; and when content of the first information is included in the first request content, outputting, to the second information processing apparatus, second information including information for identifying the user or the home, using the first information generated.
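The two-part trigger in this abstract, elapsed preparation time and dining-space volume, can be sketched as a single predicate. The function name, the decibel unit, and the threshold values below are assumptions chosen for illustration. The abstract does not specify them.

```python
from datetime import datetime, timedelta


def gathering_detected(prep_start: datetime, now: datetime,
                       dining_volume_db: float,
                       min_elapsed: timedelta,
                       min_volume_db: float) -> bool:
    """True when both conditions (i) and (ii) of the abstract hold."""
    elapsed_ok = (now - prep_start) >= min_elapsed   # condition (i)
    volume_ok = dining_volume_db >= min_volume_db    # condition (ii)
    return elapsed_ok and volume_ok


# Hypothetical readings: cooking started at 17:00, checked at 18:00.
start = datetime(2024, 1, 1, 17, 0)
check = datetime(2024, 1, 1, 18, 0)
loud = gathering_detected(start, check, 65.0, timedelta(minutes=45), 60.0)
quiet = gathering_detected(start, check, 50.0, timedelta(minutes=45), 60.0)
```

Both thresholds use "or greater" comparisons to match the wording of the abstract. The later steps, matching the first information against request content and outputting the second information, depend on message formats the abstract does not describe and are left out.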

    Voice processing method, electronic device, and storage medium

    Publication No.: US12014730B2

    Publication date: 2024-06-18

    Application No.: US17322238

    Filing date: 2021-05-17

    Inventor: Xiangyan Xu

    IPC classification: G10L15/20 G10L15/02

    Abstract: A voice processing method includes: collecting a voice signal by a microphone of an electronic device, and signal-processing the collected voice signal to obtain a first voice frame segment; performing voice recognition on the first voice frame segment to obtain a first recognition result; in response to the first recognition result not matching a target content and a plurality of tokens in the first recognition result meeting a preset condition, performing frame compensation on the first voice frame segment to obtain a second voice frame segment; and performing voice recognition on the second voice frame segment to obtain a second recognition result. A matching degree between the second recognition result and the target content is greater than a matching degree between the first recognition result and the target content.
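The control flow in this abstract, recognize once and retry after frame compensation only when the first result misses the target and its tokens meet a preset condition, can be sketched as follows. The abstract does not define the recognizer, the preset condition, or the compensation step, so all three are passed in as callables. Every name and the toy stand-ins are hypothetical.

```python
from typing import Callable, List


def recognize_with_compensation(
    frames: List[str],
    recognize: Callable[[List[str]], str],
    target: str,
    tokens_meet_condition: Callable[[str], bool],
    compensate: Callable[[List[str]], List[str]],
) -> str:
    """Return the first result, or a second result after compensation."""
    first = recognize(frames)
    # Retry only if the result misses the target AND the preset condition holds.
    if first == target or not tokens_meet_condition(first):
        return first
    second_frames = compensate(frames)  # second voice frame segment
    return recognize(second_frames)


# Toy stand-ins: each "frame" is already a text fragment.
TARGET = "hello world"


def toy_recognize(frames: List[str]) -> str:
    return "".join(frames)


def toy_condition(text: str) -> bool:
    # Assumed condition: the result looks like the target with a clipped onset.
    return len(text) > 0 and TARGET.endswith(text)


def toy_compensate(frames: List[str]) -> List[str]:
    # Assumed compensation: prepend the frame lost at the start.
    return ["hel"] + frames


result = recognize_with_compensation(
    ["lo ", "world"], toy_recognize, TARGET, toy_condition, toy_compensate
)
```

Passing the recognizer and the compensation step as parameters keeps the sketch independent of any particular speech engine. A real system would operate on audio frames rather than text fragments.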