Low delay voice processing system
    41.
    发明授权

    公开(公告)号:US11475891B2

    公开(公告)日:2022-10-18

    申请号:US17077943

    申请日:2020-10-22

    Abstract: Disclosed is a speech processing method. The speech processing method controls activation timing of a microphone based on a response pattern of the microphone from a user in order to implement a natural conversation. The speech processing device and the NLP system of the present disclosure may be associated with an artificial intelligence module, a drone (or unmanned aerial vehicle (UAV)), a robot, an augmented reality (AR) device, a virtual reality (VR) device, a device related to 5G service, etc.

    Air conditioner with an artificial intelligence

    公开(公告)号:US11448412B2

    公开(公告)日:2022-09-20

    申请号:US16492714

    申请日:2019-04-02

    Inventor: Jonghoon Chae

    Abstract: Discussed is an air conditioner disposed in an indoor space. The air conditioner includes a sensor, a memory configured to store a plurality of learning results respectively corresponding to a plurality of members, and a processor configured to identify at least one member that is present in the indoor space among the plurality of members by using data acquired by the sensor, control operation of at least one of a compressor, a fan motor, or a vane motor based on the learning result corresponding to an identified member so as to adjust a set value including at least one of a set temperature, an air volume, or a wind direction, and update the learning results by using feedback on the adjusted set value.

    Artificial intelligence device and method for recognizing speech with multiple languages

    公开(公告)号:US11270700B2

    公开(公告)日:2022-03-08

    申请号:US16799727

    申请日:2020-02-24

    Abstract: An artificial intelligence device includes a microphone configured to acquire speech including a plurality of languages, and a processor configured to generate, from the speech, text data corresponding to the speech, generate a plurality of pieces of separated data acquired by separating the text data for each language, perform natural language understanding processing corresponding to a language of each of the plurality of pieces of separated data to generate a natural language understanding processing result for each of the plurality of pieces of separated data, acquire command information about a command to be instructed by the speech and slot information about an entity subjected to the command, based on the natural language understanding processing result, perform an operation corresponding to the speech based on the command information and the slot information, and generate a response based on a result of performing the operation.

    Artificial intelligence apparatus and method for predicting performance of voice recognition model in user environment

    公开(公告)号:US11211045B2

    公开(公告)日:2021-12-28

    申请号:US16486028

    申请日:2019-05-29

    Inventor: Jonghoon Chae

    Abstract: Provided is an artificial intelligence apparatus for predicting a performance of a voice recognition model in a user environment including: a memory configured to store a performance prediction model; and a processor configured to: obtain first controlled environment data including first controlled environment factors corresponding to a first controlled voice recognition environment and a first controlled voice recognition performance of a target voice recognition model in the first controlled voice recognition environment; obtain first user environment factors corresponding to a first user environment, in which the performance is to be predicted; predict, using the performance prediction model, a first user voice recognition performance of the target voice recognition model in the first user voice recognition environment from the obtained first controlled environment data and the first user environment factors; and output the predicted first user voice recognition performance.

    Artificial intelligence (AI)-based voice sampling apparatus and method for providing speech style in heterogeneous label

    公开(公告)号:US11056096B2

    公开(公告)日:2021-07-06

    申请号:US16566265

    申请日:2019-09-10

    Inventor: Jonghoon Chae

    Abstract: Disclosed is an artificial intelligence (AI)-based voice sampling apparatus for providing a speech style in a heterogeneous label, including a rhyme encoder configured to receive a user's voice, extract a voice sample, and analyze a vocal feature included in the voice sample, a text encoder configured to receive text for reflecting the vocal feature, a processor configured to classify the voice sample input to the rhythm encoder into a label according to the vocal feature, provide a weight by measuring a distance between a voice sample corresponding to the label and a voice sample corresponding to a heterogeneous label as a label other than the label and provide a weight by measuring similarity between the label and the heterogeneous label, extract an embedding vector representing the vocal feature, generate a speech style from the embedding vector, and apply the generated speech style to the text, and a rhyme decoder configured to output synthesized voice data in which the speech style is applied to the text by the processor.

Patent Agency Ranking