Abstract:
The present specification relates to a smart controlling device capable of utilizing machine learning for voice recognition and a method of controlling the same. The smart controlling device according to the present invention includes a receiver configured to receive an input including a command trigger, and a controller configured to detect one or more external display devices, select a display device from among the detected one or more external display devices, cause a power status of the selected display device to be changed to a first state, and cause response data corresponding to first command data received after the command trigger to be output on a display of the selected display device.
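The controller flow described in this abstract can be sketched as follows. This is a minimal illustration only: the `Display` class, the trigger word "hello", the nearest-display selection rule, and the reading of "first state" as powered on are all assumptions that the abstract does not state.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Display:
    """Hypothetical stand-in for a detected external display device."""
    name: str
    distance_m: float          # assumed selection criterion, not from the abstract
    powered_on: bool = False
    shown: List[str] = field(default_factory=list)


def handle_input(text: str, displays: List[Display],
                 trigger: str = "hello") -> Optional[Display]:
    """Return the display that received the response, or None if nothing was done."""
    words = text.strip().split(maxsplit=1)
    if not words or words[0].lower() != trigger:
        return None                      # input does not start with the command trigger
    command = words[1] if len(words) > 1 else ""
    if not displays or not command:
        return None                      # nothing detected, or no command after the trigger
    # Assumption: pick the nearest detected display.
    selected = min(displays, key=lambda d: d.distance_m)
    selected.powered_on = True           # "first state" is assumed to mean powered on
    selected.shown.append(f"response to: {command}")
    return selected
```

A real device would replace the string matching with voice recognition and the `shown` list with an actual display output path.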
Abstract:
The present invention relates to a method and apparatus for processing a voice signal, and the voice signal encoding method according to the present invention comprises the steps of: generating transform coefficients of sine wave components forming an input voice signal by transforming the sine wave components; determining transform coefficients to be encoded from the generated transform coefficients; and transmitting indication information indicating the determined transform coefficients, wherein the indication information may include position information, magnitude information, and sign information of the transform coefficients.
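The indication information described above (position, magnitude, sign) can be illustrated with a small sketch. It assumes the transform coefficients have already been computed, and it assumes a "keep the k largest magnitudes" selection rule; the abstract does not say how the coefficients to be encoded are determined, and the function names are hypothetical.

```python
from typing import List, Tuple

# One entry of indication information: (position, magnitude, sign).
Indication = Tuple[int, float, int]


def encode(coeffs: List[float], k: int) -> List[Indication]:
    """Select k coefficients and describe each by position, magnitude and sign."""
    # Assumed selection rule: the k coefficients with the largest magnitude.
    by_magnitude = sorted(range(len(coeffs)),
                          key=lambda i: abs(coeffs[i]), reverse=True)[:k]
    return [(i, abs(coeffs[i]), 1 if coeffs[i] >= 0 else -1)
            for i in sorted(by_magnitude)]


def decode(indications: List[Indication], length: int) -> List[float]:
    """Rebuild a coefficient vector; coefficients that were not encoded become zero."""
    out = [0.0] * length
    for position, magnitude, sign in indications:
        out[position] = sign * magnitude
    return out
```

For example, `encode([0.1, -2.0, 0.5, 3.0], 2)` keeps the coefficients at positions 1 and 3, and `decode` places them back with the other positions set to zero.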
Abstract:
The present invention relates to a beverage supply apparatus comprising a sensing unit, a microphone, an artificial intelligence unit and a control unit. The control unit: activates the microphone when an object is detected; recognizes first sound data of a user when the first sound data is sensed via the activated microphone; acquires information associated with the user on the basis of the first sound data; stores the acquired information associated with the user; and determines a menu corresponding to the object on the basis of the information associated with the user and the first sound data.
Abstract:
The present invention relates to a frame loss recovering method, an audio decoding method, and an apparatus using the method. A method of recovering a frame loss of an audio signal according to the present invention includes: grouping transform coefficients of at least one frame, among previous frames of a current frame, into a predetermined number of bands; deriving an attenuation constant according to a tonality of the bands; and recovering transform coefficients of the current frame by applying the attenuation constant to the previous frame of the current frame.
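The three steps above (band grouping, a tonality-dependent attenuation constant, and recovery from the previous frame) can be sketched as follows. The tonality measure (one minus spectral flatness), the linear mapping to an attenuation constant, and the bounds `lo` and `hi` are assumptions chosen for illustration; the abstract does not define them.

```python
import math
from typing import List


def band_tonality(band: List[float]) -> float:
    """Assumed tonality measure: 1 - spectral flatness, close to 1 for peaky bands."""
    power = [c * c + 1e-12 for c in band]          # small floor avoids log(0)
    geometric = math.exp(sum(math.log(p) for p in power) / len(power))
    arithmetic = sum(power) / len(power)
    return 1.0 - geometric / arithmetic


def conceal(prev: List[float], num_bands: int,
            lo: float = 0.5, hi: float = 0.95) -> List[float]:
    """Estimate the lost frame's coefficients from the previous frame, band by band."""
    if not prev:
        return []
    size = math.ceil(len(prev) / num_bands)        # coefficients per band
    recovered: List[float] = []
    for start in range(0, len(prev), size):
        band = prev[start:start + size]
        # Assumption: tonal bands are attenuated less than noise-like bands.
        attenuation = lo + (hi - lo) * band_tonality(band)
        recovered.extend(attenuation * c for c in band)
    return recovered
```

With these assumed bounds, a flat band is scaled by about 0.5 and a band dominated by a single peak by about 0.95.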
Abstract:
This specification relates to a method for controlling an artificial intelligence system which performs multilingual processing based on artificial intelligence technology. The method includes: receiving voice information through a microphone; determining a language of the voice information based on a preset reference; selecting a specific voice recognition server, based on a result of the determination, from a plurality of voice recognition servers which process different languages; and transmitting the voice information to the selected voice recognition server.
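The routing step in this method can be sketched as a lookup from detected language to recognition server. Everything named here is hypothetical: `detect_language` stands in for the "preset reference", each server is modeled as a plain callable rather than a network endpoint, and the fallback to a default language is an added assumption.

```python
from typing import Callable, Dict

# A recognition server is modeled as a callable taking audio bytes, returning text.
Recognizer = Callable[[bytes], str]


def route(voice: bytes,
          detect_language: Callable[[bytes], str],
          servers: Dict[str, Recognizer],
          default: str = "en") -> str:
    """Determine the language, pick the matching server, and send the voice data to it."""
    language = detect_language(voice)
    # Assumption: fall back to a default server for unsupported languages.
    server = servers.get(language, servers[default])
    return server(voice)
```

In a deployment, each callable would wrap a request to a language-specific server, and `detect_language` would be an actual language identification model.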
Abstract:
An electronic device including a plurality of microphones and an operating method thereof are disclosed. The present invention includes obtaining audio according to a touch input touching at least one of the plurality of microphones, determining at least one selected from the group consisting of a location, a touch pattern, a touch strength, a touch duration time, and a touch periodicity of the touch input based on the obtained audio, and performing an operation corresponding to the touch input based on a result of the determination.
Abstract:
The present invention relates to a method of managing a jitter buffer and a jitter buffer using the same. The method of managing a jitter buffer includes the steps of: receiving audio information frames; and adjusting a jitter buffer on the basis of the received audio information frames, wherein the step of adjusting the jitter buffer includes compensation of an audio signal, and the compensation of the audio signal can be performed for each subframe of the audio information frames.
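Subframe-level compensation in a jitter buffer can be sketched as below. This is only one possible reading: the watermark thresholds `low` and `high`, and the choice to compensate by repeating or dropping the last subframe, are assumptions not given in the abstract.

```python
from collections import deque
from typing import Deque, List

Frame = List[List[float]]   # a frame is modeled as a list of subframes


class JitterBuffer:
    """Toy jitter buffer that compensates playout one subframe at a time."""

    def __init__(self, low: int = 2, high: int = 4) -> None:
        self.low, self.high = low, high      # assumed depth watermarks, in frames
        self.frames: Deque[Frame] = deque()

    def push(self, frame: Frame) -> None:
        """Store a received audio information frame."""
        self.frames.append(frame)

    def pop(self) -> Frame:
        """Return the next frame, stretched or shortened by one subframe if needed."""
        frame = self.frames.popleft()
        depth = len(self.frames)
        if depth < self.low:
            # Buffer running low: lengthen playout by repeating the last subframe.
            return frame + [frame[-1]]
        if depth > self.high and len(frame) > 1:
            # Buffer too full: shorten playout by dropping the last subframe.
            return frame[:-1]
        return frame
```

Adjusting one subframe at a time changes playout length in smaller steps than inserting or discarding whole frames, which is the motivation for per-subframe compensation in this sketch.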