Remote control method and apparatus for an imaging apparatus

    Publication number: US11437034B2

    Publication date: 2022-09-06

    Application number: US16806842

    Application date: 2020-03-02

    Inventor: Kwangyong Lee

    Abstract: Disclosed are a method and apparatus for remotely controlling an imaging apparatus. A method of controlling a remote control apparatus includes converting a spoken utterance of a user into an utterance text or receiving the utterance text, applying a generative model-based first learning model to the utterance text and generating an image having attributes corresponding to a context of the utterance text, and externally transmitting the image and the utterance text. In addition, a method of controlling an imaging apparatus includes receiving a first input including text or speech data and a second input including a first image, capturing at least one second image based on the first input, comparing the first image and the second image, and transmitting the second image in response to a comparison result of the first image and the second image.

    Device with convolutional neural network for acquiring multiple intent words, and method thereof

    Publication number: US11423884B2

    Publication date: 2022-08-23

    Application number: US16845563

    Application date: 2020-04-10

    Inventor: Kwangyong Lee

    Abstract: The present disclosure relates to a convolutional-neural-network structure for acquiring intent words, and a speech recognition device and method using the network. The method includes: receiving input data generated from speech; performing convolution on the input data and N3 filters each having N2 channels, and acquiring a feature map having N4 pieces of data for each channel; applying max pooling to the N4 pieces of data to acquire a representative value, and acquiring a feature map having N2 pieces of data for each filter; performing concatenation on the feature maps for the respective filters, and acquiring one feature map of an N3×N2 matrix; performing convolution on the feature map of the N3×N2 matrix and a filter of a 1×N3 matrix, and acquiring a feature map of a 1×N2 matrix; and inputting the feature map of the 1×N2 matrix into an artificial neural network, and acquiring at least one intent word.
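
The shape bookkeeping in this abstract (N2 channels, N3 filters, N4 values per channel, then an N3×N2 map reduced to 1×N2) may be easier to follow as a small NumPy sketch. This is an illustration under stated assumptions, not the patented implementation: the input layout (T frames by N2 channels), the filter width k, the per-channel (depthwise) form of the convolution, and the sigmoid output layer are all assumptions, and the function names are hypothetical.

```python
import numpy as np

def intent_feature_map(x, filters, mix):
    """Sketch of the abstract's steps up to the 1xN2 feature map.

    x:       (T, N2)      input generated from speech (assumed: T frames, N2 channels)
    filters: (N3, k, N2)  N3 filters, each with N2 channels and an assumed width k
    mix:     (1, N3)      the 1xN3 filter applied to the concatenated map
    """
    T, n2 = x.shape
    n3, k, _ = filters.shape
    n4 = T - k + 1  # values per channel after a "valid" sliding window
    # Sliding windows over time: (N4, k, N2)
    windows = np.stack([x[i:i + k] for i in range(n4)])
    # Per-channel convolution (cross-correlation, as is usual in CNNs): (N3, N4, N2)
    fmap = np.einsum("nkc,fkc->fnc", windows, filters)
    # Max pooling over the N4 values gives one representative value per channel,
    # and stacking the filters gives the N3xN2 map.
    pooled = fmap.max(axis=1)
    # The 1xN3 filter collapses the filter axis: (1, N3) @ (N3, N2) -> (1, N2)
    return mix @ pooled

def intent_scores(feature, weights, bias):
    """Assumed output layer: one dense layer with an independent sigmoid per
    intent word, so that several intent words can score highly at once."""
    logits = feature @ weights + bias
    return 1.0 / (1.0 + np.exp(-logits))

# Usage with arbitrary sizes: T=10, N2=4, N3=3, k=2, five candidate intent words.
rng = np.random.default_rng(0)
feature = intent_feature_map(
    rng.normal(size=(10, 4)), rng.normal(size=(3, 2, 4)), rng.normal(size=(1, 3))
)
scores = intent_scores(feature, rng.normal(size=(4, 5)), np.zeros(5))
print(feature.shape, scores.shape)  # (1, 4) (1, 5)
```

Using independent sigmoids rather than a softmax is one way to allow more than one intent word per utterance; the abstract does not say which output layer is used.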

    Enhancing performance of local device

    Publication number: US11567494B2

    Publication date: 2023-01-31

    Application number: US16847213

    Application date: 2020-04-13

    Inventor: Kwangyong Lee

    Abstract: A method for improving performance of a local device based on guide data from a remote device, according to one embodiment of the present disclosure, includes transmitting, to the remote device, first image data generated by the local device at a first time point, receiving guide data related to the first image data from the remote device, and registering, by a processor, the guide data to second image data generated by the local device at a second time point, based on first spatial information on the first image data, wherein the second time point is a time point that is after the first time point. A trained model for object recognition according to the present disclosure may include a deep neural network generated through machine learning, and the transmitting of the guide data may be performed in an Internet of Things (IoT) environment using a 5G network.

    Artificial intelligence apparatus for performing speech recognition and method thereof

    Publication number: US11289074B2

    Publication date: 2022-03-29

    Application number: US16653569

    Application date: 2019-10-15

    Inventor: Kwangyong Lee

    Abstract: Disclosed herein is an artificial intelligence apparatus for performing speech recognition including a microphone configured to receive a speech command of a user, a learning processor configured to determine anaphora included in text data corresponding to the speech command using an anaphora recognition model for determining anaphora included in predetermined text data, and a processor configured to specify an object referred to by the determined anaphora based on context information including information input to or output from the artificial intelligence apparatus, determine a response to the speech command based on the specified object, and control the artificial intelligence apparatus according to the determined response.

    Voice processing method based on artificial intelligence

    Publication number: US11790893B2

    Publication date: 2023-10-17

    Application number: US17039169

    Application date: 2020-09-30

    CPC classification number: G10L15/16 H04W72/1268 H04W72/23

    Abstract: A voice processing method is disclosed. The voice processing method applies first and second sentence vectors extracted from first and second utterances, that are included in one dialog group and are separated from each other, to a learning model and generates an output from which at least one word having an overlapping meaning is removed. The voice processing method can be associated with an artificial intelligence module, an unmanned aerial vehicle (UAV), a robot, an augmented reality (AR) device, a virtual reality (VR) device, devices related to 5G services, and the like.

    Artificial intelligence apparatus and method for recognizing object included in image data

    Publication number: US11200467B2

    Publication date: 2021-12-14

    Application number: US16743862

    Application date: 2020-01-15

    Inventor: Kwangyong Lee

    Abstract: An artificial intelligence apparatus for recognizing an object included in image data can include a camera, a communication modem, a memory configured to store an image recognition model, a natural language processing (NLP) model, and an NLP model-based image recognition model learned based on the NLP model, and a processor configured to receive image data from the camera or the communication modem, in response to recognizing an object included in the image data using the image recognition model, generate first recognition information on the object included in the image data, and in response to the recognizing of the object included in the image data using the image recognition model being unsuccessful, generate second recognition information on the object included in the image data based on recognizing the object using the NLP model-based image recognition model.
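
The two-stage control flow described in this abstract can be sketched as a simple fallback. This is only an illustration of the branching: the recognizers are passed in as plain callables, the convention that `None` means "recognition unsuccessful" is an assumption, and every name here is hypothetical rather than taken from the patent.

```python
from typing import Callable, Optional, Tuple

# Hypothetical recognizer type: takes raw image bytes and returns a label,
# or None when recognition is unsuccessful (an assumed convention).
Recognizer = Callable[[bytes], Optional[str]]

def recognize_with_fallback(
    image: bytes,
    image_model: Recognizer,
    nlp_based_model: Recognizer,
) -> Tuple[Optional[str], str]:
    """Return (label, source), where source records which model answered."""
    label = image_model(image)
    if label is not None:
        # The image recognition model succeeded: first recognition information.
        return label, "first"
    # Otherwise use the NLP model-based image recognition model:
    # second recognition information.
    return nlp_based_model(image), "second"

# Usage with stand-in recognizers.
known = {b"cat-pixels": "cat"}
result = recognize_with_fallback(
    b"okapi-pixels",
    image_model=known.get,                    # fails on unseen input
    nlp_based_model=lambda img: "okapi",      # stand-in for the fallback model
)
print(result)  # ('okapi', 'second')
```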
