PHONEME-BASED NATURAL LANGUAGE PROCESSING

    Publication No.: US20210183392A1

    Publication Date: 2021-06-17

    Application No.: US17028361

    Application Date: 2020-09-22

    Abstract: A natural language processing method and apparatus are disclosed. A natural language processing method according to an embodiment of the present disclosure includes extracting a phoneme string from a text corpus labeled with recognition information including at least one of a named entity (NE) or a speech intention, generating a phoneme-based training data set by labeling the recognition information in the extracted phoneme string, and generating an artificial neural network-based learning model (LM) using the generated training data set. The natural language processing method of the present disclosure may be associated with an artificial intelligence module, a drone (unmanned aerial vehicle, UAV), a robot, an augmented reality (AR) device, a virtual reality (VR) device, a device associated with 5G services, etc.
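
The labeling step described in this abstract can be sketched as follows. This is only an illustration under stated assumptions: the toy pronunciation lexicon, the BIO tag scheme, and the function name `to_phoneme_example` are hypothetical and do not come from the patent.

```python
# Hypothetical toy lexicon; a real system would use a grapheme-to-phoneme model.
LEXICON = {
    "play": ["P", "L", "EY"],
    "jazz": ["JH", "AE", "Z"],
    "music": ["M", "Y", "UW", "Z", "IH", "K"],
}

def to_phoneme_example(words, word_tags, intent):
    """Expand word-level BIO entity tags into phoneme-level tags."""
    phonemes, tags = [], []
    for word, tag in zip(words, word_tags):
        for i, ph in enumerate(LEXICON[word.lower()]):
            phonemes.append(ph)
            # Only the first phoneme of a "B-" word keeps the "B-" prefix.
            if tag.startswith("B-") and i > 0:
                tags.append("I-" + tag[2:])
            else:
                tags.append(tag)
    return {"phonemes": phonemes, "tags": tags, "intent": intent}

# One labeled sentence becomes one phoneme-based training example.
example = to_phoneme_example(
    ["play", "jazz", "music"], ["O", "B-GENRE", "O"], "play_music"
)
```

Each resulting example pairs a phoneme sequence with per-phoneme entity tags and a sentence-level intent, which is the kind of data set a sequence model could then be trained on.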

    VOICE PROCESSING METHOD BASED ON ARTIFICIAL INTELLIGENCE

    Publication No.: US20210158802A1

    Publication Date: 2021-05-27

    Application No.: US17039169

    Application Date: 2020-09-30

    Abstract: A voice processing method is disclosed. The voice processing method applies first and second sentence vectors, extracted from first and second utterances that are included in one dialog group and are separated from each other, to a learning model and generates an output from which at least one word having an overlapping meaning is removed. The voice processing method can be associated with an artificial intelligence module, an unmanned aerial vehicle (UAV), a robot, an augmented reality (AR) device, a virtual reality (VR) device, devices related to 5G services, and the like.

    ARTIFICIAL INTELLIGENCE APPARATUS AND METHOD FOR RECOGNIZING OBJECT INCLUDED IN IMAGE DATA

    Publication No.: US20210142127A1

    Publication Date: 2021-05-13

    Application No.: US16743862

    Application Date: 2020-01-15

    Inventor: Kwangyong LEE

    Abstract: An artificial intelligence apparatus for recognizing an object included in image data can include a camera, a communication modem, a memory configured to store an image recognition model, a natural language processing (NLP) model, and an NLP model-based image recognition model trained based on the NLP model, and a processor configured to receive image data from the camera or the communication modem; in response to recognizing an object included in the image data using the image recognition model, generate first recognition information on the object; and, in response to recognition of the object using the image recognition model being unsuccessful, generate second recognition information on the object based on recognizing the object using the NLP model-based image recognition model.
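
The two-stage control flow in this abstract can be sketched as a simple fallback. The `Recognizer` type, the stub models, and the convention that a failed recognition returns `None` are assumptions made for illustration; the patent does not specify an interface.

```python
from typing import Callable, Optional

# Hypothetical interface: a recognizer returns a label, or None on failure.
Recognizer = Callable[[bytes], Optional[str]]

def recognize(image: bytes, primary: Recognizer, fallback: Recognizer) -> dict:
    """Try the image recognition model first; fall back to the NLP-based model."""
    label = primary(image)
    if label is not None:
        return {"source": "first", "label": label}
    # Primary recognition was unsuccessful, so use the NLP model-based recognizer.
    return {"source": "second", "label": fallback(image)}

def primary_stub(image: bytes) -> Optional[str]:
    return None  # simulates an unsuccessful recognition

def fallback_stub(image: bytes) -> Optional[str]:
    return "teapot"

result = recognize(b"raw-image-bytes", primary_stub, fallback_stub)
```

Here `result` carries the second recognition information because the stubbed primary model fails.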

    TRAINING ARTIFICIAL NEURAL NETWORK MODEL BASED ON GENERATIVE ADVERSARIAL NETWORK

    Publication No.: US20210125075A1

    Publication Date: 2021-04-29

    Application No.: US17029256

    Application Date: 2020-09-23

    Inventor: Kwangyong LEE

    Abstract: Provided is a method of training an artificial neural network model based on a generative adversarial network (GAN). In a method of training a classification model based on a GAN, a classification model capable of deducing an inference result of unknown and/or rejection can be generated by differently generating and training in-domain data and out-of-domain data in time series using a generative model. An intelligent device according to the present disclosure may be associated with an artificial intelligence module, a drone (unmanned aerial vehicle (UAV)), a robot, an augmented reality (AR) device, a virtual reality (VR) device, and 5G service-related devices.

    REMOTE CONTROL METHOD AND APPARATUS FOR AN IMAGING APPARATUS

    Publication No.: US20210158815A1

    Publication Date: 2021-05-27

    Application No.: US16806842

    Application Date: 2020-03-02

    Inventor: Kwangyong LEE

    Abstract: Disclosed are a method and apparatus for remotely controlling an imaging apparatus. A method of controlling a remote control apparatus includes converting a spoken utterance of a user into an utterance text or receiving the utterance text, applying a generative model-based first learning model to the utterance text and generating an image having attributes corresponding to a context of the utterance text, and externally transmitting the image and the utterance text. In addition, a method of controlling an imaging apparatus includes receiving a first input including text or speech data and a second input including a first image, capturing at least one second image based on the first input, comparing the first image and the second image, and transmitting the second image in response to a comparison result of the first image and the second image.

    DEVICE WITH CONVOLUTIONAL NEURAL NETWORK FOR ACQUIRING MULTIPLE INTENT WORDS, AND METHOD THEREOF

    Publication No.: US20210134274A1

    Publication Date: 2021-05-06

    Application No.: US16845563

    Application Date: 2020-04-10

    Inventor: Kwangyong LEE

    Abstract: The present disclosure relates to a convolutional neural network structure for acquiring intent words, and a speech recognition device and method using the network. The method includes: receiving input data generated from speech; performing convolution on the input data and N3 filters each having N2 channels, and acquiring a feature map having N4 pieces of data for each channel; applying max pooling to the N4 pieces of data to acquire a representative value, and acquiring a feature map having N2 pieces of data for each filter; performing concatenation on the feature maps for the respective filters, and acquiring one feature map of an N3×N2 matrix; performing convolution on the feature map of the N3×N2 matrix and a filter of a 1×N3 matrix, and acquiring a feature map of a 1×N2 matrix; and inputting the feature map of the 1×N2 matrix into an artificial neural network, and acquiring at least one intent word.
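
The shape bookkeeping in this abstract can be traced with a small pure-Python sketch. It rests on one reading of the text that the abstract does not confirm: each filter channel is convolved with the matching input channel (so N4 = L - K + 1 for input length L and kernel width K), and the 1×N3 filter acts as a weighted sum across the N3 filter rows. All function names and sizes are illustrative.

```python
import random

def conv1d_valid(signal, kernel):
    """Valid 1-D cross-correlation: len(signal) - len(kernel) + 1 outputs."""
    k = len(kernel)
    return [sum(signal[i + j] * kernel[j] for j in range(k))
            for i in range(len(signal) - k + 1)]

def intent_features(x, filters, mix):
    """x: N2 channels of length L; filters: N3 x N2 x K; mix: the 1 x N3 filter."""
    rows = []
    for f in filters:  # one row of N2 values per filter
        # Per channel: N4 convolution outputs, reduced to one value by max pooling.
        rows.append([max(conv1d_valid(ch, k)) for ch, k in zip(x, f)])
    # rows is the concatenated N3 x N2 matrix; mix collapses it to 1 x N2.
    n2 = len(rows[0])
    return [sum(mix[i] * rows[i][c] for i in range(len(rows))) for c in range(n2)]

random.seed(0)
N2, N3, L, K = 4, 3, 10, 3
x = [[random.random() for _ in range(L)] for _ in range(N2)]
filters = [[[random.random() for _ in range(K)] for _ in range(N2)] for _ in range(N3)]
mix = [random.random() for _ in range(N3)]
features = intent_features(x, filters, mix)  # N2 values, fed to the classifier
```

The final vector has N2 entries regardless of the input length L, which is what lets a fixed-size network consume it downstream.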

    ENHANCING PERFORMANCE OF LOCAL DEVICE

    Publication No.: US20210208582A1

    Publication Date: 2021-07-08

    Application No.: US16847213

    Application Date: 2020-04-13

    Inventor: Kwangyong LEE

    Abstract: A method for improving performance of a local device based on guide data from a remote device, according to one embodiment of the present disclosure, includes transmitting, to the remote device, first image data generated by the local device at a first time point, receiving guide data related to the first image data from the remote device, and registering, by a processor, the guide data to second image data generated by the local device at a second time point, based on first spatial information on the first image data, wherein the second time point is a time point that is after the first time point. A trained model for object recognition according to the present disclosure may include a deep neural network generated through machine learning, and the transmitting of the guide data may be performed in an Internet of Things (IoT) environment using a 5G network.

    ARTIFICIAL INTELLIGENCE APPARATUS FOR PERFORMING SPEECH RECOGNITION AND METHOD THEREOF

    Publication No.: US20200043478A1

    Publication Date: 2020-02-06

    Application No.: US16653569

    Application Date: 2019-10-15

    Inventor: Kwangyong LEE

    Abstract: Disclosed herein is an artificial intelligence apparatus for performing speech recognition including a microphone configured to receive a speech command of a user, a learning processor configured to determine anaphora included in text data corresponding to the speech command using an anaphora recognition model for determining anaphora included in predetermined text data, and a processor configured to specify an object referred to by the determined anaphora based on context information including information input to or output from the artificial intelligence apparatus, determine a response to the speech command based on the specified object, and control the artificial intelligence apparatus according to the determined response.
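
The idea of resolving an anaphor from context can be illustrated with a rule-based stand-in. The patent describes a learned anaphora recognition model; the fixed word set, the "most recent object wins" rule, and the function name `resolve` below are simplifying assumptions, not the disclosed method.

```python
# Hypothetical stand-in for the learned anaphora recognition model.
ANAPHORA = {"it", "that", "this", "them"}

def resolve(command, context):
    """Replace the first anaphoric word with the most recent object in context."""
    words = command.lower().split()
    for i, word in enumerate(words):
        if word in ANAPHORA and context:
            words[i] = context[-1]  # the most recently mentioned object
            break
    return " ".join(words)

# Context holds objects recently input to or output from the apparatus.
resolved = resolve("Turn it off", ["living room light", "air conditioner"])
```

With this context, "it" is taken to refer to the air conditioner, and the resolved command can then be mapped to a response.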
