INFORMATION PROCESSING APPARATUS, INFORMATION PROCESSING METHOD, AND COMPUTER PROGRAM PRODUCT

    公开(公告)号:US20240347048A1

    公开(公告)日:2024-10-17

    申请号:US18442441

    申请日:2024-02-15

    IPC分类号: G10L15/16 G10L15/08 G10L15/28

    摘要: According to an embodiment, an information processing apparatus includes one or more hardware processors configured to function as a memory control unit, a transformation unit, a first convolutional neural network (CNN), and a second CNN unit. The memory control unit reads a first stride parameter used for controlling an output resolution and a first dilation parameter used for controlling an input resolution from a memory device. The transformation unit transforms the first stride parameter to a second stride parameter and transforms the first dilation parameter to a second dilation parameter by using a transformation parameter. The first CNN unit executes first CNN processing of a feature vector by using at least the second stride parameter. The second CNN unit executes second CNN processing with an output vector of the first CNN unit as an input by using at least the second dilation parameter.

    Information processing device and information processing method

    公开(公告)号:US12062360B2

    公开(公告)日:2024-08-13

    申请号:US16972420

    申请日:2019-03-12

    申请人: SONY CORPORATION

    摘要: The present invention has an issue of effectively reducing the input load related to a voice trigger. There is provided an information processing device comprising a registration control unit that dynamically controls registration of startup phrases used as start triggers of a voice interaction session, in which the registration control unit temporarily additionally registers at least one of the startup phrases based on input voice. There is also provided an information processing method comprising dynamically controlling, by a processor, registration of startup phrases used as start triggers of a voice interaction session, in which the controlling further includes temporarily additionally registering at least one of the startup phrases based on input voice.

    Information processing apparatus and information processing method

    公开(公告)号:US12014736B2

    公开(公告)日:2024-06-18

    申请号:US17413158

    申请日:2019-10-30

    摘要: An information processing apparatus that includes a control unit controlling an action of an autonomous operation unit, and in which the control unit controls transition of plural states relating to speech recognition processing through the autonomous operation unit based on a detected trigger, and the states include a first active state in which an action of the autonomous operation unit is restricted, and a second active state in which the speech recognition processing is performed. An information processing method in which a processor controls an action of an autonomous operation unit, the controlling includes controlling transition of plural states relating to speech recognition processing through the autonomous operation unit based on a detected trigger, and the states include a first active state in which an action of the autonomous operation unit is restricted, and a second active state in which the speech recognition processing is performed.

    Clothes-processing device
    5.
    发明授权

    公开(公告)号:US12006619B2

    公开(公告)日:2024-06-11

    申请号:US17267073

    申请日:2019-08-21

    摘要: The present invention relates to a clothes-processing device comprising: a cabinet comprising a body; a body front surface fixed to the body and forming the front surface, and an introduction opening formed through the body front surface; a drum comprising a drum body disposed in the cabinet so as to store clothes and a drum introduction opening formed through the drum body to communicate with the introduction opening; a driving part for rotating the drum; a door rotatably disposed at the cabinet so as to open or close the introduction opening; a control part for controlling the driving part; and a voice recognition part disposed at the door so as to recognize a voice generated by a user and transmit a control command corresponding to the recognized voice to the control part.