DEVICE AND METHOD FOR CONTROLLING A ROBOT
    63.
    发明公开

    公开(公告)号:EP4386632A1

    公开(公告)日:2024-06-19

    申请号:EP22213403.3

    申请日:2022-12-14

    申请人: Robert Bosch GmbH

    IPC分类号: G06N3/092 G06N3/045

    CPC分类号: G06N3/092 G06N3/045

    摘要: According to various embodiments, a method for training a control policy is described, comprising estimating the variance of a value function which associates a state with a value of the state or a pair of state and action with a value of the pair by solving a Bellman uncertainty equation, wherein, for each of multiple states, the reward function of the Bellman uncertainty equation is set to the difference of the total uncertainty about the mean of the value of the subsequent state following the state and the average aleatoric uncertainty of the value of the subsequent state and biasing the control policy in training towards regions for which the estimation gives a higher variance of the value function than for other regions.

    METHOD, APPARATUS AND COMPUTER PROGRAM
    69.
    发明公开

    公开(公告)号:EP4300857A1

    公开(公告)日:2024-01-03

    申请号:EP23181046.6

    申请日:2023-06-22

    摘要: There is provided an apparatus for a first communication node, the apparatus comprising: means for synchronising a common reference timing with a second communication node; means for obtaining an indication of a time window, wherein the time window specifies a period of time between a first time instance and a second time instance; and means for configuring a machine learning-based function at the first communication node, wherein the configuration of the machine learning-based function is common between the first and second communication nodes. The apparatus further comprising means for executing the machine learning-based function; and means for obtaining information by measuring a performance metric, for the machine learning-based function, during the time window. The apparatus further comprising means for assigning a time identification to the measured information during the time window, wherein the time identification is associated with the common reference timing; and means for providing, to the second communication node, the measured information according to the time identification.

    COMPUTER-IMPLEMENTED METHOD, COMPUTER PROGRAM PRODUCT AND SYSTEM FOR DATA ANALYSIS

    公开(公告)号:EP4290412A3

    公开(公告)日:2024-01-03

    申请号:EP23204765.4

    申请日:2018-09-05

    摘要: A computer-implemented method for data analysis is provided. The method comprises: obtaining a deep neural network (100) for processing data and at least a part of a training dataset used for training the deep neural network, the deep neural network comprising a plurality of hidden layers, the training dataset including possible observations that can be input to the deep neural network, the deep neural network being trained using the training dataset; obtaining first sets of intermediate output values that are output from at least one of the plurality of hidden layers, each of the first sets of intermediate output values obtained by inputting a different one of the possible observations included in said at least the part of the training dataset; constructing a latent variable model using the first sets of intermediate output values, the latent variable model providing a mapping of the first sets of intermediate output values to first sets of latent variables for the latent variable model in a sub-space that has a dimension lower than a dimension of the sets of the intermediate outputs; and storing the latent variable model and the first sets of latent variables for the latent variable model in a storage medium.