Incorporating user feedback into text prediction models via joint reward planning

    公开(公告)号:US11181988B1

    公开(公告)日:2021-11-23

    申请号:US17008265

    申请日:2020-08-31

    Applicant: Apple Inc.

    Abstract: An example process includes: obtaining input token(s); determining, using a joint prediction model, based on the input token(s): a first predicted token following the input token(s) and a second predicted token following the first predicted token; and a first user action to be performed on the first predicted token, where determining the first user action includes: determining a first reward value for performing the first user action based on a first current reward value for performing the first user action and a second reward value for performing a second user action on the second predicted token; outputting the first predicted token; detecting a user action performed on the first predicted token; and in accordance with a determination that the detected user action does not match the first user action: causing parameters of the joint prediction model to be updated, the parameters being configured to determine the first user action.

Patent Agency Ranking