Multimodal Dialog State Tracking and Action Prediction for Assistant Systems

    公开(公告)号:US20210117681A1

    公开(公告)日:2021-04-22

    申请号:US17006339

    申请日:2020-08-28

    Applicant: Facebook, Inc.

    Abstract: In one embodiment, a method includes receiving, from a client system associated with a user, a user request comprising a reference to a target object, accessing visual data from the client system, wherein the visual data comprises images portraying the target object and one or more additional objects, and wherein attribute information of the target object is recorded in a multimodal dialog state, resolving the reference to the target object based on the attribute information recorded in the multimodal dialog state, determining relational information between the target object and one or more of the additional objects portrayed in the visual data, and sending, to the client system, instructions for presenting a response to the user request, wherein the response comprises the attribute information and the determined relational information.

    Multimodal Entity and Coreference Resolution for Assistant Systems

    公开(公告)号:US20210118442A1

    公开(公告)日:2021-04-22

    申请号:US17006377

    申请日:2020-08-28

    Applicant: Facebook, Inc.

    Abstract: In one embodiment, a method includes accessing visual data from a client system associated with a user, wherein the visual data comprises images portraying one or more objects, receiving, from the client system, a user request, wherein the user request comprises a coreference to a target object, resolving the coreference to the target object from among the one or more objects, resolving the target object to a specific entity, and sending, to the client system, instructions for providing a response to the user request, wherein the response comprises attribute information about the specific entity.

Patent Agency Ranking