-
公开(公告)号:US20250124236A1
公开(公告)日:2025-04-17
申请号:US18518155
申请日:2023-11-22
Applicant: Databricks, Inc.
Inventor: Ridhima Gupta , Prithvi Kannan , Sunish Sohil Sheth , Kasey Uhlenhuth , Hubert Zub , Corey Zumar
IPC: G06F40/40 , G06F40/103 , G06F40/30
Abstract: A method for evaluating textual output of one or more machine-learned language models is presented. The method includes receiving, from a user of a client device, a first prompt for input to one or more machine-learned language models, providing the first prompt to the one or more models for execution, and receiving a set of generated responses to the first prompt from the one or more models. The method further includes generating a user interface (UI) on the client device displaying the first prompt and generated responses as a table user interface element. The method applies a selected evaluation function to the generated response to evaluate the response with respect to an evaluation objective and identifies words that influence the evaluation. The method generates one or more UI elements on the UI to display the results of the evaluation for the generated responses.