SYSTEMS AND METHODS NEAR NEGATIVE DISTINCTION FOR EVALUATING NLP MODELS

Invention Publication

US20230229861A1 SYSTEMS AND METHODS NEAR NEGATIVE DISTINCTION FOR EVALUATING NLP MODELS 审中-公开

Please log in to see more content

Patent Title: SYSTEMS AND METHODS NEAR NEGATIVE DISTINCTION FOR EVALUATING NLP MODELS
Application No.: US17837546

Application Date: 2022-06-10
Publication No.: US20230229861A1

Publication Date: 2023-07-20
Inventor: Philippe Laban , Chien-Sheng Wu , Wenhao Liu , Caiming Xiong
Applicant: Salesforce, Inc.
Applicant Address: US CA San Francisco
Assignee: Salesforce, Inc.
Current Assignee: Salesforce, Inc.
Current Assignee Address: US CA San Francisco
Main IPC: G06F40/284
IPC: G06F40/284

SYSTEMS AND METHODS NEAR NEGATIVE DISTINCTION FOR EVALUATING NLP MODELS

Abstract:

Embodiments described herein provide a method of evaluating a natural language processing model. The method includes receiving an evaluation dataset that may include a plurality of unit tests, the unit tests having: an input context, and a first candidate and a second candidate that are generated in response to the input context, where the first test candidate is associated with a first quality notation, and the second candidate is associated with a second quality notation. The method includes determining, via a model, a first likelihood of generating the first candidate and a second likelihood of generating the second candidate in response to the input context. The method also includes determining whether the first likelihood being greater than the second likelihood. The method also includes determining whether the first model passed the unit test, where the first quality notation indicates a higher quality candidate and the second quality notation indicate a lower quality candidate.

Public/Granted literature

US12223270B2 Systems and methods near negative distinction for evaluating NLP models Public/Granted day:2025-02-11

Information query

Global Dossier Espacenet

IPC分类:

G	物理
G06	计算；推算或计数
G06F	电数字数据处理（基于特定计算模型的计算机系统入G06N）
G06F40/00	处理自然语言数据（语音分析或综合，语音识别G10L）
G06F40/20	.自然语言分析（自然语言的语义分析入G06F40/30）
G06F40/279	..文字实体的识别
G06F40/284	...词汇分析，例如标记或搭配词