-
公开(公告)号:US20250094877A1
公开(公告)日:2025-03-20
申请号:US18969719
申请日:2024-12-05
Inventor: Fan WANG , Hua WU , Yingzhan LIN , Zengfeng ZENG , Yufeng HU , Jianhui DING , Haifeng WANG
IPC: G06N20/00
Abstract: A large model-based method of generating a text, a method of training a text generation model, a device, and a medium are provided, which relate to a field of artificial intelligence technology, specifically to fields of deep learning, natural language processing and large model technologies. The large model-based method of generating a text includes: acquiring a memory state for a text to be processed, where the memory state is generated based on a previous text of the text to be processed; determining an embedding feature of the text to be processed as an initial hidden state, and processing the memory state and the initial hidden state by using a first attention mechanism to obtain an updated hidden state; and generating a subsequent text for the text to be processed based on the updated hidden state.