GENERATING TOPIC-SPECIFIC LANGUAGE MODELS
    1.
    发明公开

    公开(公告)号:US20240347053A1

    公开(公告)日:2024-10-17

    申请号:US18629200

    申请日:2024-04-08

    申请人: TiVo Corporation

    IPC分类号: G10L15/183 G10L15/197

    CPC分类号: G10L15/183 G10L15/197

    摘要: Speech recognition may be improved by generating and using a topic specific language model. A topic specific language model may be created by performing an initial pass on an audio signal using a generic or basis language model. A speech recognition device may then determine topics relating to the audio signal based on the words identified in the initial pass and retrieve a corpus of text relating to those topics. Using the retrieved corpus of text, the speech recognition device may create a topic specific language model. In one example, the speech recognition device may adapt or otherwise modify the generic language model based on the retrieved corpus of text.

    Systems and methods for unsupervised structure extraction in task-oriented dialogues

    公开(公告)号:US12087281B2

    公开(公告)日:2024-09-10

    申请号:US17589693

    申请日:2022-01-31

    申请人: Salesforce, Inc.

    摘要: Embodiments described herein propose an approach for unsupervised structure extraction in task-oriented dialogues. Specifically, a Slot Boundary Detection (SBD) module is adopted, for which utterances from training domains are tagged with the conventional BIO schema but without the slot names. A transformer-based classifier is trained to detect the boundary of potential slot tokens in the test domain. Next, while the state number is usually unknown, it is more reasonable to assume the slot number is given when analyzing a dialogue system. The detected tokens are clustered into the number of slot of groups. Finally, the dialogue state is represented with a vector recording the modification times of every slot. The slot values are then tracked through each dialogue session in the corpus and label utterances with their dialogue states accordingly. The semantic structure is portrayed by computing the transition frequencies among the unique states.