Analysis of Unstructured Computer Text to Generate Themes and Determine Sentiment

    公开(公告)号:US20170242919A1

    公开(公告)日:2017-08-24

    申请号:US15047527

    申请日:2016-02-18

    申请人: FMR LLC

    IPC分类号: G06F17/30

    CPC分类号: G06F17/30684 G06F17/30867

    摘要: Methods and apparatuses are described for analyzing unstructured computer text for theme generation to determine sentiment. A computer store stores unstructured text that is delimited, a searched phrases log, and a phrase click log. A computer server extracts phrases from the unstructured delimited text by splitting each line of the unstructured delimited text into one or more phrases. The computer server generates tokens from the unstructured delimited text, where the tokens comprise segments of the unstructured delimited text. The computer server determines one or more themes present in the unstructured delimited text.