Invention Grant
- Patent Title: Document content analysis based on topic modeling
- Application No.: US15269458
- Application Date: 2016-09-19
- Publication No.: US10558657B1
- Publication Date: 2020-02-11
- Inventor: Weiwei Cheng, Christopher Gonzales
- Applicant: Amazon Technologies, Inc.
- Applicant Address: US WA Seattle
- Assignee: Amazon Technologies, Inc.
- Current Assignee: Amazon Technologies, Inc.
- Current Assignee Address: US WA Seattle
- Agency: Lee & Hayes, P.C.
- Main IPC: G06F16/00
- IPC: G06F16/00; G06F16/2453; G06F16/93; G06F16/248

Abstract:
A mechanism for progressive topic modeling is disclosed to facilitate document content analysis. Input documents can be sorted and divided into multiple groups. Topic modeling is performed for each group, where the topic modeling for one group is based on the generated topic model from a previous group, if available. The vocabulary used in the topic modeling process can also be updated for each group of documents. The generated topics can be presented in a user interface to facilitate a user in analyzing the documents. The topic modeling mechanism can also be utilized to enhance a document search experience by generating topics from documents contained in search results and presenting topic words to a user as suggested search terms.
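
The progressive scheme in the abstract (sort, divide into groups, model each group starting from the previous group's model, update the vocabulary per group) can be illustrated with a minimal sketch. The abstract does not name a specific topic model or sort key, so everything concrete here is an assumption: the sketch uses an LDA-style collapsed Gibbs sampler, sorts by a timestamp, and carries the previous group's topic-word counts forward as the starting counts. The names `fit_group`, `progressive_topics`, and `top_words` are hypothetical and do not come from the patent.

```python
import random
from collections import defaultdict


def fit_group(docs, vocab, prior, num_topics, rng, sweeps=30, alpha=0.1, beta=0.01):
    """Fit one group with collapsed Gibbs sampling, seeded by the previous model."""
    # Assumption: the previous group's topic-word counts act as starting counts.
    topic_word = [defaultdict(float, prior[k]) if prior else defaultdict(float)
                  for k in range(num_topics)]
    topic_total = [sum(tw.values()) for tw in topic_word]
    doc_topic = [[0] * num_topics for _ in docs]
    assign = []
    # Give every token in this group a random initial topic.
    for d, words in enumerate(docs):
        zs = []
        for w in words:
            k = rng.randrange(num_topics)
            zs.append(k)
            doc_topic[d][k] += 1
            topic_word[k][w] += 1
            topic_total[k] += 1
        assign.append(zs)
    v = len(vocab)
    for _ in range(sweeps):
        for d, words in enumerate(docs):
            for i, w in enumerate(words):
                # Remove the token, resample its topic, then add it back.
                k = assign[d][i]
                doc_topic[d][k] -= 1
                topic_word[k][w] -= 1
                topic_total[k] -= 1
                weights = [(doc_topic[d][t] + alpha) * (topic_word[t][w] + beta)
                           / (topic_total[t] + beta * v) for t in range(num_topics)]
                k = rng.choices(range(num_topics), weights)[0]
                assign[d][i] = k
                doc_topic[d][k] += 1
                topic_word[k][w] += 1
                topic_total[k] += 1
    return [dict(tw) for tw in topic_word]


def progressive_topics(documents, group_size, num_topics, seed=0):
    """Sort (timestamp, text) pairs, split them into groups, and model each in turn."""
    rng = random.Random(seed)
    ordered = sorted(documents, key=lambda d: d[0])  # assumed sort key: timestamp
    vocab, model, history = set(), None, []
    for start in range(0, len(ordered), group_size):
        group = [text.lower().split() for _, text in ordered[start:start + group_size]]
        for words in group:
            vocab.update(words)  # the vocabulary is updated for each group
        model = fit_group(group, vocab, model, num_topics, rng)
        history.append((len(vocab), model))
    return history


def top_words(topic, n=3):
    """Return the n highest-count words of one topic (ties broken alphabetically)."""
    ranked = sorted(topic.items(), key=lambda kv: (-kv[1], kv[0]))
    return [w for w, c in ranked if c > 0][:n]


docs = [
    (3, "battery charger cable battery"),
    (1, "pizza pasta sauce pizza"),
    (2, "pasta sauce cheese"),
    (4, "charger cable adapter"),
]
for vocab_size, model in progressive_topics(docs, group_size=2, num_topics=2):
    print(vocab_size, [top_words(t) for t in model])
```

Each entry of the returned history pairs the vocabulary size after a group with that group's topic-word counts. The top words of each topic are the kind of output that could be shown in a user interface or offered as suggested search terms, as the abstract describes. Carrying counts forward unchanged is only one way to base a model on its predecessor; a real system might decay or reweight them.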