发明申请
- 专利标题: Systems and methods for hybrid text summarization
- 专利标题(中): 混合文本摘要的系统和方法
-
申请号: US10684508申请日: 2003-10-15
-
公开(公告)号: US20050086592A1公开(公告)日: 2005-04-21
- 发明人: Livia Polanyi , Martin Van Den Berg , Giovanni Thione , Richard Crouch , Christopher Culy , David Ahn
- 申请人: Livia Polanyi , Martin Van Den Berg , Giovanni Thione , Richard Crouch , Christopher Culy , David Ahn
- 主分类号: G06F17/30
- IPC分类号: G06F17/30 ; G06F15/00 ; G06F17/27
摘要:
Techniques are provided for segmenting text into categorized discourse constituents and attaching discourse constituents into a structural representation of discourse. Techniques for determining hybrid structural and non-structural summaries of a text are also provided. A text is segmented based on a theory of discourse analysis into at least a main discourse constituent containing spatio-temporal information about a single event in a possible world view. The discourse constituents are then inserted into a structural representation of discourse. Non-structural techniques are used to determine relevance scores and important discourse constituents are determined. Relevance scores are percolated through the structural representation of discourse to determine supporting preceding discourse constituents that preserve grammaticality. A hybrid text summary is then determined based on the structural representation of the discourse and relevance scores.
公开/授权文献
- US07610190B2 Systems and methods for hybrid text summarization 公开/授权日:2009-10-27
信息查询