Invention Grant
US08209665B2 Identification of topics in source code (Active)

Identification of topics in source code
Abstract:
Topics in source code can be identified using Latent Dirichlet Allocation (LDA) by receiving source code, identifying domain specific keywords from the source code, generating a keyword matrix, processing the keyword matrix and the source code using LDA, and outputting a list of topics. The list of topics is output as collections of domain specific keywords. Probabilities of domain specific keywords belonging to their respective topics can also be output. The keyword matrix comprises weighted sums of occurrences of domain specific keywords in the source code.
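
The pipeline described in the abstract can be sketched in a few lines of Python. This is an illustrative sketch only, not the patented implementation. The abstract does not say how the weights are chosen, so the `WEIGHTS` table, the stopword list, the `#` comment convention, and the helper names (`split_words`, `keyword_matrix`, `lda_gibbs`, `top_keywords`) are all assumptions made for the example. The LDA step here is a plain collapsed Gibbs sampler, chosen only because it fits in a short, dependency-free block.

```python
import random
import re
from collections import defaultdict

# Hypothetical weights per location type; the abstract only says "weighted sums".
WEIGHTS = {"identifier": 2, "comment": 1}
# Hypothetical stopword list used to keep only domain-like words.
STOPWORDS = {"def", "return", "str", "the", "for", "and"}


def split_words(token):
    """Split snake_case and camelCase tokens into lowercase words."""
    parts = re.findall(r"[A-Za-z][a-z]*", token)
    return [p.lower() for p in parts if len(p) > 2]


def keyword_matrix(files):
    """Return (vocab, matrix); matrix[d][w] is a weighted occurrence count."""
    rows = []
    for text in files:
        row = defaultdict(int)
        for line in text.splitlines():
            # Assumes '#' starts a comment, as in Python source.
            code, _, comment = line.partition("#")
            for kind, chunk in (("identifier", code), ("comment", comment)):
                for token in re.findall(r"[A-Za-z_]+", chunk):
                    for word in split_words(token):
                        if word not in STOPWORDS:
                            row[word] += WEIGHTS[kind]
        rows.append(row)
    vocab = sorted({w for row in rows for w in row})
    return vocab, [[row.get(w, 0) for w in vocab] for row in rows]


def lda_gibbs(matrix, n_topics, iters=200, alpha=0.1, beta=0.01, seed=0):
    """Collapsed Gibbs sampling for LDA; returns topic-word probabilities."""
    rng = random.Random(seed)
    n_words = len(matrix[0])
    # Expand each weighted count into that many word tokens.
    docs = [[w for w, c in enumerate(row) for _ in range(c)] for row in matrix]
    doc_topic = [[0] * n_topics for _ in docs]
    topic_word = [[0] * n_words for _ in range(n_topics)]
    topic_total = [0] * n_topics

    def bump(d, z, w, delta):
        doc_topic[d][z] += delta
        topic_word[z][w] += delta
        topic_total[z] += delta

    # Random initial topic assignment for every token.
    assign = []
    for d, doc in enumerate(docs):
        zs = [rng.randrange(n_topics) for _ in doc]
        for w, z in zip(doc, zs):
            bump(d, z, w, +1)
        assign.append(zs)

    for _ in range(iters):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                bump(d, assign[d][i], w, -1)
                weights = [
                    (doc_topic[d][k] + alpha)
                    * (topic_word[k][w] + beta)
                    / (topic_total[k] + beta * n_words)
                    for k in range(n_topics)
                ]
                z = rng.choices(range(n_topics), weights=weights)[0]
                assign[d][i] = z
                bump(d, z, w, +1)

    return [
        [(topic_word[k][w] + beta) / (topic_total[k] + beta * n_words)
         for w in range(n_words)]
        for k in range(n_topics)
    ]


def top_keywords(vocab, phi, n=3):
    """List each topic as its n most probable (keyword, probability) pairs."""
    return [sorted(zip(vocab, row), key=lambda p: -p[1])[:n] for row in phi]


# Two tiny made-up "source files" standing in for received source code.
files = [
    "def compute_loan_interest(loanAmount, interestRate):  # interest on the loan\n"
    "    return loanAmount * interestRate\n",
    "def render_invoice_page(invoiceTotal):  # page for the invoice\n"
    "    return str(invoiceTotal)\n",
]
vocab, matrix = keyword_matrix(files)
phi = lda_gibbs(matrix, n_topics=2)
for k, topic in enumerate(top_keywords(vocab, phi)):
    print(k, [(word, round(p, 3)) for word, p in topic])
```

Each printed topic is a collection of keywords with the probability of each keyword belonging to that topic, mirroring the two outputs the abstract mentions. On real code bases a library LDA implementation would normally replace the hand-written sampler.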