Techniques for focused crawling
Abstract:
In some embodiments, a user interface module may be configured to assist a user in defining a set of states and a set of transitions, each transition of the set of transitions linking two states of the set of states, the set of states and the set of transitions defining a path to a target state, each state of the set of states being a web page, each transition of the set of transitions defining an interaction with an interactive element. A navigation instruction generation module may be configured to generate navigation instructions based on the set of states and the set of transitions. A focused crawler module may be configured to navigate to the target state based on the navigation instructions. A scraper module may be configured to copy information from the target state in response to the focused crawler module navigating to the target state.
Public/Granted literature
Information query
Patent Agency Ranking
0/0