Search Engine

class engine.search_engine.SearchEngine.SearchEngine[source]
reload_queries()[source]

Reloads queries from database.

search(content)[source]

Iterates over all queries and returns those for which number of found keywords satisfies search threshold.

Parameters:content (str) – content of web page associated with the URL.
Returns:list of queries for which search threshold was satisfied.
search_in_url(url, content)[source]

Search web page content in order to find keywords.

Parameters:
  • url (str) – URL of web page being crawled.
  • content (str) – content of web page associated with the URL.