Threats Watcher - Core Algorithm

Watcher.threats_watcher.core.cleanup(): Remove words with a creation date greater than 30 days.

Watcher.threats_watcher.core.fetch_last_posts(nb_max_post)

Fetch the nb last posts for each feed.

Watcher.threats_watcher.core.focus_five_letters(): Focus on 5 letters long words.

Watcher.threats_watcher.core.focus_on_top(words_occurrence)

Focus on top words. Populated the database with only words with a minimum occurrence of “words_occurence” in feeds.

Watcher.threats_watcher.core.main_watch()

Main function:

Watcher.threats_watcher.core.remove_banned_words(): Clean the posts for specific patterns: BannedWord, then english + french common words.

Watcher.threats_watcher.core.send_threats_watcher_notifications(email_words): Sends notifications to Slack, Citadel, TheHive or Email based on Threats Watcher.

Watcher.threats_watcher.core.start_scheduler()

Launch multiple planning tasks in background:

Watcher.threats_watcher.core.tokenize_count_urls(): Tokenize phrases to words, Count word occurences and keep the word post source urls.