Web Data Collection
SPIDA is a set of tools that collect unstructured data from the clear, deep and dark web.
SPIDA comes in three configurations that enable investigators to acquire and collate the material in the most appropriate form.
For fully automated full site captures based on keywords and URL’s. Wolf unique feature is a heuristic learning engine that enables Wolf to learn the layout of various web site forms such as bulletin boards with their wide variety of layout and conventions for data presentation. Wolf can learn date formats, name conventions, post configurations, reply formats and then retrieve into data
Funnelweb delivered complete website download based on key word or URL. Multiple searches and downloads can be run simultaneously. All data is stored and can be exported to IBM i2 Analyst's Notebook charts for analysis.
Anonymous searching can be carried out using the built in TOR browser.
For targeted integration of unstructured data, Huntsman gives investigators and analysts the ability to search and select web content from any web site to add to an IBM i2 Analyst's Notebook chart. Collect intelligence from any web source including web pages, forums, bulletin boards and social networks “Clip” data from any web source including images and text. Enhance Analyst productivity and extend the scope of investigations Enables web information and links to be collected from any web site – whether on the Surface, Deep or Dark Web. Information collected is added directly to IBM i2 Analyst's Notebook charts as entities or links and can be fully annotated. All items are linked to original sources and available through IBM i2 Analyst's Notebook Item Properties.