Open Internet (websites, results of Internet search engines),
hidden Internet (with special scripts for site-specific search engines),
intranet, file servers in the company network, mailboxes (Outlook
including PST archives, Outlook Express, Thunderbird, Exchange Server,
Lotus Notes), user PCs/laptops, MS SharePoint portals, databases, and
content management systems (with a tailored interface).
Please note that all data sources can be merged into one and the same
document collection.
MS Word, PDF, PowerPoint, Excel, HTML, XML, plain text (TXT), RTF,
various mail formats, PostScript (PS), various image file formats (for
image files, the indexed text consists of the full file name plus any
text that can be extracted from the image file), and any other file
format for which the user can supply an IFilter.
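
The handling of image files described above can be illustrated with a minimal sketch. It is not the product's actual implementation: the function name image_index_text is hypothetical, "full file name" is assumed to mean the path string exactly as it is passed in, and the extracted text is assumed to come from some separate extraction step (for example OCR or embedded metadata) that is not shown here.

```python
def image_index_text(full_file_name: str, extracted_text: str = "") -> str:
    """Compose the indexable text for an image file.

    Hypothetical helper: joins the full file name with whatever text
    could be extracted from the image. If nothing could be extracted,
    the file name alone is returned.
    """
    # Collapse runs of whitespace so the extracted text is one clean line.
    cleaned = " ".join(extracted_text.split())
    parts = [full_file_name]
    if cleaned:
        parts.append(cleaned)
    # One part per line: file name first, extracted text second.
    return "\n".join(parts)


if __name__ == "__main__":
    print(image_index_text("scans/invoice_2021.png", "  Total:  42 EUR "))
```

Under these assumptions, an image with no extractable text still remains findable by its file name.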