InfoCodex - Cross-lingual Text Analysis, Text Mining, Linguistic Database, Taxonomy, Ontology

InfoCodex is based on an advanced new technology for text analysis and cross-lingual content recognition (patented in the EU and U.S.A.).

Words or expressions (groups of words such as "European Union", "Enterprise Search Engine" etc.) present in the text are matched against InfoCodex's linguistic database, which comprises more than 3 million words/expressions which are structured into cross-lingual synonym groups. These synonym groups point to a node in a universal taxonomy (ontology) characterizing the meaning of the matched synonym. This language-independent information is then used to extract the content of the document.

Cross-lingual universal knowledge repository

The central multi-lingual knowledge base (linguistic database) is the result of combining and harmonizing knowledge repositories from established works such as the WordNet (Princeton University), EuroVoc, AgriVoc, JuriVoc, the International Classification for Standards (ICS), financial taxonomies and many other sources.

Content recognition and categorization

Content similarity

Semantic and similarity search

Abstracts Generation

Advanced visualization

Privacy and security

Semantic Web

Customer benefits

The semantic engine

Distributed Sources

Cross-lingual text analysis