Turning Information
... into Knowledge
Our Solutions Semantic Technologies Support Partners About
Distributed Sources Cross-lingual text analysis Content recognition and categorization Content similarity Semantic and similarity search Abstracts Generation Advanced visualization Privacy and security Semantic Web Customer benefits
Contact English Deutsch Franšais

Cross-lingual text analysis

The linguistic database - the
universal knowledge base

InfoCodex is based on an advanced new technology for text analysis and cross-lingual content recognition (patented in the EU and U.S.A.).

Words or expressions (groups of words such as "European Union", "Enterprise Search Engine" etc.) present in the text are matched against InfoCodex's linguistic database, which comprises more than 3 million words/expressions which are structured into cross-lingual synonym groups. These synonym groups point to a node in a universal taxonomy (ontology) characterizing the meaning of the matched synonym. This language-independent information is then used to extract the content of the document.

Cross-lingual universal knowledge repository

The central multi-lingual knowledge base (linguistic database) is the result of combining and harmonizing knowledge repositories from established works such as the WordNet (Princeton University), EuroVoc, AgriVoc, JuriVoc, the International Classification for Standards (ICS), financial taxonomies and many other sources.


26. - 27.09.2016 Keynote at Big Data Analytics and Data Mining, London

14.03.16 WebPlatform Ares4 with InfoCodex presented at CeBIT 2016 Hannover

16.09.15 Presentation at SEMANTICS Vienna 2015 "Scientific Discovery by Machine ..." (Session 1.4)

02.06.2015 Presentation at Swiss Competitive Intelligence

26.02.13 Semantic Tech Turns up Biomarkers

13.02.13 Discovery of Novel Biomarkers (joint paper with Merck & Co)