• Extractor
     

    Extract the pure essence, the pure meaning of a of a document, an ebook or a series of search engine results automatically. 

    Used by search engine optimization (SEO) and document management companies alike, the Extractor summarization technology reads a document, much like a human being does, returning lists of the keywords and key phrases accurately weighted as they are found in that document, text or web page.

    Uniquely positioned for web services, Extractor is immediately capable of consuming documents of any length and subject matter, distilling the precise, contextual meaning of the target content into keyword and key phrase summary formats. Extractor's unique patented technology delivers precise content summaries in any subject domain without retraining and without human intervention.

    Why is Extractor Different : The Extractor technology uses a patented set of core algorithms to extract (read) keywords and key phrases from any text-based document. The patented process allows Extractor to maintain the context in which the subject matter has been expressed and one of the primary differences. In essence a machine learned method for reading (summarizing ) any document of any subject matter written in an electronic text format. The distinction of machine learned is important in contrast to other methods of summarization discussed in the market place, which typically use referential and / or probability based theories for trying to determine content. These contrasting approaches require that their algorithms be trained and retrained per specific subject domain (non-machine learned) and are usually English language based. The Extractor Technology does not require per subject training and retraining allowing it to be used across any set of documents in one or a mix of subject domains and in any of six international languages (English, French, German, Japanese, Korean and Spanish).

    Platform agnostic : At its core the Extractor component software is designed for developers using the most common computing and design platforms which today include Windows, Linux and Solaris. The architecture of the Extractor Technology does allow for custom system compiles as need arise. The commercial software development kit (SDK) includes development support for C, VC++, C#, Java, Perl, Python (Visual Studio and Visual Studio .NET).

    Summarization: Extractor is exceptionally good at web content summarization incorporating its patented technology to summarize text, e-mail and html content into weighted lists of keywords and key phrases extracting the primary contextual sentence highlight of how the keyword / key phrase has been used. Uniquely positioned for web services, Extractor is immediately capable of consuming documents of any length and subject matter, distilling the precise, contextual meaning of the content into keyword and key phrase summary formats. Extractor's unique patented technology delivers precise content summaries of any subject domain without retraining and without human intervention.

    Relevant Information : Not just information but contextually accurate, relevant information is a critical tool for the success of business today. Being able to source relevant information in context of the subject matter is what gives organizations an ultimate competitive advantage. Rather than working through traditional, time consuming, iterative search engine processes, incorporating Extractor into Enterprise systems empowers corporate information with relevant and meaningful representations meeting the needs of today's social workforce