Apache Stanbol Entityhub Default Configuration · This provides the default configuration for the Entityhub. This bundle can be removed by users that want/need to provide their own configuration conflicting with the defaults.
Group: org.apache.stanbol - All Dependencies
Apache Stanbol Enhancer JobManager Reactor · Pseudo project to build JoManager Implementations
Apache Stanbol Enhancement Engine : XMP Extractor · Enhancement Engine that extracts XMP data
Apache Stanbol Enhancement Engine : Tika · Enhancement Engine that uses Apache Tika to convert parsed documents to plain text and xhtml
Apache Stanbol Enhancement Engine : TextAnnotation new Model Conterter · Post processing Engine that converts fise:TextAnnotation to the model as descibed by STANBOL-087. This means that it add fise:selection-prefix and fise:selection-suffix values for all TextAnnotations. This Engine allows EnhancementEngines that do not support those properties to be used in usecases where those properties are required.
Apache Stanbol Enhancement Engine : Smart Chinese Tokenizer · Sententence detection and Tokenizer for Chinese based on the Solr/Lucene smartcn (Smart Chinese) analyzer module
Apache Stanbol Enhancement Engine : Sentiment Word Classifier · Sentiment Tagging Engine for the Stanbol NLP chain. Tags sentiment values in word granularity based on SentimentClassifier implementations registered as OSGI services. This engine includes implementations for English (Sentiment Word Net) and German (SentiWs) data (please note the license of the according data sets).
Apache Stanbol Enhancement Engine : Sentiment Summarization · Consumes Sentiment tags on Token level and summarizes those for Noun Phrases, Sentences and the document as a whole (STANBOL-760).
Apache Stanbol Enhancement Engine : RESTful NLP processing · A Stanbol engine that uses a standadized API to call a RESTful service for NLP processing. The service is based on JSON serialized AnalysedText ContentPart.
Apache Stanbol Enhancement Engine : RESTful Language Identification · A Stanbol engine that uses a standadized API to call a RESTful service for Language Identification
Apache Stanbol Enhancement Engine : Refactor Engine · Implementation of an enhancement engine which refactors the ouptut to a target vocabulary
Apache Stanbol Enhancement Engine : POS tag based Chunker · Uses POS tag information of Tokens to create Noun and Verb phrases.
Apache Stanbol Enhancement Engine : OpenNLP Tokenizer · A Stanbol engine tokenizing the AnalyzedText contentpart for further processing by other engines.
Apache Stanbol Enhancement Engine : OpenNLP Sentence Detection · A Stanbol engine adding Sentence annotations to the AnalyzedText content part to a content item for further processing.
Apache Stanbol Enhancement Engine : OpenNLP POS Tagging · A Stanbol engine adding a POS content part to a content item for further processing
Apache Stanbol Enhancement Engine : OpenNLP Chunker · OpenNLP chunking engine to detect phrases in a text. Requires POS tagging to be applied to content item AnalysedText content part before it can run.
Apache Stanbol Enhancement Engine : OpenCalais · Retrieves named entities from OpenCalais
Apache Stanbol Enhancement Engine : NLP to RDF converter · This Enhancement Engine converts NLP data stored in the AnalyzedText content part to RDF and adds them to the enhancement metadata of the processed ContentItem. This Engine uses the NIF 1.0 (NLP Interchange Format) Ontologies for representing the NLP data in RDF.
Apache Stanbol Enhancement Engine : Lucene FST Entity Linking · An in-memory EntityLinking engine that uses Lucenes FST (Finite State Transducer) technology. This engine is based on code provided by the Solr Text Tagger (https://github.com/OpenSextant/SolrTextTagger/) but provides a deep integration with Apache Stanbol (Datafile provider, NLP processing module and existing EntityLinking functionality).
Apache Stanbol Enhancement Engine : Language Identifier · identifies a text as: German(de), English(en), French(fr), Spanish(es), Italian(it), Swedish(sv), Polish(pl), Dutch(nl), Norwegian(no), Finnish(fi), Albanian(sq), Slovakian(sk), Slovenian(sl), Danish(da), Hungarian(hu)
Apache Stanbol Enhancement Engine : Language Detection · language detection for 53 languages based on http://code.google.com/p/language-detection
Apache Stanbol Enhancement Engine : Kuromoji NLP · NLP processing based on the Lucene Kuromoji module