JCoRe OpenNLP Sentence Splitter, Biomedical English · This project employs the UIMA wrapper (jcore-opennlp-sentence-ae) with a model trained on the PennBioIE and Genia corpora.
Group: de.julielab - All Dependencies
JCoRe OpenNLP Parser, Biomedical English · This project employs the OpenNLP wrapper (jcore-opennlp-parser-ae) with a model trained on the Genia corpus.
JCoRe OpenNLP Chunker, Biomedical English · This project employs the OpenNLP wrapper (jcore-opennlp-chunker-ae) with a model trained on the Genia corpus.
JCoRe File Reader · Reads simple text files; derived from UIMA example "FileSystemCollectionReader"
JCoRe OpenNLP Tokenizer, Medical German · This Model was trained on the confidential FraMed corpus.
JCoRe OpenNLP Sentence Segmenter, Medical German · This Model was trained on the confidential FraMed corpus.
JCoRe OpenNLP POS Tagger, Medical German · This Model was trained on the confidential FraMed corpus.
JCoRe Lingpipe Porter Stemmer AE · This component applies the Porter stemming algorithm to all tokens in the CAS.
JCoRe Token Annotator, Medical German · UIMA Wrapper for the JCoRe Token Boundary Detector with a model trained on the confidential FraMed corpus.
JCoRe Sentence Annotator, Medical German · UIMA Wrapper for the JCoRe Sentence Boundary Detector (jcore-jsbd-ae) with a model trained on data from the confidential FraMed corpus
JCoRe DTA Reader · Reader for DTA files (German digital humanities corpus)
JULIE Lab Parent POM · The Parent POM for all JULIE Lab projects.
Plugin: Mapper: Preanalyzed · Provides the preanalyzed field datatype. The plugin allows to specify the exact terms of a field in JSON format without any further analysis being done.
JULIE Lab Neo4j Server Plugins · This project contains Neo4j server plugins used by the Julie Lab team.
Aho-Corasick algorithm Neo4j plugin · This plugin enables the Aho-Corasick TRIE building and dictionary recognition algorithm into Neo4j by modeling the resulting TRIE graph right into Neo4j.
JULIE Lab ElasticSearch Query Components · This is a Tapestry 5 project that offers a Tapestry-IoC integrated query layer for an ElasticSearch server.
JULIE Lab Embeddings Server · This project offers a Java server that wraps a Word2Vec embedding file. The server can then be queried for embeddings via a simple HTTP API. A Java client is available via the julielab-embeddings-client project. For other languages, the server protocol must be implemented which is a simple binary format as described in the README.md file.
JULIE Lab Embedding Management · This is a helper project for the usage of word embeddings. It is able to read pretrained embeddings from a word2vec file and offers direct access via the Deeplearning4j API or a web server that offers the embeddings via an HTTP interface.
JULIE Lab Embeddings Client · This is the Java counterpart to the JULIE Lab embeddings server module. It uses the common embedding access API offered by this project but obtains the word vectors from the server using a HTTP connection.
Jena Document Information System · A utility for managing documents stored in a PostgreSQL database. The documents are imported into a PostgreSQL DB as full texts with the goal to be able to retrieve the documents by their PubMedID efficiently. For more sophisticated tasks, a user configuration file can be delivered which can take control of the table schema to use, the PostgreSQL schema to use and the actual database server to connect to as well as the concrete database.
Semedico · The Semedico parent POM for all projects associated with Semedico (like semedico-core, semedico-frontend and semedico-tools; there might be more in the future).