BlackLab Core · BlackLab is a corpus retrieval engine written in Java, built on top of Apache Lucene. It allows fast, complex searches with accurate hit highlighting on large, tagged and annotated, bodies of text. It was developed at the Institute of Dutch Lexicology (INL) to provide a fast and feature-rich search interface on our historical and contemporary text corpora.
Group: nl.inl.blacklab - All Dependencies
BlackLab Server · BlackLab Server is a REST webservice on top of corpus search engine BlackLab. It allows you to use BlackLab from any programming language.
BlackLab Corpus Search · The parent project for BlackLab Core and BlackLab Server.
BlackLab Utilities · Several utility functions used by BlackLab.
BlackLab Tools · Some useful tools for BlackLab, such as IndexTool and QueryTool, as well as some test utilities.
BlackLab Text Pattern · Abstract query classes that can be translated into Lucene queries.
BlackLab Query Parsers · Query parsers for BlackLab, to translate queries to TextPatterns.
BlackLab Mocks · Mock implementations of several BlackLab classes for testing.
BlackLab legacy DocIndexers · Implementations of the DocIndexer class for various document formats. Custom DocIndexer implementations have been mostly superceded by configuration-file based indexing, but you can still use these or implement your own DocIndexer if you wish.
BlackLab Instrumentation Implementation · Instrumentation implementation for blacklab server. Including metrics
BlackLab Instrumentation · Instrumentation for blacklab server. Including metrics
BlackLab Engine · The BlackLab "engine" that actually executes queries. Functionality that uses the engine but is not required by it will be extracted into separate modules.
BlackLab Convert and tag indexer · Example of a DocIndexer that is able to convert several non-XML file types to XML and tag them.
BlackLab Content Store · Stores the documents as originally indexed.
BlackLab Common · Common classes shared among different BlackLab modules, such as exceptions.