Corpus.Tools is a joint portal of Masaryk University's NLP Centre and Lexical Computing, dedicated to a range of software tools for text corpus processing, including the widely used corpus software Sketch Engine.
It offers advanced corpus tools for language processing and research. There are tools for corpus analysis and corpus building, helping linguists, experts in language technology, and NLP engineers process efficiently large language data.
These corpus tools streamline working with large text datasets across many languages. They are designed to clean and deduplicate documents and text data, compile and annotate them, and to analyse them using linguistic and statistical criteria. The tools are language-independent, suitable for major languages as well as low-resourced and minority languages.
If you have questions, join the NoSketch Engine Google group to connect with the developers and other users.
| Licence