What we do
Sketch Engine is leading corpus management software, in development since 2003. It allows people to study how words behave in context.
Try Sketch Engine!
SkELL is a very simplified version of Sketch Engine – a web interface for English language learning targeting students and teachers of English.
We specialise in building, managing and analysing very large (billions of words) text datasets known as text corpora. We have corpora for over 80 languages. See our overview of corpora.
We can deliver corpus-derived wordlists with a range of statistical characteristics (raw frequency, document frequency, n-gram statistics etc.) as well as advanced annotation such as part-of-speech tagging or lemmatization. See our overview of languages.
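To make the statistics named above concrete, here is a minimal Python sketch of how raw frequency, document frequency and n-gram counts could be computed for a toy corpus. It is an illustration only, not how Sketch Engine computes its wordlists: the function name wordlist_stats, the sample documents and the whitespace tokenization are all assumptions made for this example.

```python
from collections import Counter

def wordlist_stats(documents, n=2):
    """Hypothetical helper: count raw frequency, document frequency
    and n-grams over a list of plain-text documents."""
    raw_freq = Counter()   # total occurrences of each word
    doc_freq = Counter()   # number of documents containing each word
    ngrams = Counter()     # occurrences of each n-word sequence
    for text in documents:
        # Naive tokenization (assumption): lowercase and split on whitespace.
        tokens = text.lower().split()
        raw_freq.update(tokens)
        doc_freq.update(set(tokens))  # each word counted once per document
        # Slide a window of n tokens across the document.
        ngrams.update(zip(*(tokens[i:] for i in range(n))))
    return raw_freq, doc_freq, ngrams

# Toy corpus, invented for illustration.
docs = ["the cat sat on the mat", "the dog sat", "a cat and a dog"]
raw, doc_freq, bigrams = wordlist_stats(docs)
print(raw["the"], doc_freq["the"])  # occurrences vs. documents containing "the"
```

Raw frequency counts every occurrence of a word, while document frequency counts each word at most once per document, which is why the two values can differ for the same word.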
We have developed or integrated a large number of tools for corpus building and processing, including an efficient web crawler, a deduplication and boilerplate removal toolkit and a universal tokenizer. Most of the tools are open source and available through our corpus.tools portal.
Terminology and Translation
We provide solutions for monolingual and bilingual terminology extraction and terminology checking, as well as tools that assist translators by drawing on our large multilingual text corpora. See our overview of features.
We have long-standing experience in legal counselling and in providing expert witness services, thanks to our collaboration with top experts in lexicography and linguistics around the world.
Get in touch!
For over 10 years we have provided consultancy in lexicography and corpus linguistics, advising on dictionary projects and corpus building initiatives for many languages. Our solutions have been used to produce dictionaries and corpora worldwide.
Get in touch!