API
for text analytics, Natural Language Processing (NLP), corpus building and searching
The text analytics API lets other software use the NLP functionality of Sketch Engine. It gives programmatic access to the complete suite of text analysis tools in Sketch Engine and mirrors the functionality available through the web interface.
We offer complete functionality for at least 25 major languages. Certain features (e.g. morphological analysis or part-of-speech tagging) might not be available for all 100+ supported languages. Please contact us to check the extent of support for your language.
One-off jobs
If you need a large amount of language data processed, analysed or generated only once, it is usually more practical and cost-effective to request a one-off job. Examples include generating a database of all words in a language or processing a multi-billion-word data set into a text corpus. The Lexical Computing team will process the language data and deliver it in the format you specify.
Examples of text analytics API use
retrieval of
- keywords and terms
- collocations (co-occurrences)
- synonyms (thesaurus)
- Good Dictionary EXamples (GDEX)
text processing to obtain
- part-of-speech tagging
- lemmatization
- frequency counts (word lists)
- keywords and terms for the purpose of topic modelling
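As an illustration of how client software might call such an API for one of the tasks listed above, here is a minimal sketch that only composes a request URL and sends nothing over the network. Everything specific in it is an assumption made for the example, not taken from the API documentation: the placeholder base URL, the method name `collocations`, and the parameter names `corpname`, `lemma` and `format`. The real endpoint names, parameters and authentication scheme must be taken from the API documentation.

```python
from urllib.parse import urlencode

# Hypothetical placeholder; the real base URL comes from the API documentation.
BASE_URL = "https://api.example.com/run"


def build_request_url(method, corpus, base_url=BASE_URL, **params):
    """Compose a GET URL for one API method.

    All method and parameter names used here are illustrative assumptions.
    """
    # Default query: which corpus to use and the response format (assumed names).
    query = {"corpname": corpus, "format": "json"}
    # Method-specific parameters, e.g. the word to look up.
    query.update(params)
    # Sort the parameters so the same input always yields the same URL.
    encoded = urlencode(sorted(query.items()))
    return f"{base_url.rstrip('/')}/{method}?{encoded}"


# Example: a request for the collocations of "coffee" in a corpus named "my_corpus".
url = build_request_url("collocations", "my_corpus", lemma="coffee")
print(url)
```

Keeping URL construction separate from the HTTP call makes it easy to check a request before sending it and to reuse the same helper for other tasks, such as word lists or keyword extraction, by changing only the method name and its parameters.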
Supported languages
Please check the languages and the size of corpora we already have for each language.
The text analytics API is available for these languages
Complete functionality is available for 25+ major languages. Some features might not be available for the remaining supported languages. Please contact us for details.