Slovak Financial Exam

This dataset contains 1,334 multiple-choice questions from the financial domain in the Slovak language. It was created to address the limited availability of language resources for Slovak, providing a benchmark for evaluating language models' capabilities in a specialized, lo...


SK-QuAD Retrieval

SK-QuAD Retrieval is a dataset derived from SK-QuAD and designed to evaluate Slovak search performance using metrics such as MRR, MAP, and NDCG. It features questions and answers sourced from a search engine before annotation. The annotated data assigns categories...
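As a rough illustration of the three metrics named above, here is a minimal Python sketch. It is not taken from the dataset's own evaluation code: the function names, the toy ranked lists, and the simplification that every relevant document appears somewhere in the ranked list are all assumptions made for the example.

```python
import math

def reciprocal_rank(relevances):
    """1 / rank of the first relevant result; 0.0 if none is relevant."""
    for rank, rel in enumerate(relevances, start=1):
        if rel > 0:
            return 1.0 / rank
    return 0.0

def average_precision(relevances):
    """Mean of precision@k over the ranks k that hold a relevant result.

    Assumption: all relevant documents appear in the ranked list, so the
    number of hits found equals the total number of relevant documents.
    """
    hits = 0
    precision_sum = 0.0
    for rank, rel in enumerate(relevances, start=1):
        if rel > 0:
            hits += 1
            precision_sum += hits / rank
    return precision_sum / hits if hits else 0.0

def dcg(relevances):
    """Discounted cumulative gain with a log2(rank + 1) discount."""
    return sum(rel / math.log2(rank + 1)
               for rank, rel in enumerate(relevances, start=1))

def ndcg(relevances):
    """DCG divided by the DCG of the same labels in ideal (sorted) order."""
    ideal = dcg(sorted(relevances, reverse=True))
    return dcg(relevances) / ideal if ideal else 0.0

def mean(values):
    values = list(values)
    return sum(values) / len(values) if values else 0.0

# Hypothetical relevance labels for two queries, in ranked order
# (1 = relevant, 0 = not relevant). Not real SK-QuAD Retrieval data.
runs = [[0, 1, 0], [1, 0, 1]]

mrr = mean(reciprocal_rank(r) for r in runs)
map_score = mean(average_precision(r) for r in runs)
mean_ndcg = mean(ndcg(r) for r in runs)
print(f"MRR={mrr:.3f} MAP={map_score:.3f} NDCG={mean_ndcg:.3f}")
```

MRR only looks at the first relevant hit, MAP rewards placing all relevant results early, and NDCG also works with graded (non-binary) relevance labels.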


Slovak Question Answering Dataset


Downloads

License: Attribution-ShareAlike 4.0 International (CC BY-SA 4.


The main goal of this work is to enable research on natural language processing of social media and colloquial speech in the Slovak language. This corpus was presented at the PolTAL 2014 conference.

Download

Bibliography

D. Hládek, J. Staš, J. Juhár: Slovak...


This corpus is a first attempt at a representative sample of the contemporary Slovak language from various domains, built for easy searching and automated processing. It contains a selection of news articles processed by our NLP tools.

The corpus consists of two parts. The first part...
