Datasets

A set of text essay files marked up and unmarked. Designed for contestants.
Datasets are regularly updated on the IT-platform.
Join the contest to get the full access.

SAMPLE DATASET IN RUSSIAN

Training set
Training set
This dataset contains annotated Russian text data for training the algorithms.
Unmarked set
Unmarked set
This dataset contains original unlabeled Russian text data.

SAMPLE DATASET IN ENGLISH

Training set
Training set
This dataset contains annotated English text data for training the algorithms.
Unmarked set
Unmarked set
This dataset contains original unlabeled English text data.
Datasets will be updated. Register to participate
JOIN US
to get full access to the datasets, markup platform
and decision comparison program
REGISTER

SIGN UP FOR OUR NEWSLETTER

Thank you!