The 2026 episode at Faculty of Mathematics, Physics and Informatics of Comenius University

This site lists various ideas that could be explored as possible projects as part of the NLP course. The list is far from exhaustive, but represents (current) course instructor’s interests.

All of the ideas assume the final project will contain some “novelty bits”, either in the explored idea itself, its execution, practical usability or the underlying dataset.

Creating a new dataset for any of the NLP tasks listed below (or any other, really) is a huge plus.

Shared tasks#

Shared tasks are essentially “academic Kaggle”: you get a task, some data and produce a model that tries to do well on it. During the evaluation period, you normally produce a prediction on the test set. It’s a relatively straightforward way of going from a task to some solution, while not having to bother with the difficult part of finding an appropriate dataset. Further, there are highly likely other people working on the same task, so it’s really a bit of a “competition” (although that’s really not what it’s about and why it’s done).

CLEF 2026#

CLEF 2026 is the best fit for our semester: training data is available from February/March, with evaluation runs due May 7 and working notes due May 28. Registration is open until April 23. Here are some particularly relevant labs:

Or any other CLEF 2026 lab!

NTCIR-19#

NTCIR-19 (Tokyo, December 8–10) has formal run submissions in June–July 2026.

IberLEF 2026#

IberLEF 2026 runs shared tasks in Spanish and other Iberian languages (workshop: September 22). A good fit if you have some Spanish.

Or any other IberLEF 2026 task!

Project MIMEDIS#

The project’s aim is to “study the impact of media discourse on attitudes towards migration, migrants and migration policy in Slovakia”. As such, there are many classification tasks that can be explored in that regard.

You can find more about the project at https://cogsci.fmph.uniba.sk/MIMEDIS/index.html.

Your own idea!#

Feel free to come up with an idea on your own -- if you are working on something NLP-related for your thesis, that would be a good candidate. But in general, I’d be happy to talk about any NLP-related idea you may have.

Alternatively feel free to check out the sites below, find an NLP task you find interesting and see if you can make an interesting project out of it!