ParlaCAP - Comparing agenda settings across parliaments via the ParlaMint dataset

ParlaCAP is an OSCARS Open Science cascading grant project focused on extending the usage of the open, comparable corpora of parliamentary debates ParlaMint to researchers in social sciences and beyond. The project leverages advanced natural language processing to analyse political agendas and sentiments in debates from 28 European parliaments. The automatic coding of agendas throughout a wide dataset of more than 8 million speeches, given in more than 20 languages, has become possible recently with significant developments in natural language processing and artificial intelligence, allowing for multilingual transformer models to provide both highly consistent and accurate codings. By integrating the ParlaMint dataset and the Comparative Agendas Project's coding scheme, the project will create a comprehensive, FAIR dataset for comparative political research, enhancing transparency and accountability in legislative discourse across Europe.

Project start date: 1 January 2025 Project duration: 24 months
See also the project description at the OSCARS website.

This project is funded by the OSCARS project's cascading grant, which has received funding from the European Commission’s Horizon Europe Research and Innovation programme under grant agreement No. 101129751.

Operated by

Data

The ParlaCAP dataset and other freely-available datasets related to the ParlaCAP project.

Models

Multilingual models fine-tuned on the tasks of CAP topic schema classification and sentiment identification.

Tutorials

Step-by-step guides for using the ParlaCAP dataset.

Publications

Publications connected with the ParlaCAP project.

Events

Upcoming and past events related to this project: talks and workshops.

Partners

Contact

If you have any questions about the ParlaCAP project, its datasets, models, or any other project-related content, we will be happy to help.