{"id": "doi/10.5281/zenodo.14600018", "created": "2026-02-18T06:45:43.832982+00:00", "updated": "2026-04-14T02:06:14.516930+00:00", "links": {"self": "https://nma.eosc.cz/api/datasets/doi/10.5281/zenodo.14600018", "parent": "https://nma.eosc.cz/api/datasets/w25z4-9x538", "latest": "https://nma.eosc.cz/api/datasets/doi/10.5281/zenodo.14600018/versions/latest", "record": "https://nma.eosc.cz/api/datasets/doi/10.5281/zenodo.14600018", "versions": "https://nma.eosc.cz/api/datasets/doi/10.5281/zenodo.14600018/versions", "self_html": "https://nma.eosc.cz/datasets/records/doi/10.5281/zenodo.14600018", "latest_html": "https://nma.eosc.cz/datasets/records/doi/10.5281/zenodo.14600018/latest", "self_iiif_manifest": "https://nma.eosc.cz/api/iiif/record:doi/10.5281/zenodo.14600018/manifest", "self_iiif_sequence": "https://nma.eosc.cz/api/iiif/record:doi/10.5281/zenodo.14600018/sequence/default", "files": "https://nma.eosc.cz/api/datasets/doi/10.5281/zenodo.14600018/files", "media_files": "https://nma.eosc.cz/api/datasets/doi/10.5281/zenodo.14600018/media-files/files", "archive": "https://nma.eosc.cz/api/datasets/doi/10.5281/zenodo.14600018/files-archive", "archive_media": "https://nma.eosc.cz/api/datasets/doi/10.5281/zenodo.14600018/media-files/files-archive", "access_links": "https://nma.eosc.cz/api/records/doi/10.5281/zenodo.14600018/access/links", "access_grants": "https://nma.eosc.cz/api/records/doi/10.5281/zenodo.14600018/access/grants", "access_users": "https://nma.eosc.cz/api/records/doi/10.5281/zenodo.14600018/access/users", "access_groups": "https://nma.eosc.cz/api/records/doi/10.5281/zenodo.14600018/access/groups", "access_request": "https://nma.eosc.cz/api/records/doi/10.5281/zenodo.14600018/access/request", "access": "https://nma.eosc.cz/api/records/doi/10.5281/zenodo.14600018/access", "self_persistent_html": "https://nma.eosc.cz/s/doi/10.5281/zenodo.14600018"}, "revision_id": 31, "parent": {"id": "w25z4-9x538", "access": {"owned_by": {"user": "system"}, "settings": {"allow_user_requests": false, "allow_guest_requests": false, "accept_conditions_text": null, "secret_link_expiration": 0}}, "communities": {}, "pids": {}}, "versions": {"is_latest": true, "index": 1}, "is_published": true, "is_draft": false, "$schema": "local://datasets-v1.0.0.json", "editors": [{"id": "63", "full_name": "Maty\u00e1\u0161 Kopp", "affiliations": ""}], "metadata": {"publication_date": "2025", "persistent_url": "https://doi.org/10.5281/zenodo.14600018", "last_checked": "2026-04-14T02:06:14.379132Z", "check_status": "success", "check_message": "URL is accessible. Title found in content (exact match, score: 100)", "resource_type": {"id": "dataset", "title": {"cs": "Datov\u00e1 sada", "de": "Datensatz", "en": "Dataset", "es": "Conjunto de datos", "sv": "Dataset"}}, "creators": [{"person_or_org": {"type": "personal", "name": "\u00c7\u00f6ltekin, \u00c7a\u011fr\u0131", "given_name": "\u00c7a\u011fr\u0131", "family_name": "\u00c7\u00f6ltekin", "identifiers": [{"identifier": "0000-0003-1031-6327", "scheme": "orcid"}]}, "affiliations": [{"name": "University of T\u00fcbingen"}]}, {"person_or_org": {"type": "personal", "name": "Kopp, Maty\u00e1\u0161", "given_name": "Maty\u00e1\u0161", "family_name": "Kopp", "identifiers": [{"identifier": "0000-0001-7953-8783", "scheme": "orcid"}]}, "affiliations": [{"name": "Charles University, Faculty of Mathematics and Physics"}]}, {"person_or_org": {"type": "personal", "name": "Morkevi\u010dius, Vaidas", "given_name": "Vaidas", "family_name": "Morkevi\u010dius", "identifiers": [{"identifier": "0000-0002-2174-0396", "scheme": "orcid"}]}, "affiliations": [{"name": "Kaunas University of Technology"}]}, {"person_or_org": {"type": "personal", "name": "Ljube\u0161i\u0107, Nikola", "given_name": "Nikola", "family_name": "Ljube\u0161i\u0107", "identifiers": [{"identifier": "0000-0001-7169-9152", "scheme": "orcid"}]}, "affiliations": [{"name": "Jo\u017eef Stefan Institute"}, {"name": "University of Ljubljana"}, {"name": "Institute of Contemporary History"}]}, {"person_or_org": {"type": "personal", "name": "Meden, Katja", "given_name": "Katja", "family_name": "Meden", "identifiers": [{"identifier": "0000-0002-0464-9240", "scheme": "orcid"}]}, "affiliations": [{"name": "Institute of Contemporary History"}, {"name": "Jo\u017eef Stefan Institute"}]}, {"person_or_org": {"type": "personal", "name": "Erjavec, Toma\u017e", "given_name": "Toma\u017e", "family_name": "Erjavec", "identifiers": [{"identifier": "0000-0002-1560-4099", "scheme": "orcid"}]}, "affiliations": [{"name": "Research Centre of the Slovenian Academy of Sciences and Arts"}, {"name": "Jo\u017eef Stefan Institute"}]}], "title": "Training data for the shared task Ideology and Power Identification in Parliamentary Debates (2025)", "publisher": "Zenodo", "dates": [{"date": "2025-01-04", "type": {"id": "issued", "title": {"cs": "Vyd\u00e1no", "de": "Ver\u00f6ffentlicht", "en": "Issued", "es": "Publicado", "sv": "Utf\u00e4rdad"}}}], "related_identifiers": [{"identifier": "11356/1912", "scheme": "handle", "relation_type": {"id": "isderivedfrom", "title": {"cs": "Je odvozen od (\u010deho)", "de": "Wird abgeleitet von", "en": "Is derived from", "es": "Se deriva de", "sv": "H\u00e4rr\u00f6r fr\u00e5n"}}, "resource_type": {"id": "dataset", "title": {"cs": "Datov\u00e1 sada", "de": "Datensatz", "en": "Dataset", "es": "Conjunto de datos", "sv": "Dataset"}}}, {"identifier": "10.1007/s10579-021-09574-0", "scheme": "doi", "relation_type": {"id": "references", "title": {"cs": "Odkazuje na (co)", "de": "Referenziert", "en": "References", "es": "Referencias", "sv": "H\u00e4nvisar till"}}, "resource_type": {"id": "publication-article", "title": {"cs": "\u010cl\u00e1nek v \u010dasopise", "de": "Zeitschriftenartikel", "en": "Journal article", "es": "Art\u00edculo de revista", "sv": "Tidskriftsartikel"}}}, {"identifier": "10.1007/s10579-024-09798-w", "scheme": "doi", "relation_type": {"id": "references", "title": {"cs": "Odkazuje na (co)", "de": "Referenziert", "en": "References", "es": "Referencias", "sv": "H\u00e4nvisar till"}}, "resource_type": {"id": "publication-article", "title": {"cs": "\u010cl\u00e1nek v \u010dasopise", "de": "Zeitschriftenartikel", "en": "Journal article", "es": "Art\u00edculo de revista", "sv": "Tidskriftsartikel"}}}, {"identifier": "10.5281/zenodo.14600017", "scheme": "doi", "relation_type": {"id": "isversionof", "title": {"cs": "Je verz\u00ed (\u010deho)", "de": "Ist eine Version von", "en": "Is version of", "es": "Es versi\u00f3n de", "sv": "\u00c4r version av"}}}], "rights": [{"id": "cc-by-4.0", "title": {"en": "Creative Commons Attribution 4.0 International"}, "description": {"en": "The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited."}, "icon": "cc-by-icon", "props": {"url": "https://creativecommons.org/licenses/by/4.0/", "scheme": "spdx"}}], "description": "This dataset contains a selection of speeches from ParlaMint corpora (version 4.1) as the training set for \u00a0the shared task on \"Ideology and Power Identification in Parliamentary Debates\" in CLEF 2025.\n\nAll files are tab-separated text files with the following fields:\n\n\n\n\"id\" is a unique (arbitrary) ID for each text.\n\n\"speaker\" is a unique (arbitrary) ID for each speaker. There may be multiple speeches from the same speaker.\n\n\"sex\" is the (binary/biological) sex of the speaker. This information is collected from varying sources (typically data published by the respective parliament), and in some cases it may be unspecified or unknown.\n\n\"text\" is the transcribed text of the parliamentary speech. Real examples may include line breaks, and other special sequences escaped or quoted.\n\n\"text_en\" is an automatic English translation of the corresponding text. This field may be empty (obviously) \u00a0for speeches in English, but the translations may be missing for a small number of non-English speeches as well.\n\n\"orientation\" is the binary/numeric label ( 0 is left and 1 is right). Orientation labels are based on Wikipedia.\n\n\"power\" is the binary label for power role (0 is opposition, 1 is coalition), this information is based on the information provided by the ParlaMint contributors. This value is not always present, either due to parliamentary systems with no defined coalition/opposition, or unknown orientation information for some speakers (e.g., PMs with no party affilitiation). Missing values are indicated as 'NA'.\n\n\"populism\" is the populism index based on Global Party Survey (GPS). This is a 4-point ordinal scale (1: Strongly Pluralist, 2: Moderately Pluralist 3: Moderately Populist, 4: Strongly Populist). Not all values are present in all parliaments. Many parties/speakers are not covered by GPS data, and some values are missing due to failure to match the GPS and ParlaMint identifiers. Missing values are indicated as 'NA'.\n\n\nSmall samples of the data files are provided in the shared task GitHub repository at https://github.com/coltekin/ideology-power-st-baseline.\n\nFile names include a code for the parliament. We provide data from the following national and regional parliaments.\n\n\n\nAustria (at)\n\nBosnia and Herzegovina (ba)\n\nBelgium (be)\n\nBulgaria (bg)\n\nCzechia (cz)\n\nDenmark (dk)\n\nEstonia (ee)\n\nSpain (es)\n\nCatalonia (es-ct)\n\nGalicia (es-ga)\n\nBasque Country (es-pv)\n\nFinland (fi)\n\nFrance (fr)\n\nGreat Britain (gb)\n\nGreece (gr)\n\nCroatia (hr)\n\nHungary (hu)\n\nIceland (is)\n\nItaly (it)\n\nLatvia (lv)\n\nThe Netherlands (nl)\n\nNorway (no)\n\nPoland (pl)\n\nPortugal (pt)\n\nSerbia (rs)\n\nSweden (se)\n\nSlovenia (si)\n\nTurkey (tr)\n\nUkraine (ua)\n\n\nThe number of training instances and the class imbalance differs for each training set. We do not provide a fixed validation split. Please see the shared task website and the GitHub repository for further description of the data set and the sampling process."}, "files": {"enabled": false, "order": [], "count": 0, "total_bytes": 0, "entries": {}}, "pids": {"oai": {"identifier": "oai:https://nma.eosc.cz:doi/10.5281/zenodo.14600018", "provider": "oai"}}, "access": {"record": "public", "files": "public", "embargo": {"active": false, "reason": null}, "status": "metadata-only"}, "media_files": {"enabled": false, "order": [], "count": 0, "total_bytes": 0, "entries": {}}, "status": "published", "deletion_status": {"is_deleted": false, "status": "P"}, "stats": {"this_version": {"views": 6, "unique_views": 6, "downloads": 0, "unique_downloads": 0, "data_volume": 0.0}, "all_versions": {"views": 6, "unique_views": 6, "downloads": 0, "unique_downloads": 0, "data_volume": 0.0}}, "custom_fields": {}}