Proteverb – Legal, ethical and technological aspects of processing textual and speech data for scientific, research and development purposes
ABOUT THE PROJECT
From the perspective of language technology development, Slovenian is a low-resource language: few digital resources are available for it. As a result, computer science research and the development of products based on natural language processing progress significantly more slowly than for languages with abundant digital resources. To acquire language resources adequately and to make secondary use of them in as natural a form as possible, which may include some personal data, the General Data Protection Regulation (GDPR) and its exceptions for research purposes must be interpreted carefully. These exemptions make the aims of this targeted research project achievable. For the first time in Slovenia, the project will systematically address the acquisition and processing of (personal) data in a way that serves the interests of science. Through a pilot application, it will contribute new insights and practices to the development of both science and the economy.
The research project will bring together, intertwine and deepen the knowledge of several different scientific disciplines in the field of social sciences, natural sciences, technical sciences and humanities. Such synergies are essential to ensure that advances in technological development are understood in the appropriate context and regulated in a way that maximises societal benefits and simultaneously minimises negative impacts and interference with ethical and legal standards and human rights. Such a comprehensive approach is the only way to develop the concept of open science to the full extent.
The project will primarily make a significant contribution to the development of three scientific fields, namely law, informatics and computer science, and humanities.
In all three fields, the research team will promptly transfer its findings into undergraduate and postgraduate study programmes at national and foreign universities, both by involving students in the development of the above-mentioned technologies and through the teaching of the project's researchers, who also work as professors at various universities.
The academic results of this project will overcome key obstacles to the advancement of science and will optimise the use of data for research purposes without violating legal standards and human rights.
Project Type: Target research programme
Project no: V5-2265
Project duration: 1 October 2022 – 31 March 2025
PROJECT CONTENT
The targeted research project will be divided into several phases:
- We will examine the legal framework for data processing for research and scientific purposes. The starting point will be the General Data Protection Regulation (GDPR) and the Slovenian Personal Data Protection Act (ZVOP-1), which we will build upon through a comparative legal analysis and by monitoring the development of the ZVOP-2 legislative proposal.
- We will look at the current practices of data collection for scientific research purposes, with an interest both in the access to data by researchers and research organisations and in the experience of data sharing by public authorities and institutions (e.g. courts). We will identify the key risk factors that have prevented access to data in the past, in order to develop a protocol to protect privacy in the course of data processing for scientific research purposes.
- The project will develop procedures for appropriate data access and anonymisation, based on adapting and improving existing anonymisers. Recommendations will be made on machine-learning-based methods for the biometric anonymisation of speech recordings, with the aim of reducing the impact of anonymisation on the reliability of automatic speech recognisers.
- We will attempt to acquire data in a pilot, using the privacy protocol and data access procedures, including anonymisation. The pilot part of the research will consist of preparing the groundwork for data acquisition, taking over the data, anonymising it, and organising the documentation, procedures and rules needed for data processing within the research institution. On the basis of the data obtained in the pilot, we will specialise a text anonymiser and a speech recogniser for the Slovene language.
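To make the idea of text anonymisation more concrete, the following is a minimal, rule-based sketch in Python. It is not the project's anonymiser: the patterns, the `anonymise` function, the placeholder tags and the sample sentence are all hypothetical, chosen only to illustrate how personal data in a text might be replaced with neutral labels. A production tool for Slovene would likely rely on named-entity recognition and handle inflected name forms.

```python
import re

# Hypothetical patterns for illustration only; a real anonymiser would need
# far broader coverage (addresses, ID numbers, inflected names, and so on).
PATTERNS = {
    "EMAIL": re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+"),
    "PHONE": re.compile(r"\+?\d[\d /-]{6,}\d"),
}


def anonymise(text: str, known_names: list[str]) -> str:
    """Replace e-mail addresses, phone numbers and listed names with tags."""
    for label, pattern in PATTERNS.items():
        text = pattern.sub(f"[{label}]", text)
    # Longest names first, so a full name is replaced before a bare first name.
    for name in sorted(known_names, key=len, reverse=True):
        text = re.sub(rf"\b{re.escape(name)}\b", "[PERSON]", text)
    return text


sample = "Janez Novak (janez.novak@example.com, 041 123 456) filed a complaint."
print(anonymise(sample, ["Janez Novak"]))
# prints: [PERSON] ([EMAIL], [PHONE]) filed a complaint.
```

Replacing personal data with typed tags rather than deleting it keeps the sentence structure intact, which matters when the anonymised text is later reused as a language resource.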
PROJECT LEADER AND CONSORTIUM PARTNERS
The project is led by the Institute of Criminology at the Faculty of Law in Ljubljana.
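Consortium Partners:
- Faculty of Electrical Engineering, University of Ljubljana
- Faculty of Computer & Information Science, University of Ljubljana
- “Jožef Stefan” Institute
Project members:
- Aleš Završnik, project leader
- Simon Dobrišek, Faculty of Electrical Engineering, University of Ljubljana
- Kristina Lazarevič Padar
- Marko Bajec, Faculty of Computer & Information Science, University of Ljubljana
- Iva Ramuš Cvetkovič
- Saša Krajnc
- Simon Krek, “Jožef Stefan” Institute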
PROJECT RESULTS AND ACHIEVEMENTS
Original scientific article
- ZAVRŠNIK, Aleš. Criminal justice, artificial intelligence systems, and human rights. Ûridičeskie nauki i obrazovanie. 2023, no. 70, pp. 150-164. ISSN 2304-1730. http://www.iolr.org/wp-content/uploads/2023/04/Zavrsnik-A.-Criminal-justice….pdf. [COBISS.SI-ID 159764739]
- ZAVRŠNIK, Aleš, RAMUŠ CVETKOVIČ, Iva, LAZAREVIČ PADAR, Kristina, STARIHA, Andraž. Nadzor nad podatki in raziskovanje v kriminologiji. Revija za kriminalistiko in kriminologijo. Jan.-Mar. 2024, vol. 75, no. 1, pp. 72-89. ISSN 0034-690X. [COBISS.SI-ID 192594691]
Professional article
- ZAVRŠNIK, Aleš. Umjetna inteligencija u krivičnom pravosođu : uticaj na prava čovjeka. Pravo i pravda : časopis za pravnu teoriju i praksu. 2023, vol. 21, no. 1, pp. 173-192. ISSN 1512-8571. [COBISS.SI-ID 155372291]
Unpublished conference contribution
- ŠARF, Pika. The Thin Line Between Personal and Anonymized Data in the Digital Age: Lecture, Conference on Data Protection Law, Portorož, November 15, 2022. [COBISS.SI-ID 150492163]
- ZAVRŠNIK, Aleš. Artificial Intelligence and Criminal Justice: Lecture, International Scientific and Practical Conference “Digital Forensics in the Modern World: Problems of Theory and Practice”, Tashkent, May 5, 2023. [COBISS.SI-ID 159767299]
- ZAVRŠNIK, Aleš. Crime and Data: Lecture at the 4th Conference on Information Security Law, Portorož, March 16, 2023. [COBISS.SI-ID 159392003]
- ZAVRŠNIK, Aleš. Artificial Intelligence and Criminal Justice: Lecture, International Seminar “Digitalization in Law, Privacy Protection, and Automation – DPZPA”, Sarajevo, May 18, 2023. [COBISS.SI-ID 159770371]
Published scientific conference contribution abstract
- ZAVRŠNIK, Aleš. Fair Trial Implications of Automation in Criminal Justice Systems. In: 2023 ASC Annual Meeting: Seeking Justice: Reconciling with Our Past, Reimagining the Future, Philadelphia, November 15–18, 2023. [S.l.]: American Society of Criminology, 2023. 1 online resource. [Online]: https://convention2.allacademic.com/one/asc/asc23/index.php?cmd=Online+Program+View+Paper&selected_paper_id=2075050&PHPSESSID=7p1hcnhkka36t6t7ki72oj4i07. [COBISS.SI-ID 183045891]
Other
- Organization of the Autumn School titled “Law Facing the Challenges of the Digital (R)evolution”, November 22, 2024. [Online]: https://www.inst-krim.si/category/jesenska-sola/
- KREK, Simon. Copyrights, the Grave of the Slovenian Language. Dnevnik. [Print edition]. November 29, 2022, vol. 72, no. 276, p. 17, in Slovenian. ISSN 1318-0320. [Online]: https://www.dnevnik.si/1043001868/Kultura/jezikolumna-avtorske-pravice-slovenskega-jezika-grob, https://trojina.si/2022/12/14/avtorske-pravice-slovenskega-jezika-grob/. [COBISS.SI-ID 184173827]
- SPLICHAL, Slavko (interviewee), BRATKO, Ivan (interviewee), KRONEGGER, Luka (interviewee), KALUŽA, Jernej (interviewee), KREK, Simon (interviewee), ŠARF, Pika (interviewee), GORJANC, Vojko (interviewee). The Summer of Artificial Intelligence. Ljubljana: Radiotelevizija Slovenija, Public Institution, 2023. 1 online resource (1 audio file, 22 min 12 sec). Vroči mikrofon (Hot Microphone). [Online]: https://val202.rtvslo.si/podkast/vroci-mikrofon/584/174960060. [COBISS.SI-ID 154859523]
- SPLICHAL, Slavko (interviewee), BRATKO, Ivan (interviewee), KREK, Simon (interviewee), KALUŽA, Jernej (interviewee), ŠARF, Pika (interviewee). The Datafication of Society. Ljubljana: Radiotelevizija Slovenija, Public Institution, 2023. 1 online resource (1 audio file, 9 min 11 sec). Aktualna tema (Current Affairs). [Online]: https://365.rtvslo.si/arhiv/aktualna-tema/174958360. [COBISS.SI-ID 154855171]
- SLAČEK, Nina (interviewer), BOGATAJ JANČIČ, Maja (interviewee), CVAR, Nina (interviewee), DOBRANIĆ, Filip (interviewee), ZAVRŠNIK, Aleš (interviewee). Who Will Benefit and Who Will Suffer from the New Artificial Intelligence? Ljubljana: Radiotelevizija Slovenija, Public Institution, 2023. 1 online resource (1 audio file, 50 min 52 sec). Intelekta (Intellect). [Online]: https://365.rtvslo.si/arhiv/intelekta/174964772. [COBISS.SI-ID 155695363]
Invited lecture at foreign university
- ZAVRŠNIK, Aleš. AI and the Penal System: lecture at the module “Artificial Intelligence and Intellectual Property”, Strasbourg University, Center for International Intellectual Property Studies, January 24, 2023, Zoom. [COBISS.SI-ID 142227715]
- ZAVRŠNIK, Aleš. AI and the Penal System: lecture at Università Cattolica del Sacro Cuore, Milan, September 12, 2023, Zoom. [COBISS.SI-ID 164318467]
- ZAVRŠNIK, Aleš. Artificial Intelligence in the Judiciary: Opportunities and Risks of Algorithmic Governance: lecture at the Faculty of Organizational Sciences, University of Belgrade, and the Research and Development Institute for Artificial Intelligence of Serbia, Novi Sad, December 8, 2023. [COBISS.SI-ID 178939651]
Other lectures
- ZAVRŠNIK, Aleš. Artificial Intelligence and Social Harm: lecture at the Constitutional Court of the Republic of Slovenia, Ljubljana, November 8, 2023. [COBISS.SI-ID 183957251]