RESOURCEFUL workshop series

Series description

The RESOURCEFUL workshop series is an active research initiative that draws attention to the problems of applying computational and linguistic tools to the collection, analysis, and application of resources comprising text, images, or other modalities. The initiative brings together researchers from natural language processing, linguistics, and related fields.

Previous editions

The first workshop focused on the size of resources available for natural language processing and on approaches to dealing with data bottlenecks. It was co-located with the 8th Swedish Language Technology Conference (SLTC), University of Gothenburg, Sweden.

The second edition of the workshop explored the role of the kind and quality of the resources available to us, as well as challenges and directions for constructing new resources in light of current trends in natural language processing, where language models (seem to) capture knowledge that traditionally had to be annotated. It was co-located with the 24th Nordic Conference on Computational Linguistics (NoDaLiDa), Tórshavn, Faroe Islands.

The third edition of the workshop focused on the relation between data-driven and expert-based resources in natural language processing, at a time when the former dominate and interest in expert-annotated resources for evaluating trained models is increasing. The workshop identified synergies between data-driven and human-collected resources, for example for model training and evaluation and, vice versa, for answering linguistic and socio-linguistic questions.

The fourth and upcoming edition of the workshop will focus on approaches and methods of resource creation and utilisation in the age of LLMs, when models have absorbed vast amounts of publicly available data and the boundaries between training and evaluation have become blurred. In parallel, synthetic linguistic data is being used to create new linguistic materials for the models. The workshop will address methods for evaluating models through resources, ensuring data transparency, ethical considerations, and preserving the integrity of linguistic resources.

ACL Anthology

The workshop proceedings are published in the ACL Anthology and NEALT Proceedings.

Interest group

Špela Arhar Holdt, University of Ljubljana, spela.arharholdt@ff.uni-lj.si
Micaella Bruton, Uppsala University, micaella.bruton@ling.su.se
Dana Dannélls, Språkbanken Text, University of Gothenburg, dana.dannells@svenska.gu.se
Simon Dobnik, CLASP, University of Gothenburg, simon.dobnik@gu.se
Nikolai Ilinykh, CLASP, University of Gothenburg, nikolai.ilinykh@gu.se
Crina Madalina Tudor, Stockholm University, crina.tudor@ling.su.se
Beáta Megyesi, Stockholm University, beata.megyesi@ling.su.se
Joakim Nivre, RISE and Uppsala University, joakim.nivre@lingfil.uu.se
Iben Nyholm Debess, The University of the Faroe Islands, IbenND@setur.fo
Barbara Scalvini, The University of the Faroe Islands, barbaras@setur.fo
Sara Stymne, Uppsala University, sara.stymne@lingfil.uu.se
Jörg Tiedemann, University of Helsinki, jorg.tiedemann@helsinki.fi
Lilja Øvrelid, University of Oslo, liljao@ifi.uio.no

Contact us

E-mail: resourceful at listserv dot gu dot se