Co-located with the fifteenth biennial Language Resources and Evaluation Conference (LREC), Palma, Mallorca (Spain).

Important information

To be updated.

Workshop description

The workshop is a continuation of the workshop series RESOURCEFUL, focusing on the role of resources in the age of large language models (LLMs).

The language resources community has long provided the empirical foundation for language technology, building datasets that have been crucial for development of NLP models. However, the introduction of large language models (LLMs), trained on vast and undisclosed texts, has disrupted this ecosystem. Traditional notions and methods of resource building are evolving. As LLMs have absorbed tons of publicly available data, the boundaries between training and evaluation sets are becoming blurred, and the very idea of “unseen” data is fading. Moreover, these models can now generate synthetic linguistic data, enabling the creation of new linguistic material for the models. This paradigm introduces new challenges and risks, particularly in the domain of evaluation. These shifting dynamics raise fundamental questions about how we evaluate models, ensure data transparency, and preserve the integrity of linguistic resources. The RESOURCEFUL 2026 workshop aims therefore at stimulating a critical dialogue on the methodological, ethical, and practical dimensions of data creation, authenticity, and representation in the age of LLMs.

The workshop aims to bring together researchers involved in the creation, validation, and evaluation of next-generation language resources. We invite contributions from all areas of language resource research, especially on (i) corpus and annotation design, (ii) evaluation and benchmarking methodologies, (iii) low-resource NLP and linguistic diversity, (iv) synthetic data generation and validation, (v) ethics, data governance, and reproducibility. We aim to promote a discussion between traditional resource builders, evaluation specialists, linguists, anthropologists, field researchers and LLM researchers, creating a shared forum to redefine the role of resources in NLP.

Topics of interest

We would like to open a forum by bringing together students, researchers, and experts to address and discuss the following points:

Contact

E-mail: resourceful at listserv dot gu dot se