FAIR@CLIB2026: FAIR Language Resources in NLP: Stewardship, Reuse and Long-Term Sustainability Sofia, Bulgaria, September 7, 2026 |
| Submission link | https://easychair.org/conferences/?conf=fairclib2026 |
FAIR Language Resources in NLP: Stewardship, Reuse and Long-Term Sustainability
Pre-conference Workshop at CLIB 2026
Sofia, Bulgaria
7 September 2026
Why this workshop and why now?
This workshop aims to bring together researchers, infrastructure providers, data stewards, and policy actors who are committed to building durable language resource ecosystems. We aim to address the pressing challenges of sustaining datasets used in linguistic research and in the development of NLP systems—from documentation and versioning to governance, licensing, and infrastructure support.
The workshop will explore how FAIR principles (Findable, Accessible, Interoperable, Reusable) can be meaningfully operationalised for language resources in NLP and computational linguistics.
Important Dates
- Submission deadline: 22 April 2026
- Notification of acceptance: 22 May 2026
- Camera-ready deadline: To be confirmed
- Workshop date: 7 September 2026
Submission Guidelines
The proposal considers short papers (4-6 pages), which will be delivered in 10 min slots.
List of Topics
1. Technical Foundations
Designing language resources so they are interoperable, transparent, and structurally reusable.
- Domain-specific FAIR implementation strategies for corpora, lexicons, datasets, and models
- Metadata, paradata, and annotation transparency frameworks
- Repository architectures and infrastructure design for linguistic data
2. Lifecycle & Reuse
Ensuring language resources remain usable, traceable, and measurable across research cycles.
- Documentation, versioning, and provenance tracking for evolving resources
- Persistent identifiers and citation mechanisms for language datasets
- Methods for tracking, measuring, and evidencing reuse
- Critical reflections and lessons learned from implementation challenges
3. Policy & Sustainability
Creating the institutional and legal conditions that allow language resources to endure.
- Legal, ethical, and licensing considerations in sharing and reusing language data
- Governance structures and sustainability models beyond project funding
Committees
Program Committee (TBC)
- TBC
Chairs
- Dr. Milena Dobreva (Unviersity of Strathclyde, IMI BAS)
- Dr. Ivan Lambov (IMI BAS)
Organizing committee
- Dr. Krassimira Ivanova
- Teodora Gandova
Invited Speakers (TBC)
- TBC
Publication
We are finalising the proceedings information.
Venue
The conference will be held in Sofia, Bulgaria.
Contact
All questions about submissions should be emailed to milena.dobreva AT ustrath.ac.uk.
Acknowledgement of support
The workshop organisation is partially supported by the project BG16RFPR002-1.016-0002 “ERA Chair in fostering digital cultural heritage via open innovations and open science” funded by the Programme “Research, Innovation and Digitalization for Smart Transformation” 2021-2027 (PRIDST) and co-funded by the European Union.
