Language diversity & digital infrastructures workshop 2018



Linguistic diversity, minority languages and digital research infrastructures

Hamburg, September 20-21 2018

The workshop addresses the requirements on and potential of current developments in digital research infrastructures for the Humanities for research within the fields of linguistic diversity, minority and endangered languages, and further related so called ‘small disciplines’ (see announcement here).



The venue of the workshop is the Universität Hamburg building which accommodates the collaborative office of the Hamburg Centre of Language Corpora (HZSK) and the long-term research project INEL (“Grammatical Descriptions, Corpora and Language Technology for Indigenous Northern Eurasian Languages”) carried out at the Institute for Finno-Ugric/Uralic Studies (IFUU).

The building is located in just a few minutes’ walk from the Hamburg-Altona railway/S-Bahn station:
Max-Brauer-Allee 60, 22765 Hamburg, Room 233 (2nd floor)



Thursday, September 20
Time Talk
14:00-14:40 Nikolett Mus (Research Institute for Linguistics, Hungarian AS, Budapest)

Theoretical and experimental approaches to dialectal variation and contact-induced change:
A case study of Tundra Nenets

14:40-15:20 Elena Kryukova (Tomsk State Pedagogical University)

Manuscript archives: problems of digitization and accessibility

coffee & tea
15:40-16:20 Timofey Arkhangelskiy (Universität Hamburg, Humboldt-Stipendiat)

Handling multilingual content in social media corpora in Uralic languages

16:20-17:00 Svetlana Toldova (Universität Hamburg, Visiting Fellow & NRU Higher School of Economics, Moscow)

Towards semi-automated shallow syntax: parsing FLEx data


Friday, September 21
Time Talk
09:00-09:40 Hanna Hedeland, Daniel Jettka (Universität Hamburg)
Corpus workflows: formalization and operationalization of workflows for the creation and publication of language corpora
09:40-10:20 Felix Rau (Universität zu Köln)

Applications and limits of machine learning for language documentation resources

tea & coffee
10:40-11:20 Josep Quer, Roland Pfau, Carlo Cecchetto, Pietro Braione & Carlo Geraci
(Universitat Pompeu Fabra, Barcelona; Universiteit van Amsterdam; Università degli studi di Milano; Consorzio Interuniversitario Nazionale per l’Informatica; Centre National de la Recherche Scientifique)
Building sign language repositories for linguistic and cultural purposes: the SIGN-HUB database of life narratives by elderly signers
11:20-12:00 Michael Daniel (NRU Higher School of Economics, Moscow)

Approaching dialect loss: experiences from Ustja River Basin Corpus project

12:00-12:30 Closing discussion



Alexandre Arkhipov (Universität Hamburg)

Rogier Blokland (Uppsala Universitet)

Jonathan Blumtritt (Universität zu Köln)

Chris Lasse Däbritz (Universität Hamburg)

Vera Ferreira (Endangered Languages Archive, London)

Sven Grawunder (Max Planck Institute for Evolutionary Anthropology, Leipzig)

Thomas Hanke (Universität Hamburg)

Timm Lehmberg (Universität Hamburg)

Tommi Pirinen (Universität Hamburg)

Thomas Schmidt (Institut für Deutsche Sprache, Manheim)

Mandana Seyfeddinipur (Endangered Languages Archive, London)

Beáta Wagner-Nagy (Universität Hamburg)

Jan Patrick Zeller (Universität Hamburg)



Throughout the workshop, posters from the INEL project (“Grammatical Descriptions, Corpora and Language Technology for Indigenous Northern Eurasian Languages”) will also be on display and project members will be glad to comment on them.
Maria Brykina, Svetlana Orlova. INEL Selkup corpus

Chris Lasse Däbritz. INEL Dolgan corpus

Anne Ferger. INEL: Linguistic data workflows: requirements and best practice

Valentin Gusev. INEL Kamas corpus

Daniel Jettka. INEL: Catalogization of linguistic resources: a graph-based approach