Dataset of two experiments in the language pairs English-Russian, English-German. The dataset contains linguistic data (Segments translated in different methods) and human data (participants filled in questionnaires). The dataset contains translation versions across different steps of a team translation approach.
Technological development continues to offer ways of health data collection that go beyond asking survey questions. Such new types of data were collected in the SHARE survey by means of Dried Blood Spot Samples (DBSS) and accelerometer data. These data demand new data protection rules and must often be extensively processed, validated and calibrated before they can be made accessible. SHARE and CentERdata develop a strategy to provide access to such data according to FAIR principles.
Many different domains lack multilingual terminological resources. Making data and services accessible and usable in SSH is very much a matter of providing terminology across languages and multilingual vocabularies. Shortage of multilingual terminologies and vocabularies represents an obstacle to the access and reuse of information. Using the appropriate vocabularies can greatly improve both discovery and classification. Consequently, for SSHOC, it is important to address this issue with respect to the SSH domain.
The Audio Survey Experiment provides guidelines describing how to integrate the collection of digital language data into the traditional social sciences data collection process and provides audio transcript data which can be analysed with the help of natural language processing tools.