Submission 14001
Research data

Link-Lives release 2 (1787-1921)

Purpose

The aim of the research project Link-Lives is to extend the range of register-research, based on Danish data, from decades to centuries. By combining historical research and data science methods, we reconstruct life-courses and family relations of almost everyone who lived in Denmark from 1787 until the introduction of the modern Danish civil registration system (CPR) in 1968. The project is funded by the Innovation Fund Denmark and the Carlsberg Foundation and runs from 2019 to 2025.

Contents

Link-Lives release 2 is the project's second data release. It contains 64 million person registrations from three types of sources, along with the links and life courses created from them. The transcribed person registrations come from:
1) Censuses (the years 1787, 1801, 1834, 1840, 1845, 1850, 1860, 1880, 1885, 1901), originally transcribed by volunteers coordinated by the National Archives (Dansk Demografisk Database).
2) The Copenhagen Burial Register (1861-1911), transcribed by volunteers at Copenhagen City Archives.
3) Parish Registers (1813-1917), transcribed by Ancestry.

The datasets include core information such as name, age or date of birth, gender, marital status, address, occupation and position in household for most individuals. They also contain information about family relations and events (births, confirmations, marriages, deaths, arrivals and departures) with dates, places, etc. The data is provided in three formats: the original transcription, a version harmonized by Link-Lives, and a version for use in our software for computer-assisted manual record linkage (ALA).

Additionally, there are over 16 million links, each identifying the same individual in different datasets. The links are provided in three groups:
1) A benchmark dataset created by domain experts, used for training and testing.
2) Two sets of links, one created with a rule-based algorithm and one with machine learning implementations of the XGBoost algorithm.
3) Two sets of life courses created with the two methods.

Given the different origins and ownership of the original datasets, not all the data can be downloaded. Access to the Parish Register data requires permission from Ancestry, and the originally transcribed version of the Copenhagen City Archives datasets must be obtained from Copenhagen City Archives (https://arkivfinder.dk/kbharkiv/forskningsdata/e29f494a-e3c9-11ef-8b13-2399cb4e9f41).
There will be a final Link-Lives release with more sources, further standardization and different linking methods, when the research project Link-Lives ends. By downloading Link-Lives data, you agree to the conditions for use as described in the Link-Lives Release 2 Guide.

Tables

No table information is available.
You can apply for access to table information by clicking 'Søg om adgang' ('Apply for access').

Keywords

There are no keywords.

Period covered

From 01-01-1787 to 01-01-1921.

ID information in data
  • None