Source Datasets

A pie graph displaying information about the four defining features of the LINCS Datasets: Canadian, Cultural, Research, and Linked.

LINCS will incorporate datasets from a diverse array of researchers, institutions, and areas of focus.


There are large bodies of content on the web related to Canadian culture and history, both smaller sets of materials created by researchers and large digitized collections held by memory institutions like Canadiana and Library and Archives Canada. We need better ways of accessing this content.


There are also millions of books, periodicals, and other materials that have been digitized by groups such as the Internet Archive, HathiTrust, and Project Gutenberg, as well as much native web content relevant to cultural research. LINCS will provide new ways of discovering and using these kinds of materials.


Datasets carefully curated by Canadian researchers are at the core of LINCS, which will mobilize this material and interlink it with other related content. The Source Datasets are rich and diverse, as are the Research Themes that will be developed by linking them.


LINCS builds on much existing work towards an open, semantically structured web, from W3C standards and established ontologies to major community projects such as DBpedia and Wikidata. We aim to strengthen the Linked Open Data ecology through high-quality open content and open-source tools.

These source datasets, with their various formats and origins, will work through a range of conversion processes in LINCS.