# Metagrid Data
This repository collects and updates all the data imported into metagrid. We usually try to crawl data from existing websites, but some providers only offer CSV dumps because they cannot provide a valid sitemap, do not have a website, or have no structured data.
## Providers
- parlament.ch (rm-1848-inkl-id-1.csv). Members of the Swiss parliament since 1848
- ethz.ch (2017-09_Hochschularchiv_ETH_Z-rich_GND_VEs_puliziert_aufbereittet.csv). Persons in the archive of ETH Zurich
- SSRQ (SSRQ_pers_26_11_2015.csv). Persons in the database of the "Sammlung Schweizerischer Rechtsquellen"
- BSG (bsg.csv). All persons in the "Bibliographie zur Schweizergeschichte"
- Helveticat (helveticat.csv). All authors from the catalog of the "Schweizerische Nationalbibliothek (NB)"
- ETH Hochschularchiv (2019-02_Hochschularchiv_ETH_Zürich_GND_VEs_puliziert.csv). All professors of ETH Zurich
- Geschichte der Sozialen Sicherheit (geschichtedersozialensicherheit.csv). The CSV is generated by a spider; see the corresponding project

## Import
For each provider there is a dedicated importer in metagrid. Importers handle the mapping between the CSV and metagrid's data model. To start an import, use the following command:
```bash
php bin/console MetagridApiBundle:import:csv <path to csv> <slug of the provider>
```
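
When several files need to be imported, the command above can be generated per provider. The following is only a sketch: the file-to-slug pairs are hypothetical (the real slugs are defined in metagrid and are not listed in this repository), and `import_command` is a helper name introduced here for illustration. It prints the commands instead of running them, so they can be checked first.

```bash
#!/usr/bin/env bash
set -euo pipefail

# Hypothetical "file:slug" pairs; replace the slugs with the ones
# actually configured for each provider in metagrid.
imports=(
  "bsg.csv:bsg"
  "helveticat.csv:helveticat"
)

# Print the import command for one "file:slug" pair.
import_command() {
  local csv="${1%%:*}"   # text before the first colon: path to the csv
  local slug="${1##*:}"  # text after the last colon: provider slug
  echo "php bin/console MetagridApiBundle:import:csv ${csv} ${slug}"
}

# Dry run: only prints the commands, nothing is imported.
for entry in "${imports[@]}"; do
  import_command "$entry"
done
```

Once the printed commands look right, they can be run one by one from the metagrid project root. The sketch assumes file names without colons or spaces.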