The SemStats Challenge is back with more action! It is organized in the context of the SemStats 2014 workshop. Participants are invited to apply statistical techniques and semantic web technologies within one of two possible tracks, namely the Census Data Track and Open Track. Following up on the success of last year’s Challenge, this year, the Census Data Track will have data from France, Italy, and Ireland. We would also like to introduce the new Open Track, where any type of statistical data of your choice may be used in the challenge.
The challenge will consist in the realization of mashups or visualizations, but also on comparisons, analytics, alignment and enrichment of the data and concepts involved in statistical data (see below for the data made available and additional requirements).
The deadline for participants to submit their short papers and application is
Sun 7th September, 2014, 23:59pm Hawai Time. Submission is done via EasyChair by selecting the Challenge paper category.
It is strongly suggested to all challenge participants to send contact informations to email@example.com in order to be kept informed in case of any changes in the data provided. For any questions on the challenge, please contact firstname.lastname@example.org.
Census Data Track
We would like to point you to plenty of raw data. The conversion process will be considered as part of the challenge.
- Istat (Italian National Institute of Statistics) offers Census 1991, 2001, 2011 data and metadata: http://www.istat.it/it/archivio/104317#variabili_censuarie (See “Variabili censuarie / Censimento della popolazione e delle abitazioni”), which gives the population count by age range and sex at a very detailed geographic level.
- INSEE (National Institute of Statistics and Economic Studies) can provide different things:
- Detailed results for Census 2011: http://insee.fr/fr/themes/detail.asp?reg_id=0&ref_id=fd-RP2011&page=fichiers_detail/RP2011/telechargement.htm giving results on individuals only at the region level but with a great number of other variables (see http://insee.fr/fr/ppp/bases-de-donnees/fichiers_detail/RP2011/doc/contenu_RP2011_INDREG.pdf)
- Detailed results for Census 2010: http://insee.fr/fr/themes/detail.asp?reg_id=0&ref_id=fd-RP2010&page=fichiers_detail/RP2010/telechargement.htm with, for example, results on individuals at a smaller geographic level
- Key figures for Census 2011 on different themes at the municipality level: http://insee.fr/fr/bases-de-donnees/default.asp?page=recensement/resultats/2011/donnees-detaillees-recensement-2011.htm
- ABS (Australian Bureau of Statistics) offers Census 2011 data at http://stat.abs.gov.au/ . Data that is in particularly of interest to this challenge can be found by navigating to: Social Statistics > 2011 Census of Population and Housing > Time Series Profiles (Local Government Areas) > T03 Age by Sex (LGA)
- CSO (Central Statistics Office) Ireland’s Census 2011 data and metata available as Linked Data: http://data.cso.ie/
- You are welcome to use any other Census data whether it is Linked Data based or not
There is one essential requirement for the Open Track: papers must describe a publicly available application. We would love to see everyone play and learn from what you have created. You are welcome to use any statistical data whether it is already in Linked Data shape or not! While you are at it, why not combine it with data from other domains?
Here are some statistical linked dataspaces (off the top of our heads):