User login

Navigation

Share

live demo

This demo mixes two different datasets: One containing information about schools (location, phone number, etc.) and other containing the SIMCE (a standardized Chilean test) scores of that school over the years.

Note: Region's name, latitude and population are taken from dbpedia. For some regions there is no such information so I provide only the URI of those regions.

Data table

This table show the results from the SIMCE (Quality of Education Measurement System, in spanish) together with data obtained from dbpedia. An interesting aspect is that there is information missing in dbpedia about some regions.

Region (dbpedia)

Population (dbpedia)

Latitude (dbpedia)

Avg. math score per region(simce)

Avg. language score per region (simce)

Visualizations

The first graph shows the average of Math and Language scores in each region. It is clear that the scores are higher in the central region. A potential explanation if the correlation between population in each region and its score (as you can see in the second graph). Finally, we used dbpedia to obtain the latitude (remember that Chie is roughly speaking a vertical line, so longitudes are more or less the same for each region) and we compared that with the SIMCE results (as can be seen in the third graph). It is possible to do more analysis (in terms of density, ratio of rural/urban population and so on).

This demonstration shows when data.gov datasets were last updated, as determined by their HTTP responses. How up-to-date are the datasets listed in data.gov? We start with a time line histogram for a moment in time.

On the afternoons of __ and __, we crawled data.gov's links to datasets and recorded the modification dates reported by the source agency's HTTP servers. Of __ data.gov datasets, __ datasets returned modification dates on __ and __ returned modification dates on __. The list of data.gov datasets came from the 13 Sept 2010 version of their Dataset 92.

change blue crawl:

change red crawl:

Number of Datasets Modified since __

Process used

The following diagram shows the process used to obtain the last-modified dates of the data.gov datasets.

3 - The data.gov details page is requested and scraped to obtain references to data sources. Redirects are followed and HTTP HEADs are recorded in RDF using Turtle syntax and terminology from the Proof Markup Language ontology.

6 - This Javascript is included within THIS PAGE to query the LOGD SPARQL endpoint for dataset modification dates (using this query and these nightly-cached results). It calculates the histogram and uses Google Visualization's Annotated Timeline to render the results. A few notes are hard-coded to provide a frame of reference.

Use of provenance

The following diagram illustrates how the Proof Markup Language was used to record the HTTP headers obtained from the government agency servers. A portion of this data graph is selected by the SPARQL query listed above (the headers are excluded). The red links show the HTTP redirects and HTML anchor hrefs from the data.gov details page to the actual dataset URL on the hosting agency's domain. The green boxes represent URIs for the information obtained (HTTP headers), while the purple boxes associate the information received with its source and when it was received. For more discussion on how provenance is used in LOGD and csv2rdf4lod, see
A look at how csv2rdf4lod incorporates provenance into its tabular conversions.

Please note: The National Cancer Institute’s (NCI) PopSciGrid Community Health
Data Portal enables users to explore the relationship between
cancer risk factors, health outcomes, and factors in the social and
political environment. The maps, graphs, and tables on this website
are not intended to convey information about causal relationships, and
the NCI does not endorse any conclusions or inferences that individual
users may draw from the maps, tables, or graphs produced by this
website. Learn more about cancer.

Browser considerations: Firefox and IE 8 on Windows XP have difficulties displaying the following demonstrations. IE 7 on Windows Vista also has difficulties. Demonstrations can be successfully viewed using IE 8
on Windows 7, and Firefox and Safari on Mac. Flash is required for all browsers.

Using the demo: This demo shows a chart of the number of states with policies regulating use of tobacco in bars, restaurants, and workplaces from 1990 to 2007.

This demonstration shows when the data.gov datasets were last updated, as determined by their HTTP header responses. How up-to-date are the datasets listed in data.gov? We start with a time line histogram for a moment in time.

3 - The data.gov details page is requested and scraped to obtain references to data sources. Redirects are followed and HTTP HEADs are recorded in RDF using Turtle syntax and terminology from the Proof Markup Language ontology.

6 - This Javascript is included within THIS PAGE to query the LOGD SPARQL endpoint for dataset modification dates (using this query and these nightly-cached results). It calculates the histogram and uses Google Visualization's Annotated Timeline to render the results. A few notes are hard-coded to provide a frame of reference.

Use of provenance

The following diagram illustrates how the Proof Markup Language was used to record the HTTP headers obtained from the government agency servers. A portion of this data graph is selected by the SPARQL query listed above (the headers are excluded). The red links show the HTTP redirects and HTML anchor hrefs from the data.gov details page to the actual dataset URL on the hosting agency's domain. The green boxes represent URIs for the information obtained (HTTP headers), while the purple boxes associate the information received with its source and when it was received. For more discussion on how provenance is used in LOGD and csv2rdf4lod, see
A look at how csv2rdf4lod incorporates provenance into its tabular conversions.

Please note: The National Cancer Institute’s (NCI) PopSciGrid Community Health
Data Portal enables users to explore the relationship between
cancer risk factors, health outcomes, and factors in the social and
political environment. The maps, graphs, and tables on this website
are not intended to convey information about causal relationships, and
the NCI does not endorse any conclusions or inferences that individual
users may draw from the maps, tables, or graphs produced by this
website. Learn more about cancer.

Browser considerations: Firefox and IE 8 on Windows XP have difficulties displaying the following demonstrations. IE 7 on Windows Vista also has difficulties. Demonstrations can be successfully viewed using IE 8
on Windows 7, and Firefox and Safari on Mac. Flash is required for all browsers.

Using the demo: Select a year and a data series to show a heatmap that compares data for different states on a US map (the map may be blank in some combination because impacteen datasets (other than the policy data) only cover data between 1991 and 2007). Select a state to show the evolution of data series from 1991 to 2007.

year:
data:

state:

Smoking prevalence refers to the estimated percentage of adults in the US who currently smoke.

Select a year and a data series (workplace, restaurant or bar) to show a map of the U.S. that compares data for different states on a US map. Select a state to show the change of all data series in the selected state from 1990 to 2009 .

To the extent possible under law,
Tetherless World Constellation
has waived all copyright and related or neighboring rights to
TWC LOGD. TWC LOGD is an educational project on open government data using Semantic Web technologies.
Datasets hosted on this site are converted from a number of data sources such as data.gov. All data created by us are open for reuse, and usage of data created and managed by other sources should follow their own licenses.

The data contained on this site is automatically repopulated from US government or other open data sites, and any personal data in our linked-data versions is coming from those sources. If your information is removed from the government sources, it will be automatically removed from ours on the next update.