<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>OUseful.Info, the blog... &#187; Where Linked Data Would Be Useful &#8211; Creating More MPs&#8217; Maps from the Guardian Politics API</title>
	<atom:link href="http://blog.ouseful.info/2010/12/10/where-linked-data-would-be-useful-creating-more-mps-maps-from-the-guardian-politics-api/feed/?withoutcomments=1" rel="self" type="application/rss+xml" />
	<link>http://blog.ouseful.info</link>
	<description>Trying to find useful things to do with emerging technologies in open education</description>
	<lastBuildDate>Sun, 19 May 2013 20:47:31 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='blog.ouseful.info' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://s2.wp.com/i/buttonw-com.png</url>
		<title>OUseful.Info, the blog... &#187; Where Linked Data Would Be Useful &#8211; Creating More MPs&#8217; Maps from the Guardian Politics API</title>
		<link>http://blog.ouseful.info</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://blog.ouseful.info/osd.xml" title="OUseful.Info, the blog..." />
	<atom:link rel='hub' href='http://blog.ouseful.info/?pushpress=hub'/>
		<item>
		<title>Where Linked Data Would Be Useful &#8211; Creating More MPs&#8217; Maps from the Guardian Politics API</title>
		<link>http://blog.ouseful.info/2010/12/10/where-linked-data-would-be-useful-creating-more-mps-maps-from-the-guardian-politics-api/</link>
		<comments>http://blog.ouseful.info/2010/12/10/where-linked-data-would-be-useful-creating-more-mps-maps-from-the-guardian-politics-api/#comments</comments>
		<pubDate>Fri, 10 Dec 2010 00:50:47 +0000</pubDate>
		<dc:creator>Tony Hirst</dc:creator>
				<category><![CDATA[Anything you want]]></category>
		<category><![CDATA[Tinkering]]></category>
		<category><![CDATA[datastore]]></category>
		<category><![CDATA[jiscmu]]></category>
		<category><![CDATA[mp map]]></category>

		<guid isPermaLink="false">http://blog.ouseful.info/?p=4649</guid>
		<description><![CDATA[So given the news from the Commons today, I was wondering where the current crop of MPs came from in terms of birthplace, school, and university&#8230; Would a map based view turn up from across the UK, or something a little more clumpy&#8230;? Searching for sources of biographical data, two sources came to mind &#8211; [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.ouseful.info&#038;blog=325417&#038;post=4649&#038;subd=ouseful&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>So given the news from the Commons today, I was wondering where the current crop of MPs came from in terms of birthplace, school, and university&#8230; Would a map based view turn up from across the UK, or something a little more clumpy&#8230;?</p>
<p>Searching for sources of biographical data, two sources came to mind &#8211; Wikipedia infoboxes (and hence DBpedia) and the Guardian Politics API. In this post, I&#8217;ll describe a minimal &#8211; and not very reliable &#8211; recipe for plotting a map of UK MPs&#8217; <em>alma mater</em> based on data grabbed from the <a href="http://www.guardian.co.uk/open-platform/politics-api/getting-started">Guardian Politics API</a>, identifying a couple of ways in which the data could be made so much more useful, and indicating why the Linked Data approach is a Good Thing&#8230;</p>
<p>Just because, I&#8217;ll give a Yahoo Pipes recipe&#8230;</p>
<p>The first step is a <a href="http://pipes.yahoo.com/pipes/pipe.info?_id=bd0834cfe6cdacb0bea5569505d330b8">handler pipe</a> for grabbing an MP&#8217;s details from the Guardian Politics API from their Guardian ID:</p>
<p><a href="http://www.flickr.com/photos/psychemedia/5247895808/" title="Photo Sharing"><img src="http://farm6.static.flickr.com/5084/5247895808_d0209e801d.jpg" width="500" height="267" alt="MP details via Guardian Politics API" /></a></p>
<p>The next step is to get a list of current MPs and annotate the list with MP details using the helper pipe:</p>
<p><a href="http://www.flickr.com/photos/psychemedia/5247900576/" title="Photo Sharing"><img src="http://farm6.static.flickr.com/5167/5247900576_f3e9355392.jpg" width="500" height="494" alt="Lookup MP details" /></a></p>
<p>Not every MP has an <em>alma mater</em> listed, so we filter out the ones where there is no university information. We then use the university data as the input to a rough and ready geocoder, which does its best to identify a location and then geocode it. The Yahoo pipes trick of putting geo-data into the <em>y:location</em> attribute means that the pipe will automagically generate KML and map based previews of the output of the pipe.</p>
<p><a href="http://www.flickr.com/photos/psychemedia/5247302057/" title="Photo Sharing"><img src="http://farm6.static.flickr.com/5163/5247302057_7e4123d5eb.jpg" width="498" height="339" alt="Lookup location by university" /></a></p>
<p>Finally, we tidy up the feed a little:</p>
<p><a href="http://www.flickr.com/photos/psychemedia/5247911682/" title="Photo Sharing"><img src="http://farm6.static.flickr.com/5046/5247911682_de2ac86582.jpg" width="500" height="437" alt="Tidy the pipe..." /></a></p>
<p>Here&#8217;s the output of the <a href="http://pipes.yahoo.com/pipes/pipe.info?_id=a4276187c4b797dadd393d2fb880ec95">resulting pipe</a>:</p>
<p><a href="http://pipes.yahoo.com/pipes/pipe.info?_id=a4276187c4b797dadd393d2fb880ec95" title="Photo Sharing"><img src="http://farm6.static.flickr.com/5168/5247313953_05da3d04b9.jpg" width="500" height="429" alt="MPs by university - badly coded..." /></a></p>
<p>Clicking on the various markers, we see that there is a lot of miscoding going on. Also, some MPs have several universities listed, which may also contribute to the confusion. (A rough and ready way of handling that would be to split the university field on a semi-colon, and just use the first listed university in the location lookup.)</p>
<p>So what would make things easier? The Guardian Politics API is getting the data out there, but can it be improved in any way in order to make it a little (or a lot?!) more useful in a machine automated context such as this?</p>
<p>I think so&#8230;</p>
<p>Here&#8217;s one possible approach: a few weeks ago, the JISC Monitoring unit published a lookup service for looking up UK HEIs using a variety of identifier schemes and a crude name based lookup, and returning synonymous identifers, canonical URLs and lat/long data: <a href="http://data.jiscmu.ac.uk">data.jiscmu.ac.uk</a>. As identified in the <a href="http://www.jiscmu.ac.uk/news/view/189">announcement post</a>, this information complements rather more formally some of that already collated in the Guardian&#8217;s Education Datastore Rosetta Stone spreadsheet&#8230;</p>
<p>(I&#8217;m not sure if Leigh Dodds looked at how the JISCMU data could be used as part of a <a href="http://www.ldodds.com/blog/2010/08/gridworks-reconciliation-api-implementation/">Google Refine reconciliation API service</a>? I seem to remember a brief flurry of tweets on a related topic at the time&#8230;;-)</p>
<p>So, what would be really useful would be for the Guardian Politics API to use a weak Linked Data approach and provide a list of HEI identifiers using a formal identification scheme such as UCAS or HESA codes so that we knew which institutions they were actually referring to; (though this wouldn&#8217;t cope with overseas universities&#8230; Hmm&#8230; is there an international identifier scheme for universities?)</p>
<p>We could then hop over to the JISCMU service and pull down the lat/long information, before popping it on a map.</p>
<p>Looking deeper into the Guardian Politics API, we also see a field for listing the MPs&#8217; schools&#8230; which in turn could be enhanced by including identifiers used in the data.gov.uk education datastore.</p>
<p>So &#8211; Linked Data: can you see how it works yet? And do you get the feeling that network effects could kick in to place really quickly as data is enhanced with linking elements such as well defined identifiers using know identification schemes?</p>
<p>PS <a href="http://twitter.com/cgutteridge">Chris Gutteridge</a> has also picked up the challenge of this post, contributing a <a href="http://graphite.ecs.soton.ac.uk/experiments/parlibirth/mps.ttl">list of DBPedia URIs for current MPs to the cause</a>. I wouldn&#8217;t be surprised if he turns up a whole load more data actually cracks the problem way before I do!</p>
<p>Ah &#8211; seems like Chris has been on the case, and produced, (with caveats: <em>&#8220;Note that the data is patchy. It only shows MPs with a geocoded birthplace/university listed on dbpedia&#8221;</em>) a map [updated] of 313 MPs&#8217; birthplaces:</p>
<p><a href="//maps.google.com/?q=http:%2F%2Fgraphite.ecs.soton.ac.uk%2Fexperiments%2Fparlibirth%2Fborn.kml&amp;ie=UTF8&amp;ll=56.752723,3.515625&amp;spn=27.824184,66.621094&amp;z=4" title="Photo Sharing"><img src="http://farm6.static.flickr.com/5205/5248155559_223207c0ae.jpg" width="500" height="337" alt="MPs birthplaces" /></a></p>
<p>as well as a map [updated] of 176 MPs&#8217; universities <s>(though I don&#8217;t have a valid link for this&#8230; yet&#8230;;-)</s> Ah &#8211; here it is:</p>
<p><a href="http://maps.google.com/?q=http:%2F%2Fgraphite.ecs.soton.ac.uk%2Fexperiments%2Fparlibirth%2Fstudied.kml&amp;ie=UTF8&amp;ll=53.370221,0.703125&amp;spn=7.488833,16.655273&amp;z=6" title="Photo Sharing"><img src="http://farm6.static.flickr.com/5007/5248153609_3985f88963.jpg" width="500" height="378" alt="MPs alma mater map" /></a></p>
<p>UPDATE: here&#8217;s the recipe &#8211; <a href="http://blogs.ecs.soton.ac.uk/webteam/2010/12/10/studying-the-mps/">Studying the MPs</a></p>
<p>PPS I really need to add Chris&#8217; <a href="http://graphite.ecs.soton.ac.uk/geo2kml/">geo-tagged RDF to KML converter</a> service (described <a href="http://blogs.ecs.soton.ac.uk/webteam/2010/12/05/new-tools/">here</a> to my toolkit&#8230;</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/ouseful.wordpress.com/4649/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/ouseful.wordpress.com/4649/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=blog.ouseful.info&#038;blog=325417&#038;post=4649&#038;subd=ouseful&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://blog.ouseful.info/2010/12/10/where-linked-data-would-be-useful-creating-more-mps-maps-from-the-guardian-politics-api/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/abbd9f90565ce9ae4d065d93a81d8c03?s=96&#38;d=http%3A%2F%2F1.gravatar.com%2Favatar%2Fad516503a11cd5ca435acc9bb6523536%3Fs%3D96" medium="image">
			<media:title type="html">Tony Hirst</media:title>
		</media:content>

		<media:content url="http://farm6.static.flickr.com/5084/5247895808_d0209e801d.jpg" medium="image">
			<media:title type="html">MP details via Guardian Politics API</media:title>
		</media:content>

		<media:content url="http://farm6.static.flickr.com/5167/5247900576_f3e9355392.jpg" medium="image">
			<media:title type="html">Lookup MP details</media:title>
		</media:content>

		<media:content url="http://farm6.static.flickr.com/5163/5247302057_7e4123d5eb.jpg" medium="image">
			<media:title type="html">Lookup location by university</media:title>
		</media:content>

		<media:content url="http://farm6.static.flickr.com/5046/5247911682_de2ac86582.jpg" medium="image">
			<media:title type="html">Tidy the pipe...</media:title>
		</media:content>

		<media:content url="http://farm6.static.flickr.com/5168/5247313953_05da3d04b9.jpg" medium="image">
			<media:title type="html">MPs by university - badly coded...</media:title>
		</media:content>

		<media:content url="http://farm6.static.flickr.com/5205/5248155559_223207c0ae.jpg" medium="image">
			<media:title type="html">MPs birthplaces</media:title>
		</media:content>

		<media:content url="http://farm6.static.flickr.com/5007/5248153609_3985f88963.jpg" medium="image">
			<media:title type="html">MPs alma mater map</media:title>
		</media:content>
	</item>
	</channel>
</rss>
