<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
		>
<channel>
	<title>Comments on: My Personal Intro to F1 Race Statistics</title>
	<atom:link href="http://blog.ouseful.info/2013/01/11/my-personal-intro-to-f1-race-statistics/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.ouseful.info/2013/01/11/my-personal-intro-to-f1-race-statistics/</link>
	<description>Trying to find useful things to do with emerging technologies in open education</description>
	<lastBuildDate>Sat, 25 May 2013 17:29:23 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
	<item>
		<title>By: Tony Hirst</title>
		<link>http://blog.ouseful.info/2013/01/11/my-personal-intro-to-f1-race-statistics/#comment-30128</link>
		<dc:creator><![CDATA[Tony Hirst]]></dc:creator>
		<pubDate>Fri, 11 Jan 2013 01:00:06 +0000</pubDate>
		<guid isPermaLink="false">http://blog.ouseful.info/?p=9458#comment-30128</guid>
		<description><![CDATA[Hi Chris - thanks for that suggestion. &quot;Matters of independence&quot; was another of the bugbears I should have added to my opening line! As well as rankings, I have lap time data (for a couple of seasons at least, courtesy of Ergast API), and sector times for practice and qualifying going back several years (scraped from Formula One website; I&#039;ve also got pit times, of a sort, from that site, and fastest laps). 

One thing I have been mulling over is things like podium finish, top 10/middle 7/bottom 7 in qualifying, clean side/dirty side on grid (for end of first lap position change stats) etc.]]></description>
		<content:encoded><![CDATA[<p>Hi Chris &#8211; thanks for that suggestion. &#8220;Matters of independence&#8221; was another of the bugbears I should have added to my opening line! As well as rankings, I have lap time data (for a couple of seasons at least, courtesy of Ergast API), and sector times for practice and qualifying going back several years (scraped from Formula One website; I&#8217;ve also got pit times, of a sort, from that site, and fastest laps). </p>
<p>One thing I have been mulling over is things like podium finish, top 10/middle 7/bottom 7 in qualifying, clean side/dirty side on grid (for end of first lap position change stats) etc.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Chris Hanretty (@chrishanretty)</title>
		<link>http://blog.ouseful.info/2013/01/11/my-personal-intro-to-f1-race-statistics/#comment-30126</link>
		<dc:creator><![CDATA[Chris Hanretty (@chrishanretty)]]></dc:creator>
		<pubDate>Fri, 11 Jan 2013 00:21:35 +0000</pubDate>
		<guid isPermaLink="false">http://blog.ouseful.info/?p=9458#comment-30126</guid>
		<description><![CDATA[One of the problems with using final placings (i.e., ranks) as outcomes is that these are (a) definitely not normally distributed, and (b) not independent of one another: if Button finishes second, no one else can. You might instead want to look at models for survival data. Survival data is often used for time to mortality, modelled as a function of various risk factors. Here, the outcome is time to completion, higher &quot;mortality&quot; is a good thing, and risk factors are actually factors associated with racing quicker. 

The benefit of using this kind of analysis is that survival analyses are quite good at dealing with censored observations -- i.e., times where we don&#039;t know how quick a driver would have finished a race, because they had a DNF. 

Googling &quot;survival analysis tutorial R&quot; throws up a number of useful-looking tutorials. Predicting from survival models is a bit ropier, but should be possible out of the box.]]></description>
		<content:encoded><![CDATA[<p>One of the problems with using final placings (i.e., ranks) as outcomes is that these are (a) definitely not normally distributed, and (b) not independent of one another: if Button finishes second, no one else can. You might instead want to look at models for survival data. Survival data is often used for time to mortality, modelled as a function of various risk factors. Here, the outcome is time to completion, higher &#8220;mortality&#8221; is a good thing, and risk factors are actually factors associated with racing quicker. </p>
<p>The benefit of using this kind of analysis is that survival analyses are quite good at dealing with censored observations &#8212; i.e., times where we don&#8217;t know how quick a driver would have finished a race, because they had a DNF. </p>
<p>Googling &#8220;survival analysis tutorial R&#8221; throws up a number of useful-looking tutorials. Predicting from survival models is a bit ropier, but should be possible out of the box.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
