<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/'><id>tag:blogger.com,1999:blog-9147415858568072588.post2165979123819696507..comments</id><updated>2008-12-07T09:25:54.148Z</updated><title type='text'>Comments on Info Clarity: Spotting fraud in numbers</title><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://infoclarity.blogspot.com/feeds/2165979123819696507/comments/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9147415858568072588/2165979123819696507/comments/default'/><link rel='alternate' type='text/html' href='http://infoclarity.blogspot.com/2008/11/spotting-fraud-in-numbers.html'/><author><name>David Boyle</name><uri>http://www.blogger.com/profile/17073510484824457260</uri><email>noreply@blogger.com</email></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>4</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>25</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-9147415858568072588.post-6096331582142248628</id><published>2008-12-07T09:25:00.000Z</published><updated>2008-12-07T09:25:00.000Z</updated><title type='text'>That would be very cruel of me. The organizer numb...</title><content type='html'>That would be very cruel of me. The organizer numbers have been changed. 
I don't think we should be surprised by the findings though, should we :)</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/9147415858568072588/2165979123819696507/comments/default/6096331582142248628'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9147415858568072588/2165979123819696507/comments/default/6096331582142248628'/><link rel='alternate' type='text/html' href='http://infoclarity.blogspot.com/2008/11/spotting-fraud-in-numbers.html?showComment=1228641900000#c6096331582142248628' title=''/><author><name>David Boyle</name><uri>http://www.blogger.com/profile/17073510484824457260</uri><email>noreply@blogger.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='06699960019854151675'/></author><thr:in-reply-to xmlns:thr='http://purl.org/syndication/thread/1.0' href='http://infoclarity.blogspot.com/2008/11/spotting-fraud-in-numbers.html' ref='tag:blogger.com,1999:blog-9147415858568072588.post-2165979123819696507' source='http://www.blogger.com/feeds/9147415858568072588/posts/default/2165979123819696507' type='text/html'/></entry><entry><id>tag:blogger.com,1999:blog-9147415858568072588.post-3476079479535915675</id><published>2008-12-06T21:02:00.000Z</published><updated>2008-12-06T21:02:00.000Z</updated><title type='text'>Very cool!  Who is organizer 4?  Do I have to sear...</title><content type='html'>Very cool!  Who is organizer 4?  
Do I have to search around through my decommissioned computers for zone 4 to find the answer to this?</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/9147415858568072588/2165979123819696507/comments/default/3476079479535915675'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9147415858568072588/2165979123819696507/comments/default/3476079479535915675'/><link rel='alternate' type='text/html' href='http://infoclarity.blogspot.com/2008/11/spotting-fraud-in-numbers.html?showComment=1228597320000#c3476079479535915675' title=''/><author><name>Dan Check</name><uri>http://www.blogger.com/profile/00858087992150809692</uri><email>noreply@blogger.com</email></author><thr:in-reply-to xmlns:thr='http://purl.org/syndication/thread/1.0' href='http://infoclarity.blogspot.com/2008/11/spotting-fraud-in-numbers.html' ref='tag:blogger.com,1999:blog-9147415858568072588.post-2165979123819696507' source='http://www.blogger.com/feeds/9147415858568072588/posts/default/2165979123819696507' type='text/html'/></entry><entry><id>tag:blogger.com,1999:blog-9147415858568072588.post-6694447166245840961</id><published>2008-12-04T10:59:00.000Z</published><updated>2008-12-04T10:59:00.000Z</updated><title type='text'>First, I'm certain it's not about the total number ...</title><content type='html'>First, I'm certain it's not about the total number of heads or tails. Comparing that across the class would allow you to estimate the proportion of the class that cheated, but not to identify individuals.&lt;BR/&gt;&lt;BR/&gt;I suspect therefore it's about the number sequence itself. Probably the number of sequential strings of H or T. &lt;BR/&gt;&lt;BR/&gt;If it were me, I would convert the sequence into numbers and then look at the distribution of these numbers. I would do this by listing the length of each 'run' of heads or tails in order. &lt;BR/&gt;&lt;BR/&gt;E.g. 
HHTHTTHHTHTHHHT would become 2,1,1,2,2,1,1,1,3,1&lt;BR/&gt;&lt;BR/&gt;Let's have a stab at the maths:&lt;BR/&gt;&lt;BR/&gt;Once you have flipped your first coin, the probability of the second flip being different, and ending the 'run', is 50%. So we should see '1' occur about 50% of the time in our list. (In my made-up list above, it is 6/10 - not too bad!)&lt;BR/&gt;&lt;BR/&gt;The probability of the second flip being the same is also 50%. And the probability of this 'run' ending on the next flip is 50%. This means the total probability of getting a run of length two is 25%. (In my made-up flips above, I had '2' 3/10 times, or 30% of the time. Oops!)&lt;BR/&gt;&lt;BR/&gt;Similarly, the probability of getting a run of length n is (50%)^n.&lt;BR/&gt;&lt;BR/&gt;All the lecturer needs to do is convert each student's list of coin flips into the number sequence above and do a statistical test to understand whether the difference from the expected pattern is less than 5% likely to be down to chance (or similar). 
200 coin flips should give about 100 numbers once converted, which seems to be a decent sample size.&lt;BR/&gt;&lt;BR/&gt;Anyone want to chip in on what the statistical test should be?&lt;BR/&gt;&lt;BR/&gt;By the way, here's another (probable) application of Benford's law to coin-flipping in the real world: http://paul.kedrosky.com/archives/2008/07/21/hedge_fund_test.html&lt;BR/&gt;&lt;BR/&gt;(From a blog I highly recommend, by the way)</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/9147415858568072588/2165979123819696507/comments/default/6694447166245840961'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9147415858568072588/2165979123819696507/comments/default/6694447166245840961'/><link rel='alternate' type='text/html' href='http://infoclarity.blogspot.com/2008/11/spotting-fraud-in-numbers.html?showComment=1228388340000#c6694447166245840961' title=''/><author><name>David Boyle</name><uri>http://www.blogger.com/profile/17073510484824457260</uri><email>noreply@blogger.com</email><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='OpenSocialUserId' value='06699960019854151675'/></author><thr:in-reply-to xmlns:thr='http://purl.org/syndication/thread/1.0' href='http://infoclarity.blogspot.com/2008/11/spotting-fraud-in-numbers.html' ref='tag:blogger.com,1999:blog-9147415858568072588.post-2165979123819696507' source='http://www.blogger.com/feeds/9147415858568072588/posts/default/2165979123819696507' type='text/html'/></entry><entry><id>tag:blogger.com,1999:blog-9147415858568072588.post-8775603913229056998</id><published>2008-12-03T17:12:00.000Z</published><updated>2008-12-03T17:12:00.000Z</updated><title type='text'>So, help me understand how this would apply to fak...</title><content type='html'>So, help me understand how this would apply to fake coin flipping data?</content><link rel='edit' type='application/atom+xml' 
href='http://www.blogger.com/feeds/9147415858568072588/2165979123819696507/comments/default/8775603913229056998'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/9147415858568072588/2165979123819696507/comments/default/8775603913229056998'/><link rel='alternate' type='text/html' href='http://infoclarity.blogspot.com/2008/11/spotting-fraud-in-numbers.html?showComment=1228324320000#c8775603913229056998' title=''/><author><name>BaltoBen</name><uri>http://www.blogger.com/profile/16847455923648437365</uri><email>noreply@blogger.com</email></author><thr:in-reply-to xmlns:thr='http://purl.org/syndication/thread/1.0' href='http://infoclarity.blogspot.com/2008/11/spotting-fraud-in-numbers.html' ref='tag:blogger.com,1999:blog-9147415858568072588.post-2165979123819696507' source='http://www.blogger.com/feeds/9147415858568072588/posts/default/2165979123819696507' type='text/html'/></entry></feed>