[cracks knuckles] MT @FiveThirtyEight: We're mapping kidnapping rates in Nigeria 1982 to 2014. http://53eig.ht/1oof5cw pic.twitter.com/FQjZQDHJme//twitter.com/charlie_simpson/status/466300650016763904
— EM Simpson (@charlie_simpson)Tue, May 13 2014 19:35:13- Nope MT @armarvin: .@charlie_simpson @FiveThirtyEight Do you think the #GDELT is sound enough to make calls on #Nigeria kidnapping trends?
//twitter.com/charlie_simpson/status/466304983017594880
— EM Simpson (@charlie_simpson)Tue, May 13 2014 19:52:26 - All trend analysis using #GDELT has to take into account the exponential increase in news stories which generate the data. @FiveThirtyEight
//twitter.com/charlie_simpson/status/466305581083414529
— EM Simpson (@charlie_simpson)Tue, May 13 2014 19:54:48 - #GDELT isn't designed for tracking discrete events like "kidnappings" or "suicide bombings" bc it's based on news reports. @FiveThirtyEight
//twitter.com/charlie_simpson/status/466307333648576513
— EM Simpson (@charlie_simpson)Tue, May 13 2014 20:01:46 - The beauty of automated event data is that you don't hand-code things. The power-and the limitations-come from automation @FiveThirtyEight
//twitter.com/charlie_simpson/status/466307563697352704
— EM Simpson (@charlie_simpson)Tue, May 13 2014 20:02:41 - So if #GDELT says there were 649 kidnappings in Nigeria in 4 months, WHAT IT'S REALLY SAYING is there were 649 news stories abt kidnappings.
//twitter.com/charlie_simpson/status/466308105416884225
— EM Simpson (@charlie_simpson)Tue, May 13 2014 20:04:50 - Apropos of nothing, @jay_yonamine & Schrodt have written things about using event data. http://jayyonamine.com/wp-content/uploads/2012/07/YonamineSchrodt_A_Guide_to_Event_Data.pdf … and http://jayyonamine.com/wp-content/uploads/2012/06/Working-with-Event-Data-A-Guide-to-Aggregation-Choices.pdf …
//twitter.com/johnb30/status/466308363970568192
— John Beieler (@johnb30)Tue, May 13 2014 20:05:52 - News coverage varies widely over time and space. That's really important when making comparisons across time and space. @FiveThirtyEight
//twitter.com/charlie_simpson/status/466308603079847936
— EM Simpson (@charlie_simpson)Tue, May 13 2014 20:06:49 - As basic validation of #GDELT for this problem set, I'd like to see the following for kidnappings in Nigeria...
//twitter.com/charlie_simpson/status/466309194002731009
— EM Simpson (@charlie_simpson)Tue, May 13 2014 20:09:10 - 1) Total number of stories coded for Nigeria over time (what is the shape of that curve)?
//twitter.com/charlie_simpson/status/466309310126231552
— EM Simpson (@charlie_simpson)Tue, May 13 2014 20:09:37 - 2) What are the total number of events generated for Nigeria over time? (What is the shape of that curve?)
//twitter.com/charlie_simpson/status/466309471242027008
— EM Simpson (@charlie_simpson)Tue, May 13 2014 20:10:16 - 3) How does the number of kidnappings compare to the number of coded events? Same shape? Key differences?
//twitter.com/charlie_simpson/status/466309607380369408
— EM Simpson (@charlie_simpson)Tue, May 13 2014 20:10:48 - 4) How many overall events are coded with a specific geolocation? How many get coded to a centroid? (And where is the centroid?)
//twitter.com/charlie_simpson/status/466309825719070722
— EM Simpson (@charlie_simpson)Tue, May 13 2014 20:11:40 - 5) How many kidnapping events are coded with a specific geolocation? Does that change over time?
//twitter.com/charlie_simpson/status/466309955184623616
— EM Simpson (@charlie_simpson)Tue, May 13 2014 20:12:11 - 6) How does this information track with other open source reporting? HRW, UN, WB local NGO crime reporting? Can we corroborate trends?
//twitter.com/charlie_simpson/status/466310118460514304
— EM Simpson (@charlie_simpson)Tue, May 13 2014 20:12:50 - In conclusion: VALIDATE YOUR FREAKING DATA. It's not true just because it's on a goddamn map. #GDELT @FiveThirtyEight
//twitter.com/charlie_simpson/status/466310350837911554
— EM Simpson (@charlie_simpson)Tue, May 13 2014 20:13:45 - Learn the data generating process. Learn the coding rules. Match it against some real world reporting. THEN publish #GDELT @FiveThirtyEight
//twitter.com/charlie_simpson/status/466310600600322048
— EM Simpson (@charlie_simpson)Tue, May 13 2014 20:14:45 - And never, EVER use #GDELT for reporting of discrete events. That's not what it's for. Not kidnappings, not murders, not suicide bombings.
//twitter.com/charlie_simpson/status/466310866225217536
— EM Simpson (@charlie_simpson)Tue, May 13 2014 20:15:48 - Event data, and #GDELT in particular, have unique quirks. But all data have to be understood on their own terms. It's not magic, people.
//twitter.com/charlie_simpson/status/466311358024527872
— EM Simpson (@charlie_simpson)Tue, May 13 2014 20:17:46 - I'll buy a beer for whoever storifys that rant...
//twitter.com/charlie_simpson/status/466316350315851778
— EM Simpson (@charlie_simpson)Tue, May 13 2014 20:37:36 - Hi
- @charlie_simpson yes! Thank God somebody who knows what the fuck she's doing is pointing this out.
//twitter.com/WHarkavy/status/466317399416119298
— Ward Harkavy (@WHarkavy)Tue, May 13 2014 20:41:46 - Go read @charlie_simpson's timeline to understand why the @FiveThirtyEight map of Nigerian kidnapping data is so problematic. Great rant.
//twitter.com/brian_root/status/466319137871237121
— Brian Root (@brian_root)Tue, May 13 2014 20:48:40 - Another major data blunder by @FiveThirtyEight. Go read @charlie_simpson TL. GDELT data is what it is; it cannot do what you wish it does.
//twitter.com/zeynep/status/466320244903256065
— Zeynep Tufekci (@zeynep)Tue, May 13 2014 20:53:04 - I was going to blog about #GDELT and http://fivethirtyeight.com/datalab/mapping-kidnappings-in-nigeria/ …, but @charlie_simpson has twitter-ranted most everything I meant to say.
//twitter.com/AABoyles/status/466323022379765760
— Tony Boyles (@AABoyles)Tue, May 13 2014 21:04:07

![[cracks knuckles] MT @FiveThirtyEight: We're mapping kidnapping rates in Nigeria 1982 to 2014. http://t.co/zIfSxm0uSq http://t.co/FQjZQDHJme](http://i.embed.ly/1/display/resize?key=1e6a1a1efdb011df84894040444cdc60&url=http%3A%2F%2Fpbs.twimg.com%2Fmedia%2FBnhM-2BIUAAaqv1.png)





