<?xml version="1.0" encoding="iso-8859-1"?>
<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>dekstop weblog : MidnightBot: When People Post</title>
    <link>http://dekstop.de/weblog/2006/06/midnightbot/</link>
    <description>I started to think about the different times of day people post on their blogs, and wondered what that said about your personality or occupation. So one day, with a long train-ride ahead of me, I set out to find out. The goal: for a selection of blogs, plot the ...</description>
    <dc:language>en-us</dc:language>
    <dc:rights>Copyright 2006 Martin Dittus</dc:rights>
    <lastBuildDate>Thu, 29 Jun 2006 00:39:21 GMT</lastBuildDate>
    <generator>MicroLinks 5.6 (dekstop.de)</generator>
    <managingEditor>public&#64;dekstop&#46;de</managingEditor>
    <webMaster>public&#64;dekstop&#46;de</webMaster>

    <item>
      <title>Comment on "MidnightBot: When People Post"</title>
      <link>http://dekstop.de/weblog/2006/06/midnightbot/#274</link>
      <description><![CDATA[<p>Just so that I won't forget: I should also add scatter plots of TOD over the article sequence, to give a sense of how posting habits change over time. Just did a quick sketch in R, and some of the plots look very interesting.</p>]]> &lt;p&gt;- <![CDATA[<a href="http://dekstop.de/" rel="nofollow">Martin Dittus</a>]]>&lt;/p&gt;</description>
      <dc:creator>Martin Dittus</dc:creator>
      <guid isPermaLink="true">http://dekstop.de/weblog/2006/06/midnightbot/#274</guid>
      <pubDate>Fri, 30 Jun 2006 13:58:37 GMT</pubDate>
    </item>
    <item>
      <title>Comment on "MidnightBot: When People Post"</title>
      <link>http://dekstop.de/weblog/2006/06/midnightbot/#270</link>
      <description><![CDATA[<p>Hey, you're right! That's a good reason to get into P5... (I was just starting look into PHP/Python sparkline libraries)</p>]]> &lt;p&gt;- <![CDATA[<a href="http://dekstop.de/" rel="nofollow">Martin Dittus</a>]]>&lt;/p&gt;</description>
      <dc:creator>Martin Dittus</dc:creator>
      <guid isPermaLink="true">http://dekstop.de/weblog/2006/06/midnightbot/#270</guid>
      <pubDate>Thu, 29 Jun 2006 18:31:07 GMT</pubDate>
    </item>
    <item>
      <title>Comment on "MidnightBot: When People Post"</title>
      <link>http://dekstop.de/weblog/2006/06/midnightbot/#269</link>
      <description><![CDATA[<p>i have a suggestion concerning visualization software: processing ;)<br />
very easy to read data and write images.</p>]]> &lt;p&gt;- senorpako&lt;/p&gt;</description>
      <dc:creator>senorpako</dc:creator>
      <guid isPermaLink="true">http://dekstop.de/weblog/2006/06/midnightbot/#269</guid>
      <pubDate>Thu, 29 Jun 2006 18:17:13 GMT</pubDate>
    </item>
    <item>
      <title>Comment on "MidnightBot: When People Post"</title>
      <link>http://dekstop.de/weblog/2006/06/midnightbot/#268</link>
      <description><![CDATA[<p>Hey,</p>

<p>yeah I know, I'm dissatisfied with the plots for a number of reasons (e.g. there is no clear baseline, and no clear separation between individual hours/days), but the software I used was too limited. Maybe I'll look around for better alternatives. Any suggestions?</p>

<p>Good idea with the horizontal scaling.</p>]]> &lt;p&gt;- <![CDATA[<a href="http://dekstop.de/" rel="nofollow">Martin Dittus</a>]]>&lt;/p&gt;</description>
      <dc:creator>Martin Dittus</dc:creator>
      <guid isPermaLink="true">http://dekstop.de/weblog/2006/06/midnightbot/#268</guid>
      <pubDate>Thu, 29 Jun 2006 11:02:28 GMT</pubDate>
    </item>
    <item>
      <title>Comment on "MidnightBot: When People Post"</title>
      <link>http://dekstop.de/weblog/2006/06/midnightbot/#267</link>
      <description><![CDATA[<p>hey martin,<br />
very nice. though i think, that the sparklines for the week-blogging-behaviour are a bit short. in your visualization, hours and days have the same scaling. thus one hour takes the same space than one day. but i´d propose to give days a slightly bigger scaling. just for readability.</p>

<p>by the way:<br />
http://www.edwardtufte.com/bboard/q-and-a-fetch-msg?msg_id=0001OR&topic_id=1</p>

<p>though i´m pretty sure, that you already had a look on that one ;)</p>]]> &lt;p&gt;- <![CDATA[<a href="http://wortwechsel.biz/blog" rel="nofollow">senorpako</a>]]>&lt;/p&gt;</description>
      <dc:creator>senorpako</dc:creator>
      <guid isPermaLink="true">http://dekstop.de/weblog/2006/06/midnightbot/#267</guid>
      <pubDate>Thu, 29 Jun 2006 10:56:27 GMT</pubDate>
    </item>
    <item>
      <title>Comment on "MidnightBot: When People Post"</title>
      <link>http://dekstop.de/weblog/2006/06/midnightbot/#260</link>
      <description><![CDATA[<p>OK, hm, forgot, authentication was still in my browser...</p>]]> &lt;p&gt;- <![CDATA[<a href="http://pascal.vanhecke.info" rel="nofollow">Pascal</a>]]>&lt;/p&gt;</description>
      <dc:creator>Pascal</dc:creator>
      <guid isPermaLink="true">http://dekstop.de/weblog/2006/06/midnightbot/#260</guid>
      <pubDate>Thu, 29 Jun 2006 01:26:19 GMT</pubDate>
    </item>
    <item>
      <title>Comment on "MidnightBot: When People Post"</title>
      <link>http://dekstop.de/weblog/2006/06/midnightbot/#259</link>
      <description><![CDATA[<p>Gaaa! I really should have thought of that...</p>

<p>(Google does require authentication: I had to login to see the feed.)</p>

<p>Thx...</p>]]> &lt;p&gt;- <![CDATA[<a href="http://dekstop.de/" rel="nofollow">Martin Dittus</a>]]>&lt;/p&gt;</description>
      <dc:creator>Martin Dittus</dc:creator>
      <guid isPermaLink="true">http://dekstop.de/weblog/2006/06/midnightbot/#259</guid>
      <pubDate>Thu, 29 Jun 2006 01:10:28 GMT</pubDate>
    </item>
    <item>
      <title>Comment on "MidnightBot: When People Post"</title>
      <link>http://dekstop.de/weblog/2006/06/midnightbot/#258</link>
      <description><![CDATA[<p>Hello Martin,</p>

<p>amusing article :-)</p>

<p>About data acquisition: </p>

<p>have a look at the "unofficial Google Reader api":<br />
http://www.niallkennedy.com/blog/archives/2005/12/google_reader_a.html</p>

<p>example:<br />
http://www.google.com/reader/atom/feed/http://dekstop.de/weblog/index.xml</p>

<p>and for 100 entries:<br />
http://www.google.com/reader/atom/feed/http://dekstop.de/weblog/index.xml?n=100</p>

<p>Bloglines has a similar api, but Google Reader is a lot easier (no authentication needed).</p>]]> &lt;p&gt;- <![CDATA[<a href="http://pascal.vanhecke.info" rel="nofollow">Pascal Van Hecke</a>]]>&lt;/p&gt;</description>
      <dc:creator>Pascal Van Hecke</dc:creator>
      <guid isPermaLink="true">http://dekstop.de/weblog/2006/06/midnightbot/#258</guid>
      <pubDate>Thu, 29 Jun 2006 01:04:08 GMT</pubDate>
    </item>


    <item>
      <title>MidnightBot: When People Post</title>
      <link>http://dekstop.de/weblog/2006/06/midnightbot/</link> 
      <description><![CDATA[<p>I started to think about the different times of day people post on their blogs, and wondered what that said about your personality or occupation. So one day, with a long train-ride ahead of me, I set out to find out.</p>

<p>The goal: for a selection of blogs, plot the time of day and day of week of the 100 most recent articles.</p>

<p>Initial guess: all the cool guys post at really odd hours, and the boring guys only during their lunch break ;)</p>

<p><strong>Update</strong> 2006-07-05 -- check out the <a href="http://dekstop.de/weblog/2006/07/midnightbot_ii_no_errors/">new version of the graphs</a>. Much nicer, and, err, this time around maybe even with less errors...</p>

<h3>Showroom</h3>

<table border="0" cellspacing="1" cellpadding="5" class="datatable">
<tr>
  <th>Blog</th>
  <th>#</th>
  <th>24h</th>
  <th>7d</th>
  <th>Occupation</th>
  <th>Motto</th>
</tr>
<tr>
  <td><a href="http://de-bug.de/blog/">De:Bug Blog</a></td>
  <td align="right">100</td>
  <td><img src="http://dekstop.de/weblog/2006/06/midnightbot/de-bug.de-blog_hours.png" width="50" height="14" alt="De Bug.De Blog Hours" /></td>
  <td><img src="http://dekstop.de/weblog/2006/06/midnightbot/de-bug.de-blog_days.png" width="16" height="14" alt="De Bug.De Blog Days" /></td>
  <td>Journalist and DJ</td>
  <td>"Why work on weekends?"</td>
</tr>
<tr>
  <td><a href="http://dekstop.de/weblog/">dekstop weblog</a></td>
  <td align="right">100</td>
  <td><img src="http://dekstop.de/weblog/2006/06/midnightbot/dekstop.de_hours.png" width="50" height="14" alt="Dekstop.De Hours" /></td>
  <td><img src="http://dekstop.de/weblog/2006/06/midnightbot/dekstop.de_days.png" width="16" height="14" alt="Dekstop.De Days" /></td>
  <td>Student</td>
  <td>"Why sleep?"</td>
</tr>
<tr>
  <td><a href="http://drunkenblog.com/">DrunkenBlog</a></td>
  <td align="right">100</td>
  <td><img src="http://dekstop.de/weblog/2006/06/midnightbot/drunkenblog.com_hours.png" width="50" height="14" alt="Drunkenblog.Com Hours" /></td>
  <td><img src="http://dekstop.de/weblog/2006/06/midnightbot/drunkenblog.com_days.png" width="16" height="14" alt="Drunkenblog.Com Days" /></td>
  <td>(Developer?)</td>
  <td>"I only post Fridays."</td>
</tr>
<tr>
  <td><a href="http://eigenclass.org/">eigenclass.org</a></td>
  <td align="right">56</td>
  <td><img src="http://dekstop.de/weblog/2006/06/midnightbot/eigenclass.org_hours.png" width="50" height="14" alt="Events.Ccc.De Wiki Hours" /></td>
  <td><img src="http://dekstop.de/weblog/2006/06/midnightbot/eigenclass.org_days.png" width="16" height="14" alt="Eigenclass.Org Days" /></td>
  <td>(Developer?)</td>
  <td>"Write Sunday, post Monday"</td>
</tr>
<tr>
  <td><a href="http://inessential.com/">inessential.com</a></td>
  <td align="right">100</td>
  <td><img src="http://dekstop.de/weblog/2006/06/midnightbot/inessential.com_hours.png" width="50" height="14" alt="Inessential.Com Hours" /></td>
  <td><img src="http://dekstop.de/weblog/2006/06/midnightbot/inessential.com_days.png" width="16" height="14" alt="Inessential.Com Days" /></td>
  <td>Developer</td>
  <td>"Don't post during lunch."</td>
</tr>
<tr>
  <td><a href="http://ranchero.com/">ranchero.com</a></td>
  <td align="right">100</td>
  <td><img src="http://dekstop.de/weblog/2006/06/midnightbot/ranchero.com_hours.png" width="50" height="14" alt="Ranchero.Com Hours" /></td>
  <td><img src="http://dekstop.de/weblog/2006/06/midnightbot/ranchero.com_days.png" width="16" height="14" alt="Ranchero.Com Days" /></td>
  <td>Developer</td>
  <td>"Post during lunch."</td>
</tr>
<tr>
  <td><a href="http://silentbobspeaks.com/">My Boring-Ass Life</a></td>
  <td align="right">98</td>
  <td><img src="http://dekstop.de/weblog/2006/06/midnightbot/silentbobspeaks.com_hours.png" width="50" height="14" alt="Silentbobspeaks.Com Hours" /></td>
  <td><img src="http://dekstop.de/weblog/2006/06/midnightbot/silentbobspeaks.com_days.png" width="16" height="14" alt="Silentbobspeaks.Com Days" /></td>
  <td>Movie director</td>
  <td>"My boring-ass life."</td>
</tr>
<tr>
  <td><a href="http://tim.geekheim.de/">The Lunatic Fringe</a></td>
  <td align="right">99</td>
  <td><img src="http://dekstop.de/weblog/2006/06/midnightbot/tim.geekheim.de_hours.png" width="50" height="14" alt="Tim.Geekheim.De Hours" /></td>
  <td><img src="http://dekstop.de/weblog/2006/06/midnightbot/tim.geekheim.de_days.png" width="16" height="14" alt="Tim.Geekheim.De Days" /></td>
  <td>(Evangelist?)</td>
  <td>"I peak twice."</td>
</tr>
<tr>
  <td><a href="http://villainous.biz/wordpress/">villainous.biz</a></td>
  <td align="right">58</td>
  <td><img src="http://dekstop.de/weblog/2006/06/midnightbot/villainous.biz_hours.png" width="50" height="14" alt="Villainous.Biz Hours" /></td>
  <td><img src="http://dekstop.de/weblog/2006/06/midnightbot/villainous.biz_days.png" width="16" height="14" alt="Villainous.Biz Days" /></td>
  <td>Artist</td>
  <td>"Work? Hang out? Hm."</td>
</tr>
<tr>
  <td><a href="http://wortwechsel.biz/blog/">wortwechsel.biz</a></td>
  <td align="right">19</td>
  <td><img src="http://dekstop.de/weblog/2006/06/midnightbot/wortwechsel.biz_hours.png" width="50" height="14" alt="Wortwechsel.Biz Hours" /></td>
  <td><img src="http://dekstop.de/weblog/2006/06/midnightbot/wortwechsel.biz_days.png" width="16" height="14" alt="Wortwechsel.Biz Days" /></td>
  <td>Designer/Developer</td>
  <td>"Work first."</td>
</tr>
<tr>
  <td><a href="https://events.ccc.de/congress/2006/wiki/Special:Recentchanges">23C3 Wiki</a></td>
  <td align="right">77</td>
  <td><img src="http://dekstop.de/weblog/2006/06/midnightbot/events.ccc.de-wiki_hours.png" width="50" height="14" alt="Events.Ccc.De Wiki Hours" /></td>
  <td><img src="http://dekstop.de/weblog/2006/06/midnightbot/events.ccc.de-wiki_days.png" width="16" height="14" alt="Events.Ccc.De Wiki Days" /></td>
  <td>Unwashed masses</td>
  <td></td>
</tr>
<tr>
  <td colspan="6" class="caption">
    <strong>#</strong> = number of analyzed articles.<br />
    <strong>24h</strong> = posting frequency over the time of day (starting at 0:00, ending at 23:00).<br />
    <strong>7d</strong> = posting frequency over the day of week (starting on Monday).<br />
    <strong>Occupation</strong> = author's job description (mostly a guess).<br />
    <strong>Motto</strong> = summary of my subjective evaluation.
  </td>
</tr>
</table>

<h3>Interpretation</h3>

<p>Note that there is a distinct group of people who don't seem to have stable sleep cycles -- either they travel a lot in completely different timezones, or they must have wildly interesting jobs.</p>

<p>Note that there is a distinct second group of people whose posting count before noon resembles a flat line.</p>

<p>The author of inessential.com is also the author of ranchero.com -- the former site is his private blog. Now compare the slightly different posting behavior. Then compare with "My Boring-Ass Life", who has virtually the same curve, but also posts on weekends.</p>

<p>The prominent spikes of wortwechsel.biz can be attributed to two reasons: the low article count (it's a new blog), and the fact that one of the authors for a long time didn't have an Internet connection at home, which meant he only posted from the office during lunch break and after work. (I asked.)</p>

<p>"23C3 Wiki" is not a blog, it's the Recent Changes feed of the Chaos Communication Congress Wiki for 2006. I thought it would be fun to compare all the blog curves with one 'collaborative' curve.</p>

<h3>Acquisition Problems</h3>

<p>I originally thought that the data required for this 'survey' was easy to come by: all I wanted was the date and time of the last 100 articles of a blog. Turns out it's not that easy, which means that the number of sites involved in this test is a lot smaller than I initially imagined. </p>

<p>This has two reasons: It turned out to be hard work to <em>extract</em> the data; and some authors don't even seem to publish it.</p>

<p>There seems no simple and generic method to query date and time of an arbitrary number of articles for an arbitrary blog. RSS feeds, usually a good source for extracting such data, generally publish only the last 10-20 articles; but I really wanted more than a handful of articles to make this exercise meaningful. In the end I <a href="http://en.wikipedia.org/wiki/Screen_scraping">scraped</a> the data off individual HTML pages, which involves more work than simply parsing a feed (because each site has a different HTML layout and URL scheme).</p>

<p>Another problem was that some sites I wanted to include don't seem to publish the time of day of their articles outside of their feeds -- this includes really cool sites like JoelOnSoftware, DaringFireball, The Dilbert blog, the Macromates blog, and others.</p>

<p>Let me know if you want me to include additional sites -- or even better, send me the data.</p>

<p><small>In case you wonder about the article title: while scraping the data I sent this HTTP UserAgent-header: <tt>MidnightBot 0.1 (http://dekstop.de/midnightbot/)</tt></small></p>

<h3>Related Articles</h3>
<ul class="links">
  <li><a href="http://dekstop.de/weblog/2006/01/visualization_of_numeric_data/">Visualization of Numeric Data: A Brief Historical Overview</a></li>
  <li><a href="http://dekstop.de/weblog/2006/06/data_mining_for_world_peace/">Data Mining for World Peace</a></li>
  <li><a href="http://dekstop.de/weblog/2005/10/search_engine_referers/">The Wonderful World of Logfile Analysis, Part One: Search Engine Referers</a></li>
  <li><a href="http://dekstop.de/weblog/2005/11/delicious_collaborativerank/">CollaborativeRank Says I'm an Expert on XML, Mining and Validation</a></li>
</ul>]]></description>
      <dc:creator>Martin Dittus</dc:creator>
      <category>data mining</category>
      
      <guid isPermaLink="true">http://dekstop.de/weblog/2006/06/midnightbot/</guid>
      <pubDate>Thu, 29 Jun 2006 00:39:21 GMT</pubDate>
    </item>
  </channel>
</rss>

