Archive: May 2, 2003

<<< May 1, 2003

Home

May 3, 2003 >>>


Friday,  05/02/03  08:31 AM

A little morning bloggin' while reading the news...

Victor David Hanson needs a blog!  But in the meantime he writes articles for NRO, and others link to him; a kind of virtual blog.  Anyway, his latest Geriatric Teenagers is right on the money, as usual.  The Paris-Berlin-Moscow axis is so full of contradictions you hardly get credit for pointing them out, unless you write as well as Victor...  [ via VodkaPundit ]

So, are you in a hurry to try Windows Server 2003?  I didn't think so.  Isn't it interesting that many companies figure "thrice burned, twice shy"?  According to C|Net, 60-70% of U.S. companies are still on NT 4, and are in the process of upgrading to Windows 2000 (which came out three years ago).  Gartner is quoted as saying they don't expect many Windows 2000 customers will upgrade until around 2005.

Did you see this story?  A mountain climber pinned by boulders for five days freed himself by amputating his own arm with a pocket knife.  Wow.

Finally, let me be the 2 millionth blogger to comment on Why Blogs Haven't Stormed the Business World.  So, the premise is foolish.  Why would blogs storm the business world, anyway?  We're talking about a personal publishing tool, not something to boost productivity.  The essence of the complaint seems to be "it is too hard to separate the information from the style".  But that is the fun of blogs!  They're personal!  This is why I don't like RSS aggregators (even really cool ones like SharpReader and my new favorite NewsGator): the personal style of the site doesn't come through.  If you want information exchange, use spreadsheets...

 

Outbound Trackbacks

Friday,  05/02/03  02:44 PM

For the web nerds among you...  (yeah, you!)

I implemented "outbound trackbacks" today.  Essentially a trackback is a way to tell someone: "hey, I linked to your site".  To post a trackback to somebody their site has to support "inbound trackbacks".  This is not yet a widespread feature; I discovered that since the start of the year I've made 1188 links to other sites, of which 28 were trackback-enabled.  Hardly seems worth it, except that I'm sure this will become more popular over time.

I'm still deciding whether to implement "inbound trackbacks".  This would allow me to know when someone has linked to me, but only if they have a trackback-enabled site.  I think for now I'm going to keep looking through my referer logs instead...  Not only does this cover every inbound link (including those from non-trackback-enabled sites), but it tells me when the link was used, which is actually a little more interesting than whether it exists.

Trackbacks are pretty simple; the concept was developed by the folks at Movable Type (a popular blogging tool), and the specification is on their site.  My implementation was to write a script which will run once a day and process all new posts and articles.  For each link in each post, the script retrieves the linked-to page and looks for RDF information in the page which describes the trackback.  (If there isn't any the site isn't trackback enabled, and you're done.)  If there is a trackback URL, you make an HTTP POST to it giving your URL, your site name, and an optional excerpt (there's a good example in the spec).  That's it.

The most interesting part of the script creates a reasonable "excerpt":

grep "$url" $file |
sed "s/<[^>]*>//g;s/&amp;/&/g;s/&lt;/</g;s/&gt;/>/g" |
cut -c1-252 |
sed "s/\$/%24/g;s/&/%26/g;s/+/%2B/g;s/=/%3D/g;s/?/%3F/g;s/ /+/g" |
sed "s/+[^+]*$//;;s/.$/&.../"

Yeah, I know, nerdy.  The grep gets the paragraph containing the link.  The first sed converts the HTML into text, throwing away tags.  The cut truncates the excerpt at 252 characters.  The second sed URL-encodes the excerpt, and the final sed appends a "..." to the end.  Voila.

If all sites were trackback-enabled in both directions, it would have the effect of making all links two-way; for any page you would know all the links to it, from all over the web.  I doubt this will ever happen; for one thing the information is not always useful and could be huge (imagine all the inbound links to the Google home page, for example).  But it is a cool thing in the blogosphere, and I expect all the popular blogging tools will support it...

 
 

Return to the archive.