Search This Blog

Loading...

Tuesday, May 6, 2008

Technorati Authority Calculator

Technorati is a search engine specifically for searching blogs. At least that's what the Wikipedia says. Despite this, I never am able to draw any traffic from it. To be fair, I don't draw a lot of traffic from other search engines, such as Google or Yahoo search either; search engines only constitute 8% of all the incoming traffic. Out of the 8%, traffic from Technorati contributes 0%. So Technorati is really of not much help to me in terms of drawing traffics.

But still I find Technorati fascinating. It provides a view blog reaction widget at the bottom of each post that allows me to see the posts that link to my post. The internet surfer always take links for granted. But for bloggers, links are important because it is a sign that people has read your posts. Not only that, links are the most crucial equations in any search engines. A high ranking website is always a website that has a lot of links. So the higher the blog reaction is, the better off your website is.

There is another widget that I like: The Authority widget.

The Authority widget is a widget that shows the total number of unique sites that link to you. If you get 10 links from 1 website, then your authority is still 1, because there is only one unique site that links to you. In a sense, the higher authority you have, the more well-read you are.

My current authority is 27, not 575 as shown on the image above.

But recently I want to know what are the unique websites that link to me. For me, merely knowing that there are 27 websites that link to me is not sufficient; I want to know who is linking to me. There are two ways one can do this.
  1. Go to the Technorati's page on the blog and see the blog reactions and filter out the repeated websites manually,
  2. Use Technorati API to do the job.
The only sustainable option is, of course, 2. Plus I wanted to learn some Python. So I went to the Technorati Developer Center, download the code samples and made some modifications to suit my need. The program I wrote returned me a list of the unique websites:

['http://arjansworld.blogspot.com',
'http://blog.cwa.me.uk',
'http://chadchapol.wordpress.com',
'http://code-inside.de/blog',
'http://code-inside.de/blog-in',
'http://damonpoole.wordpress.com',
'http://dellacordova.blogspot.com',
'http://dotnet.org.za/trumpi',
'http://dotnet.org.za/trumpi/default.aspx',
'http://matfoo.wordpress.com',
'http://mhinze.com',
'http://news-open-source.com',
'http://rtipton.wordpress.com',
'http://scruffylookingcatherder.com',
'http://scruffylookingcatherder.com/Default.aspx',
'http://sqlservercode.blogspot.com',
'http://startbigthinksmall.wordpress.com',
'http://stochasticgeometry.wordpress.com',
'http://sudeepdsouza.blogspot.com',
'http://the-lazy-programmer.com/blog',
'http://www.alvinashcraft.com',
'http://www.aqym.com',
'http://www.dotnetkicks.com',
'http://www.lazycoder.com',
'http://www.lazycoder.com/weblog',
'http://www.plentyofcode.com',
'http://www.theruntime.com/blogs/jacob/default.aspx']

There are 27 of them, so I guess that my program should be correct. I don't know how can Technorati count some websites twice. For example, 'http://www.lazycoder.com' and
'http://www.lazycoder.com/weblog' are treated as two different websites, even though they should be the same.

For those of you who are interested in my program, you can get the code here at Google Code.

No comments: