Wednesday, 19 March 2008

Some number crunching.

Today we will do some number crunching. Basic statistical nonsense, only for the people who are interested. First, some current numbers:

- Item count: 93,445.
- Daily items: around 4,000 on weekdays, around 2,500 on weekends.
- Feed count: 183.

Our short-term target (within two weeks) is a feed count of 250 or higher.

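With this target, and assuming the daily item count grows in proportion to the number of feeds (a factor of 250/183, roughly 1.37), we can project some future numbers: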

- Daily items: around 5,480 on weekdays, around 3,425 on weekends.
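
For the curious, the projection can be reproduced with a few lines of Python. This is only a sketch: it assumes items scale linearly with the feed count and that the growth factor is rounded to two decimals (1.37), which is what the figures above appear to use. The function name `project_daily_items` is made up for this post.

```python
CURRENT_FEEDS = 183
TARGET_FEEDS = 250


def project_daily_items(daily_items, current_feeds=CURRENT_FEEDS,
                        target_feeds=TARGET_FEEDS):
    """Scale a daily item count linearly with the number of feeds.

    Assumption: every new feed contributes about as many items as an
    existing one, and the growth factor is rounded to two decimals.
    """
    growth = round(target_feeds / current_feeds, 2)  # 250 / 183 rounds to 1.37
    return round(daily_items * growth)


weekday = project_daily_items(4000)
weekend = project_daily_items(2500)
print(weekday, weekend)
```

Without rounding the growth factor first, the results come out slightly lower (roughly 5,464 and 3,415), so treat these as ballpark numbers.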

Our search index currently measures 65.4 MB, which is a nice number. We also have 6.2% of items with the exact same title, meaning they were taken from press wires or copied from each other. Over three-quarters of our index consists of items that describe the same event or story but come from different sources.

We currently index 2.8 items per minute from all around the globe. We have feeds in 13 languages from 21 countries.

What do all these numbers mean? They show our growth. In two weeks I will publish this post again with the new numbers, so we can show some cool growth statistics. We can then make this a biweekly event showing our progress in numbers.

We are planning to go into public beta when our index hits 250,000 items, which we consider the critical mass needed to offer enough data. At our current indexing rate the 250,000-item mark will be reached in about 6 weeks and 2 days, sooner if we keep adding feeds and sources.
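
The estimate can be checked with a rough calculation. The sketch below assumes the current rates stay flat (4,000 items on each of 5 weekdays, 2,500 on each of 2 weekend days), averages them over a full week, and ignores which day of the week we start on. The function name `time_to_target` is made up for this post.

```python
import math

CURRENT_ITEMS = 93_445
TARGET_ITEMS = 250_000
WEEKDAY_ITEMS = 4_000   # items per weekday
WEEKEND_ITEMS = 2_500   # items per weekend day


def time_to_target(current, target, weekday_rate, weekend_rate):
    """Return (weeks, days) until the index reaches the target size.

    Assumption: the indexing rate stays constant and is averaged over
    a full week of 5 weekdays and 2 weekend days.
    """
    items_per_week = 5 * weekday_rate + 2 * weekend_rate  # 25,000
    remaining = target - current                          # 156,555
    total_days = math.ceil(remaining * 7 / items_per_week)
    return divmod(total_days, 7)


weeks, days = time_to_target(CURRENT_ITEMS, TARGET_ITEMS,
                             WEEKDAY_ITEMS, WEEKEND_ITEMS)
print(f"{weeks} weeks and {days} days")
```

Adding feeds raises both daily rates, which is why the real date should come sooner than this flat-rate estimate.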

We'll see.
