Following up from the last thread : This is what the present page counts of translations are : fr 19 en 1089 zh 407 de 7 zg 1 es 116 so, just taking a sample of 'es', here is what we get: [kbsingh at zafarnama bhashet]$ ./age_test | grep '^es(2' | head -4 es/About is 119 days older than About es/AdditionalResources looks about as old as AdditionalResources es/AdditionalResources/HardwareList is 420 days older than AdditionalResources/HardwareList es/AdditionalResources/Mirrors looks about as old as AdditionalResources/Mirrors I've attached the complete output from the age_test for 'es' along with this email, to make it slightly easier for people to import into something more productive than email buffers. Now the thing is, if this sort of a status report helps I'd like to perhaps plumb this into a wiki hosted page itself, and have it refresh every day. There are a few assumptions made in the code: - the name of the page is the same in all languages, including path - all languages start with a 2 character language code (1) - all pages are 'original' in english - 'age' is taken as timestamp of last edit Other things - we need to find someway of marking pages we dont care about ( eg. people's pages ) ( free win -> being able to generate list of pages that dont exist in a specific language but do in English ) - Can we 'inject' something into the original page that says 'this page is YY days older than the original page, so look there as well' Comments, thoughts, views ? - KB 1: the zh/tw/ namespace breaks this - and so its not a separate count from zh/ for now. I will do something about that for the next run. -------------- next part -------------- An embedded and charset-unspecified text was scrubbed... Name: es_wiki_aging.output Url: http://lists.centos.org/pipermail/centos-docs/attachments/20100215/ed6b3cb1/attachment.pl