User:Mutante/Wikistats

After reading a request for bot from people manually updating large statistics tables, I created some scripts to automatically retrieve statistics from various Mediawikis, store them in a local db, sort them by size and output tables in html and wikisyntax for pasting to wiki pages. They are not bots posting automatically but just provide the wiki syntax for human pasting.

robiH: Added some wikis, updated add form, turns out many of them dont have "raw" anyways, many old mediawiki versions around (admins should probably be informed, because dangerous bugs in older Mediawiki versions anyways. Would you help add a few wikis? See details here: http://wikiindex.com/User:Mutante you can delete this after reading

Hiya. I love what you have done with this page. Just wondering whether it would be possible to do something similar for http://www.wikipedia.org which is a very high profile page, but which is much less frequently updated (because it is locked). It is often quite out of date. In this case I think the best idea would be to actually have the script update the whole page by itself. What do you think? GeorgeStepanek 07:03, 14 February 2006 (UTC)

Well, yes it probably would, but I am not an admin who can edit that page and i think you need great consensus before editing the main portal page. Is it a wiki page at all or just html? Mutante 00:21, 15 February 2006 (UTC)

The main page is wiki-editable HTML code which is kept at Www.wikipedia.org template. I agree that one would need consensus for any changes to the appearance of the page, but my suggestion is just to change the way the page is updated, not how it looks. There's been a feature request for something like this ever since the new version of the portal went live, but I don't think that what CatherineMunro has asked for is actually feasible. GeorgeStepanek 05:20, 15 February 2006 (UTC)

I love it also. Is that running every hour? Can you update the script so that it tracks tiers (100+, 1000+, 10,000+) as well as the 10 main languages? The best thing to do atm would be to have that regenerate the wikitext on the temporary Www.wikipedia.org template page on meta... We should do this asap :-) Sj 02:25, 27 February 2006 (UTC)

I included your suggestions as tasks into the Todo list further up this page. Now you see what is to be your part of it. --robiH

Its updated every 4 hours. The reason i only made the upper part with the 10 biggest ones is that for the lower part i would also need all the translations of "Main Page" in the script/database, because they all link to the local language version of that, if i could just link them all xx.wikipedia.org/wiki/ it would make it easier and would also redirect ok, but not look so nice. Mutante 11:41, 27 February 2006 (UTC)

I have included the "tiers" aka "milestones" now. They should be working correct. Please check my current result hereMutante 19:49, 27 February 2006 (UTC)

This looks great. I am very impressed. The "globe" is perfect, and could be put in place right now. There's still a couple of glitches with the tiers—it includes some top-10 languages, and I don't think the line-breaks work as well as in the previous version—but it's pretty much there. GeorgeStepanek 09:45, 28 February 2006 (UTC)

The appearance is just right now, I think: the glitches I mentioned have definitely been fixed. I'm checking it against the original version at http://www.wikipedia.org/ and the only thing I can see now that's different is the ordering. In the top 10 Dutch has recently passed Italian and Swedish. For the tiers I don't actually know how exactly the sort order is determined—but it's not quite the same as yours. We may need to enlist a language expert like Node ue to define a fixed sort order for all our languages, that your script can then use when it's populating the tiers. But as far as I'm concerned I'm very happy with it. Thank you for all the hard work you've put into this! GeorgeStepanek 05:16, 6 March 2006 (UTC)

Yes, the sort order, somehow i couldnt find any specific order used in the original site. I tried to "order by good desc" (by good articles descending) and "order by loclang" (by local language alphabetically). Now i changed all three tiers to be sorted "by good desc". The sort order in the Top 10 circle itself is still made manually though still,turned out to be harder to do automatically because of those changing layout <div's in there. Mutante 11:14, 6 March 2006 (UTC)

Oops, the top 10 now looks a bit wonky: I think in updating the order you gave Italian the wrong horizontal offset. Oh yes, and Italian has again overtaken Swedish—sorry! I might just ask Node ue (if you don't mind) what the sort order actually is, because he was one of the ones who did that bit. GeorgeStepanek 04:58, 7 March 2006 (UTC)

Sure, ask him. I fixed that Italian offset, but i still think about how to implement automatic sort order inside the actual circle without getting those problems. Mutante 17:54, 7 March 2006 (UTC)

I would sort by language code. It can be potentially confusing -- "Suomi" is at fi:; "Magyar" is at hu:... but it also solves the sort order with the scripts. If you want to try to compromise the two, you can resort some of the most confusingly offset ones (instead of ones that are just one or two places off) manually. --Node ue

Thanks for alerting me to this, George! This looks fantastic, Mutante -- it would be wonderful to have this always up to date. My inclination would be to sort by language code too, but there has been extensive discussion on this before -- you might want to review and discuss it with the prior participants as well. However, I would say if it's simple to get the bot to sort by language code, you needn't jump through great hoops to create a custom sorting order either. Good luck with this, I'll be keeping an eye on it! Catherine 21:00, 28 March 2006 (UTC)

Links to the various Special:Statistics pages at the List of Wikipedias page have all broken, because your tool now links to "[[xx:Special:Statistics?action=raw|000]]", which doesn't work. You'd either have to provide the full URL as before, or link to something like "[{{fullurle:xx:Special:Statistics|action=raw}} 000]". – Minh Nguyễn(talk, contribs) 00:43, 19 March 2006 (UTC)

Thanks for the hint. I have tried [{{fullurle.. and [{{fullurl.. but neither hides the external link arrow. Also i found this "There is a class that can be used to remove the arrow image from the external link. It is used in Template:Ref to stop the URL from expanding during printing. It should never be used in the main body of an article." [2]. Looks like there is no way to have a working link to Special:Statistics?action=raw but without the arrow then? Mutante 15:04, 19 March 2006 (UTC)

To hide the arrow, you can either use class="plainlinks" or class="plainlinksneverexpand" – when printing the latter, the URL doesn't show up in parentheses after the link. I'd suggest the latter, since the URLs would really mess up the page layout when printing. I don't think there'd be a problem with applying that class to the entire table, since you are pointing to Wikimedia projects, anyways. Also, Special:Listadmins now redirects to Special:Listusers/sysop. – Minh Nguyễn(talk, contribs) 22:07, 19 March 2006 (UTC)

By the way, all the statistics links are still broken, because they say "{{fullurle:xx:Special:Statistics?action=raw}}" instead of "{{fullurle:xx:Special:Statistics|action=raw}}". – Minh Nguyễn(talk, contribs) 06:28, 26 March 2006 (UTC)

Thanks for keeping me updated. Added them all to the table. Mutante 06:24, 26 March 2006 (UTC)

I spotted a couple errors: "Franco-Provençal/Arpitan" should just be "Arpitan" in the local name column, and Pennsylvania Dutch / Deitsch (pdc:) was incorrectly given the name of "Ligurian"/"Liguru" (should be Líguru, anyhow), putting that 100+ edition in the zero-article section. – Minh Nguyễn(talk, contribs) 06:32, 26 March 2006 (UTC)

Okay, it looks like it was because others had just added those languages manually, rather than updating from your tool. I'll go ahead and do that now. – Minh Nguyễn(talk, contribs) 06:37, 26 March 2006 (UTC)

I have seen the stats [3]. Thanks a lot for that. I was wondering where did you find the number of pages viewed (2822795 for the French wikipedia for example). Is it a monthly figure ? Thanks in advance. Poppypetty 19:42, 29 March 2006 (UTC)

Click onto the figure and you see the page where it is being taken from every six hours. RobiH 09:58, 30 March 2006 (UTC)

It is being read from Special:Statistics?action=raw URL, so the numbers are provided by Mediawiki. About the "views" column i am not sure how it is being filled exactly. It seems some Wikis have stopped counting for some reason. Mutante 11:20, 30 March 2006 (UTC) btw: I just fixed the table to be valid xhtml again and added the "Started" (starting date) column,which can now also be sorted by, so you can look at "oldest wikis" and stuff. Mutante 11:20, 30 March 2006 (UTC)

I dont think it is possible to create all this with only the information "number of admins" provided in Special:Statistics. It probably would be possible for someone with direct database access, but im not one of them. Mutante 08:53, 18 March 2007 (UTC)