Jump to content

Talk:List of Wikipedias by sample of articles/Articles

Add topic
From Meta, a Wikimedia project coordination wiki

How is this working? I'm noticing that sw:Kiswahili wikipedia has a long list of articles supposed to be missing, but that exists. Should I update this manually, or will a bot fix it? Tanzania 12:43, 28 February 2010 (UTC)Reply

This table is just the articles over 10k in length. The "Stubs" table lists the articles with less than 10k. Swahili WP has 700 stubs but that list won't contain them since it won't list wikis with more than 100 items (to avoid making an incredibly long wiki document). This list gets updated every month with a script. --MarsRover 18:26, 28 February 2010 (UTC)Reply


Could someone explain how the scores are calculated? IThe list for yiwiki articles between 10,000 and 30,000 characters includes London, whose article in yiwiki is more than 40,000 characters long.

Is this correct? --Redaktor (talk) 21:45, 14 April 2013 (UTC)Reply

No, it is 40,000 bytes long. This list uses charactors which is London is around 28,000 characters long. This is because of the encoding of the hebrew characters sometimes mean two bytes are used for one character. --MarsRover 22:43, 14 April 2013 (UTC)Reply
Thanks. How can one figure out the length of an article in characters? --Redaktor (talk) 10:21, 19 April 2013 (UTC)Reply
Cut-and-paste the article text into MSWord (or similar word processer) and then look at the statistics window. --MarsRover 16:55, 19 April 2013 (UTC)Reply

Bashkir Wikipedia

[edit]

Yerpo MarsRover Could you include the Bashkir Wikipedia in this list. We would like to improve our rating. We are in 17th place. ZUFAr (talk) 15:47, 23 April 2023 (UTC)Reply

Error for arywiki

[edit]

@Dcirovic @Yerpo There's an error for the article about the English language on arywiki which is listed as 10k-30k, but is actually above 35k. I didn't check if there are were other errors. Ideophagous (talk) 21:53, 6 September 2024 (UTC)Reply

@Ideophagous: Wikipedia uses a variable-length character encoding standard, UTF-8. Depending on the alphabet, various letters could require from 1 to 4 bytes to be stored, and therefore comparing byte sizes of articles written in different alphabets would be contraproductive.
The article-size in our context refers to the number of characters, and not to the number of bytes. The character count includes white spaces and other interruption signs. The HTML commented text is excluded.
The article about the English language on arywiki currently contains 22,758 characters. --Dcirovic (talk) 23:03, 6 September 2024 (UTC)Reply
@Dcirovic Alright, thank you for the clarification. Have an excellent day! :) Ideophagous (talk) 05:48, 7 September 2024 (UTC)Reply