Talk:List of Wikipedias by sample of articles/Articles
How is this working? I'm noticing that sw:Kiswahili Wikipedia has a long list of articles that are supposedly missing but actually exist. Should I update this manually, or will a bot fix it? Tanzania 12:43, 28 February 2010 (UTC)
- This table is just the articles over 10k in length. The "Stubs" table lists the articles with less than 10k. Swahili WP has 700 stubs but that list won't contain them since it won't list wikis with more than 100 items (to avoid making an incredibly long wiki document). This list gets updated every month with a script. --MarsRover 18:26, 28 February 2010 (UTC)
Could someone explain how the scores are calculated? The list for yiwiki articles between 10,000 and 30,000 characters includes London, whose article in yiwiki is more than 40,000 characters long.
Is this correct? --Redaktor (talk) 21:45, 14 April 2013 (UTC)
- No, it is 40,000 bytes long. This list counts characters, and London is around 28,000 characters long. The difference arises because the encoding of Hebrew characters sometimes means two bytes are used for one character. --MarsRover 22:43, 14 April 2013 (UTC)
- Thanks. How can one figure out the length of an article in characters? --Redaktor (talk) 10:21, 19 April 2013 (UTC)
- Cut-and-paste the article text into MS Word (or a similar word processor) and then look at the statistics window. --MarsRover 16:55, 19 April 2013 (UTC)
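The gap between byte length and character length described in this thread can also be checked with a short script. The following is a minimal sketch in Python, assuming the article text has already been copied into a string and that "characters" means Unicode code points; how the list's own script counts is not confirmed here. The function name `length_stats` and the six-letter Hebrew-script sample are hypothetical and are not taken from the actual London article.

```python
def length_stats(text: str) -> tuple[int, int]:
    """Return (number of characters, number of UTF-8 bytes) for text.

    len() on a Python str counts Unicode code points, which is assumed
    here to match what the list means by "characters".
    """
    return len(text), len(text.encode("utf-8"))


# Hypothetical sample: six Hebrew-script letters, written as escapes so the
# source stays readable regardless of text direction.
sample = "\u05dc\u05d0\u05e0\u05d3\u05d0\u05df"

chars, size = length_stats(sample)
print(f"{chars} characters, {size} bytes")
```

Each Hebrew-script letter takes two bytes in UTF-8, so the byte count for such text is roughly double the character count, while plain Latin text has equal counts.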
Bashkir Wikipedia
Yerpo MarsRover Could you include the Bashkir Wikipedia in this list? We would like to improve our rating. We are in 17th place. ZUFAr (talk) 15:47, 23 April 2023 (UTC)
Error for arywiki
@Dcirovic @Yerpo There's an error for the article about the English language on arywiki, which is listed as 10k-30k but is actually above 35k. I didn't check whether there were other errors. Ideophagous (talk) 21:53, 6 September 2024 (UTC)
- @Ideophagous: Wikipedia uses a variable-length character encoding standard, UTF-8. Depending on the alphabet, a letter can require from 1 to 4 bytes of storage, so comparing the byte sizes of articles written in different alphabets would be counterproductive.
The article size in our context refers to the number of characters, not the number of bytes. The character count includes white space and punctuation marks. Text inside HTML comments is excluded.
The article about the English language on arywiki currently contains 22,758 characters. --Dcirovic (talk) 23:03, 6 September 2024 (UTC)
- @Dcirovic Alright, thank you for the clarification. Have an excellent day! :) Ideophagous (talk) 05:48, 7 September 2024 (UTC)
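The counting rule described in this thread (characters rather than bytes, white space and punctuation included, HTML comments excluded) could be approximated as follows. This is only a sketch in Python under stated assumptions: the function name `article_char_count` and the regular expression are hypothetical, and the script that actually generates the list may apply further rules not mentioned here.

```python
import re

# Assumption: an HTML comment is anything between "<!--" and the next "-->",
# possibly spanning several lines (hence re.DOTALL).
HTML_COMMENT = re.compile(r"<!--.*?-->", re.DOTALL)


def article_char_count(wikitext: str) -> int:
    """Count characters in wikitext after removing HTML comments.

    White space and punctuation are kept, so they count toward the total.
    """
    return len(HTML_COMMENT.sub("", wikitext))


def utf8_byte_count(wikitext: str) -> int:
    """Count UTF-8 bytes of the raw wikitext, for comparison."""
    return len(wikitext.encode("utf-8"))
```

For an article written in a script whose letters take two bytes each in UTF-8, `utf8_byte_count` returns a figure well above `article_char_count`, which is consistent with a page reported as over 35k bytes falling into the 10k-30k character range.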