Research talk:Page view
G
Not sure why it says G after M here[1]
Should this not be B for billion? Doc James (talk · contribs · email) 08:04, 21 February 2018 (UTC)
- Remember, international project, not US-only. Use w:Metric prefix and stay away from the mess of w:Long and short scales. — Jeblad 11:33, 31 October 2018 (UTC)
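To illustrate the metric-prefix convention mentioned above, here is a minimal sketch of how a view count could be rendered with SI prefixes (k, M, G, T), which is why G rather than B follows M. The function name `format_si` and the rounding to two decimals are assumptions made for this example, not the code the page view tools actually use.

```python
def format_si(n: float) -> str:
    """Format a count with an SI (metric) prefix instead of 'million'/'billion'.

    Hypothetical helper for illustration only.
    """
    prefixes = ["", "k", "M", "G", "T"]  # 10^0, 10^3, 10^6, 10^9, 10^12
    value = float(n)
    idx = 0
    # Divide by 1000 until the value fits under the current prefix.
    while abs(value) >= 1000 and idx < len(prefixes) - 1:
        value /= 1000.0
        idx += 1
    # Two decimals at most, with trailing zeros and a dangling dot removed.
    text = f"{value:.2f}".rstrip("0").rstrip(".")
    return f"{text} {prefixes[idx]}".strip()


print(format_si(16_000_000))     # 16 M
print(format_si(1_500_000_000))  # 1.5 G
```

With this scheme 1.5 × 10^9 views is written "1.5 G", which sidesteps the long-scale versus short-scale ambiguity of "billion".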
Page views by country
Looking at this as a map view is pretty much meaningless. When a traffic number is shown per country, it must be normalized by population, and not by the raw population count but by the internet population, that is, the people who have a reasonable internet connection.
The map view also follows the present borders of the countries, but for some ethnic groups this does not make sense. There is, for example, a Wikipedia project in the Northern Sami language, and that ethnic group spans several countries. I wonder if it would be better to map the traffic to a grid and then overlay that grid with borders. Something like that would better track smaller ethnic groups inside large countries. — Jeblad 11:48, 31 October 2018 (UTC)
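The two suggestions above could be sketched roughly as follows. This is only an illustration under stated assumptions: the function names, the input dictionaries, and the availability of per-request coordinates are all hypothetical (the published page view data is aggregated per country, so the grid variant would need a different data source).

```python
import math
from collections import Counter


def views_per_thousand_internet_users(views, internet_users):
    """Normalize raw views by internet population.

    views, internet_users: dicts mapping a country code to a count
    (hypothetical inputs). Countries with no known internet population
    are skipped rather than divided by zero.
    """
    return {
        country: 1000.0 * count / internet_users[country]
        for country, count in views.items()
        if internet_users.get(country)
    }


def bin_to_grid(points, cell_deg=1.0):
    """Aggregate (lat, lon, views) points into square grid cells.

    Returns a Counter keyed by (row, col), where each cell spans
    cell_deg degrees. Borders can then be drawn on top of the grid.
    """
    grid = Counter()
    for lat, lon, count in points:
        cell = (math.floor(lat / cell_deg), math.floor(lon / cell_deg))
        grid[cell] += count
    return grid
```

For example, two countries with 500 and 1000 views and internet populations of 5000 and 10000 both come out at 100 views per thousand internet users, even though their raw counts differ by a factor of two.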
Possible error in path for new pageviews files
I noticed that the recent files in the pageview_complete folder have the following path:
- https://dumps.wikimedia.org/other/pageview_complete/2023/2023-03/pageviews-20230313-/user.bz2
- https://dumps.wikimedia.org/other/pageview_complete/2023/2023-03/pageviews-20230314-/user.bz2
Usually the path looked like this:
Also, there are no spider pageviews for 20230313 and 20230314. Why? (Just in case, pinging @KZimmerman (WMF) @Mayakp.wiki) Prof.DataScience (talk) 18:42, 15 March 2023 (UTC)
- Thank you for pointing this out, @Prof.DataScience, there are two separate issues that are both my fault Milimetric (WMF) (talk) 19:21, 15 March 2023 (UTC):
- 1. The path was wrong due to a bug in my code. That is now fixed, and the incorrect file paths have been updated.
- 2. The lack of spider dumps is on purpose. These dumps were using up a lot of compute and space, and were not found to be useful to anyone we asked. Please let me know if this is not the case. (Spiders here are well-known internet crawlers that crawl pretty much everything every day, so their traffic is usually just a lot of noise.)
What went wrong yesterday?
A number of tools are showing zero hits for English Wikipedia yesterday (Sunday 19 November). See for example the bottom of this table. Are those stats gone forever? Is this going to be a recurring problem? Thanks in advance for any help. MartinPoulter (talk) 13:08, 20 November 2023 (UTC)
- The missing data have been restored and the page view analysis is now giving the answers I want. This is good news and answers my query. MartinPoulter (talk) 12:41, 21 November 2023 (UTC)