User:Renklauf/Archive 6
Monday July 16th
- Analysis Source Documentation
- Continue porting docs into sphinx
- link to the auto-generated docs on http://community-analytics.wikimedia.org/
- Sync up with Dario and Giovanni on metrics standardization
Tuesday July 17th
- Analysis Source Documentation
- Continue porting docs into sphinx
- finding a host for these docs (aluminium is not an option)
- Meet with Amit and Evan regarding Wikipedia Zero mobile log analysis; coordinate work on Wikipedia Zero traffic analysis
Wednesday July 18th
- Analysis Source Documentation
- Continue porting docs into sphinx
- Improve documentation in code generated by autodocs
- Chat with Ori/Dario/Evan regarding hardware for E3/Global Dev Analysts
Thursday July 19th
- Analysis Source Documentation
- Continue porting docs into sphinx
- Improve documentation in code generated by autodocs
- 1000K editor post analysis
Friday July 20th
- Analysis Source Documentation
- Continue porting docs into sphinx
- Improve documentation in code generated by autodocs
- Code review: https://www.mediawiki.org/wiki/Special:Code/MediaWiki/95382
- Sort out WikiSym travel and registration
- Coordinate with ops/analytics on migrating the fundraiser analytics front-end and the community analytics front-end
Monday July 23rd
- Finalize flights for WikiSym
- Discuss metrics and central auth with Dario
- add comments to metrics etherpad (30 minutes to 1 hr)
- centralauth localuser and globaluser counts from fenari
- create a snapshot of globaluser on db1047
Tuesday July 24th
- Post analysis results of 1000K editors for 30 day period after barnstar
- Discuss metrics w/ Dario, Giovanni, and Aaron
- review Dario's notes
- pare down metrics
- Follow-up on code review: https://www.mediawiki.org/w/index.php?title=Special:Code/MediaWiki/95382&path=#c32923
- Determine where bucketing data is being collected for PEF-0 - chat w/ Dario
Wednesday July 25th
- Review the documentation for the "Wikimania unconference experiment"
- Add requirements as needed
- follow up with E3 on potential timeline and implementation
- Code Documentation and Refactor
- Continue building docs on existing classes
- Generalize functionality for analysis class methods where needed
- Work on metrics docs (dario, aaron, giovanni) - https://meta.wikimedia.org/wiki/Research:Metrics
- build meta pages on metric definitions
- work on any ancillary documentation including: motivation, use, implementation (data sources code)
- Add "Editor Milestones" data to datahub
Thursday July 26th
- Verify data collection in emery logs for PEF-1 deployment
- pending deployment to test wiki
- Review "Editor Milestones" conclusion on research page
- Meet with Howie et al. to discuss measurement metrics
- Determine where code base will live for metrics hooks
- PyDocs
- DataMapper module
Friday July 27th
- Metrics (new users) documentation
- come up with a clearer Parameterization
- Determine where code base will live for metrics hooks
- PyDocs & Refactor
- Additional DataLoader Classes
- metrics generation code
Monday July 30th
- Metrics (new users)
- re-review docs, try to better describe existing metrics plus use cases (links to experiments)
- continue work on code base for metric generation
- Pydocs & refactor
- DataReporting Module
- "message_template" reporting code from WSOR repo
- start planning migration to Git
- Annual Review with Karyn
Tuesday July 31st
- PEF-1, verify data in logs, review analysis work, determine timeline, sync with Dario/Aaron here
- Drop PEF records into db42.rfaulk
- Count events
- Pydocs & refactor
- DataReporting Module
- Sync up on code review for this architecture (sumana)
- Work with Andrew Otto to figure out hosting on stat1
Wednesday August 1st
- Group chat with Brad Presner as per Sue's request
- PEF-1, verify data in logs, review analysis work, determine timeline, sync with Dario/Aaron here
- Timestamp mystery between click logs and rev table
- Ensure rev ids in logs are triggered by the edit generating feedback
- Find (filter) Steven's Sock Puppet
- look into dropped revs
Thursday August 2nd
- PEF-1
- Get list of registered users (user name, user id) since experiment deploy
- load additional log data
- (review with Dario) start to record data metrics about users (https://meta.wikimedia.org/wiki/Research:Edit_feedback#Overall_metrics)
- Pydocs & refactor
- DataReporting Module
- CategoryLoader Class
Friday August 3rd
- PEF-1
- Associated rev data with user data for "post-edit" events
- Began work on verifying user IDs
- Pydocs & refactor
- Fixed dependency issue on local machine with matplotlib
- Updated TableLoader class family
- Updated stat1 instance
- Updated docs on DataReporting class
Monday August 6th
- E3 Metrics
- Complete definition of EditRate metric in source
- Still need to test
- Meet with Dario on further work in defining metrics
- PEF-1
- Load additional log data
- verify user ids retrieved via rev id for "post-edit" events
- WikiSym Presentation
- Build outline
- Pydocs & refactor
- Add sample docs for update_rows and get_elem_from_nested_list methods (DataLoader module)
- Start docs on template data mining source (http://svn.wikimedia.org/viewvc/wikimedia/trunk/community-analysis/Metrics_enWiki/)
Tuesday August 7th
- Add documentation to meta for the PEF-1 clicktracking data definition (re: "[E3-team] Outstanding data analysis requirements for PEF")
- E3 Metrics
- Follow-up with Dario on changes
- PEF-1
- User verification
- continue analysis
- Follow up on Git repo request with ops
- Complete docs on DataReporting class
Tuesday August 7th
- PEF-1
- User verification
- Page ID verification
- continue analysis
- Follow up on Git repo request with ops
- Add notes about Brad Presner Panel for Sue (by 5pm)
Wednesday August 8th
- Add documentation to meta for the PEF-1 clicktracking data definition (re: "[E3-team] Outstanding data analysis requirements for PEF")
- Fix docs on DataReporting class - deploy to stat1
- PEF-1
- Get valid user ids
- Generate "Time to Milestone" and "k-n retention" metrics once user IDs are obtained
- Post results on group comparison
Thursday August 9th
- E3 Metrics
- Follow-up with Dario on changes
- Meeting with Howie - 3-4pm - demo Python metrics
- Wrote TimeTothreshold Metric Class
- PEF-1
- Generate Time to Milestone metric (MySQL)
- Generate Retention Milestone (MySQL) -- leave this for now. Generate retroactively at a later date
- Export data to tsv (Python)
Friday August 10th
- PEF-1
- Analysis, Import to R, model edit counts, time to milestone and retention metrics (R)
- WikiSym Presentation
- Review Steven & Maryana's Wikimania presentation
- Flesh out outline
- Add two new experiment ideas to E3 homepage - Article Stats, Bi-Directional sourcing
- Time permitting, start having a look at ACUX data, sync up on this with Steven
Monday August 13th
- Thoughtworks workshop
Tuesday August 14th
- Thoughtworks workshop
- WikiSym Presentation Prep (evening)
Wednesday August 15th
- Thoughtworks workshop
- Import the remainder of the PEF-1 data
- WikiSym Presentation Prep (evening)
Thursday August 16th
- Discuss WikiSym Panel With Dario
- WikiSym Presentation Prep (Dry run)
Friday August 17th
- WikiSym Presentation Prep
- Rounding off PEF-1 analysis stuff
- Assist Ori and Maryana with Community Portal data generation
- Flight to Barcelona (2:45PM)