Machine learning models/Production/Polish Wikipedia damaging edit
Model card | |
---|---|
This page is an on-wiki machine learning model card. | |
Model Information Hub | |
Model creator(s) | Aaron Halfaker (User:EpochFail) and Amir Sarabadani |
Model owner(s) | WMF Machine Learning Team (ml@wikimediafoundation.org) |
Model interface | Ores homepage |
Code | ORES Github, ORES training data, and ORES model binaries |
Uses PII | No |
In production? | Yes |
Which projects? | Polish Wikipedia |
This model uses data about a revision to predict the likelihood that the revision is damaging. | |
Motivation
[edit]Some goodfaith edits are damaging to an article, and not all damaging edits are in bad faith. This model (together with a goodfaith model) is intended to differentiate between edits that are intentionally harmful (badfaith/vandalism) and edits that are intended to be harmful (good edits/goodfaith damage).
This model helps to prioritize review of potentially damaging edits or vandalism. It provides a prediction on whether or not a given revision is damaging, and provides some probabilities to serve as a measure of its confidence level.
Users and uses
[edit]- This model should be used for prioritizing the review and potential reversion of vandalism on Polish Wikipedia.
- This model should be used for detecting damaging contributions by editors on Polish Wikipedia.
- This model should not be used as an ultimate arbiter of whether or not an edit ought to be considered damaging.
- The model should not be used outside of Polish Wikipedia.
- Polish Wikipedia uses the model as a service for facilitating efficient vandalism triage, edit reviews, or newcomer support.
- On an individual basis, anyone can submit a properly-formatted API call to ORES for a given revision and get back the result of this model.
https://ores.wikimedia.org/v3/scores/plwiki/1234/damaging
Ethical considerations, caveats, and recommendations
[edit]Polish Wikipedia decided to use this model. Over time, the model has been validated through use in the community.
This model is known to give newer editors higher probability of damaging edits.
Internal or external changes that could make this model deprecated or no longer usable are:
- Data drift means training data for the model is no longer usable.
- Doesn't meet desired performance metrics in production.
- Polish Wikipedia community decides to not use this model anymore.
Model
[edit]Performance
[edit]Test data confusion matrix:
Label | n | ~True | ~False |
---|---|---|---|
True | 290 | 121 | 169 |
False | 4482 | 163 | 4319 |
Test data sample rates:
Rate | Sample | Population |
---|---|---|
sample | 0.061 | 0.939 |
population | 0.027 | 0.973 |
Test data performance:
Statistic | True | False |
---|---|---|
match_rate | 0.047 | 0.953 |
filter_rate | 0.953 | 0.047 |
recall | 0.417 | 0.964 |
precision | 0.243 | 0.983 |
f1 | 0.307 | 0.973 |
accuracy | 0.949 | 0.949 |
fpr | 0.036 | 0.583 |
roc_auc | 0.843 | 0.843 |
pr_auc | 0.259 | 0.994 |
Implementation
[edit]{
"type": "GradientBoosting",
"params": {
"scale": true,
"center": true,
"labels": [
true,
false
],
"multilabel": false,
"population_rates": null,
"ccp_alpha": 0.0,
"criterion": "friedman_mse",
"init": null,
"learning_rate": 0.01,
"loss": "deviance",
"max_depth": 7,
"max_features": "log2",
"max_leaf_nodes": null,
"min_impurity_decrease": 0.0,
"min_samples_leaf": 1,
"min_samples_split": 2,
"min_weight_fraction_leaf": 0.0,
"n_estimators": 700,
"n_iter_no_change": null,
"random_state": null,
"subsample": 1.0,
"tol": 0.0001,
"validation_fraction": 0.1,
"verbose": 0,
"warm_start": false
}
}
{
"title": "Scikit learn-based classifier score with probability",
"type": "object",
"properties": {
"prediction": {
"description": "The most likely label predicted by the estimator",
"type": "boolean"
},
"probability": {
"description": "A mapping of probabilities onto each of the potential output labels",
"type": "object",
"properties": {
"true": {
"type": "number"
},
"false": {
"type": "number"
}
}
}
}
}
https://ores.wikimedia.org/v3/scores/plwiki/1234/damaging
Output:
{
"plwiki": {
"models": {
"damaging": {
"version": "0.5.0"
}
},
"scores": {
"1234": {
"damaging": {
"score": {
"prediction": false,
"probability": {
"false": 0.896515001391509,
"true": 0.10348499860849099
}
}
}
}
}
}
}
Data
[edit]Licenses
[edit]- Code: MIT license
- Model: MIT license
Citation
[edit]Cite this model card as:
@misc{
Triedman_Bazira_2023_Polish_Wikipedia_damaging,
title={ Polish Wikipedia damaging model card },
author={ Triedman, Harold and Bazira, Kevin },
year={ 2023 },
url={ https://meta.wikimedia.org/wiki/Machine_learning_models/Production/Polish_Wikipedia_damaging_edit }
}