Jump to content
Wikimedia Meta-Wiki

Machine learning models/Production/Language-agnostic revert risk: Difference between revisions

From Meta, a Wikimedia project coordination wiki
Content deleted Content added
Line 17: Line 17:
| interface =
| interface =


<!-- [optional] Where can users go to see this model's past performance(削除) ? (削除ここまで) Please use (削除) wikilinks (削除ここまで) if possible. -->
<!-- [optional] Where can users go to see this model's past performance Please use (追記) wikilins (追記ここまで) if possible. -->
| past_performance =
| past_performance =


Line 29: Line 29:
| pii =
| pii =



<!-- [required] Is this model in production (i.e. currently hosted by the WMF ML Team)? -->
| production =
| production =


Line 37: Line 37:
<!-- [required] Single sentence description of the model -->
<!-- [required] Single sentence description of the model -->
| schema =
| schema =
This model uses revision (削除) content (削除ここまで) and metadata to predict the risk of being reverted.
This model uses revision (追記) con unotent (追記ここまで) and metadata to predict the risk of being reverted.
}}
}}


Line 61: Line 61:
Previous models had tried to solve this by creating language-specific solutions, however, that approach is difficult to escalate and maintain, because it requires as many models as languages used on the Wikimedia projects. Moreover, complex-language models are just available in certain languages, leaving out smaller Wikipedia editions. Therefore, this model is based on Language Agnostic features, making it possible to use it for any existing Wikipedia, and for new language projects that can appear in the future.
Previous models had tried to solve this by creating language-specific solutions, however, that approach is difficult to escalate and maintain, because it requires as many models as languages used on the Wikimedia projects. Moreover, complex-language models are just available in certain languages, leaving out smaller Wikipedia editions. Therefore, this model is based on Language Agnostic features, making it possible to use it for any existing Wikipedia, and for new language projects that can appear in the future.


This model was trained using the two tables from the [(削除) [wikitech (削除ここまで):Analytics/Data_Lake(削除) | (削除ここまで)Wikimedia Data Lake(削除) ] (削除ここまで)]: The [[wikitech:Analytics/Data_Lake/Edits/MediaWiki_history|MediaWiki History]] , and the [[wikitech:Analytics/Data_Lake/Content/Mediawiki_wikitext_history|Wikitext History]] one. Meta-data was extracted for the former, and other features such as number of references, images and wikilinks, were extracted from the latter.
This model was trained using the two tables from the [(追記) ./Https (追記ここまで):(追記) //wikitech.wikimedia.org/wiki/ (追記ここまで)Analytics/Data_Lake(追記) (追記ここまで)Wikimedia Data Lake]: The [[wikitech:Analytics/Data_Lake/Edits/MediaWiki_history|MediaWiki History]] , and the [[wikitech:Analytics/Data_Lake/Content/Mediawiki_wikitext_history|Wikitext History]] one. Meta-data was extracted for the former, and other features such as number of references, images and wikilinks, were extracted from the latter.





Revision as of 06:48, 20 November 2024

This model card page currently has a draft status. It is a piece of model documentation that is in the process of being written. Once the model card is completed, this template should be removed.
Model card
This page is an on-wiki machine learning model card.
A diagram of a neural network
A model card is a document about a machine learning model that seeks to answer basic questions about the model.
Model Information Hub
This model uses revision con unotent and metadata to predict the risk of being reverted.


How can we help editors to identify revisions that need to be "patrolled"? The goal of this model is to detect revisions that might be reverted, independently of whether they were made in good faith or with the intention of creating damage.

Patrolling content in more than 250+ Wikipedia projects is a difficult task. The amount of revisions plus the different languages involved requires a complex human effort. The aim of this model is to help patrollers quickly identify potential problems and revert damaging edits when needed.

Previous models had tried to solve this by creating language-specific solutions, however, that approach is difficult to escalate and maintain, because it requires as many models as languages used on the Wikimedia projects. Moreover, complex-language models are just available in certain languages, leaving out smaller Wikipedia editions. Therefore, this model is based on Language Agnostic features, making it possible to use it for any existing Wikipedia, and for new language projects that can appear in the future.

This model was trained using the two tables from the [./Https://wikitech.wikimedia.org/wiki/Analytics/Data_Lake Wikimedia Data Lake]: The MediaWiki History , and the Wikitext History one. Meta-data was extracted for the former, and other features such as number of references, images and wikilinks, were extracted from the latter.


This model is deployed on LiftWing. Right now, it is available for internal usage. Technical details can be found here. This model can be used to detect revisions that might need to be reverted. A high "revert probability" output (over .9) would provide good precision, while lower threshold (0.5) would provide recall. This model should be used just for Wikipedia Articles (namespace 0), its features won't work outside Wikipedia.

Motivation

Knowledge Integrity is one of the strategic programs of Wikimedia Research with the goal of identifying and addressing threats to content on Wikipedia, increasing the capabilities of patrollers, and providing mechanisms for assessing the reliability of sources[1] . The main goal of the project is to create a new generation of patrolling models, improving accuracy, fairness, and maintainability compared to previous state-of-the-art ORES[2] .

The current model is completely language agnostic and can run in any Wikipedia language edition.

Supported languages

['aa', 'ab', 'ace', 'ady', 'af', 'ak', 'als', 'alt', 'am', 'ami', 'an', 'ang', 'anp', 'ar', 'arc', 'ary', 'arz', 'as', 'ast', 'atj', 'av', 'avk', 'awa', 'ay', 'az', 'azb', 'ba', 'ban', 'bar', 'bat-smg', 'bcl', 'be', 'be-tarask', 'be-x-old', 'bg', 'bh', 'bi', 'bjn', 'blk', 'bm', 'bn', 'bo', 'bpy', 'br', 'bs', 'bug', 'bxr', 'ca', 'cbk-zam', 'cdo', 'ce', 'ceb', 'ch', 'cho', 'chr', 'chy', 'ckb', 'co', 'cr', 'crh', 'cs', 'csb', 'cu', 'cv', 'cy', 'da', 'dag', 'de', 'din', 'diq', 'dsb', 'dty', 'dv', 'dz', 'ee', 'el', 'eml', 'en', 'eo', 'es', 'et', 'eu', 'ext', 'fa', 'fat', 'ff', 'fi', 'fiu-vro', 'fj', 'fo', 'fr', 'frp', 'frr', 'fur', 'fy', 'ga', 'gag', 'gan', 'gcr', 'gd', 'gl', 'glk', 'gn', 'gom', 'gor', 'got', 'gpe', 'gsw', 'gu', 'guc', 'gur', 'guw', 'gv', 'ha', 'hak', 'haw', 'he', 'hi', 'hif', 'ho', 'hr', 'hsb', 'ht', 'hu', 'hy', 'hyw', 'hz', 'ia', 'id', 'ie', 'ig', 'ii', 'ik', 'ilo', 'inh', 'io', 'is', 'it', 'iu', 'ja', 'jam', 'jbo', 'jv', 'ka', 'kaa', 'kab', 'kbd', 'kbp', 'kcg', 'kg', 'ki', 'kj', 'kk', 'kl', 'km', 'kn', 'ko', 'koi', 'kr', 'krc', 'ks', 'ksh', 'ku', 'kv', 'kw', 'ky', 'la', 'lad', 'lb', 'lbe', 'lez', 'lfn', 'lg', 'li', 'lij', 'lld', 'lmo', 'ln', 'lo', 'lrc', 'lt', 'ltg', 'lv', 'lzh', 'mad', 'mai', 'map-bms', 'mdf', 'mg', 'mh', 'mhr', 'mi', 'min', 'mk', 'ml', 'mn', 'mni', 'mnw', 'mr', 'mrj', 'ms', 'mt', 'mus', 'mwl', 'my', 'myv', 'mzn', 'na', 'nah', 'nan', 'nap', 'nds', 'nds-nl', 'ne', 'new', 'ng', 'nia', 'nl', 'nn', 'no', 'nostalgia', 'nov', 'nqo', 'nrm', 'nso', 'nv', 'ny', 'oc', 'olo', 'om', 'or', 'os', 'pa', 'pag', 'pam', 'pap', 'pcd', 'pcm', 'pdc', 'pfl', 'pi', 'pih', 'pl', 'pms', 'pnb', 'pnt', 'ps', 'pt', 'pwn', 'qu', 'rm', 'rmy', 'rn', 'ro', 'roa-rup', 'roa-tara', 'ru', 'rue', 'rup', 'rw', 'sa', 'sah', 'sat', 'sc', 'scn', 'sco', 'sd', 'se', 'sg', 'sgs', 'sh', 'shi', 'shn', 'si', 'simple', 'sk', 'skr', 'sl', 'sm', 'smn', 'sn', 'so', 'sq', 'sr', 'srn', 'ss', 'st', 'stq', 'su', 'sv', 'sw', 'szl', 'szy', 'ta', 'tay', 'tcy', 'te', 'test', 'test2', 'tet', 'tg', 'th', 'ti', 'tk', 'tl', 'tly', 'tn', 'to', 'tpi', 'tr', 'trv', 'ts', 'tt', 'tum', 'tw', 'ty', 'tyv', 'udm', 'ug', 'uk', 'ur', 'uz', 've', 'vec', 'vep', 'vi', 'vls', 'vo', 'vro', 'wa', 'war', 'wo', 'wuu', 'xal', 'xh', 'xmf', 'yi', 'yo', 'yue', 'za', 'zea', 'zh', 'zh-classical', 'zh-min-nan', 'zh-yue', 'zu']

Users and uses

Use this model for
  • Automatically find revisions that requires patrolling.
  • Vandalism detection.
  • Create bots for assisting admins and patrollers to remove vandalism or non good-faith edits.
Don't use this model for
  • As ground-truth for training other models
  • Making predictions on projects other than Wikipedia language editions
  • Making predictions on the first revision of a page or a revision that is the only one for a page.
Current uses

Ethical considerations, caveats, and recommendations

The model is built using meta features that take into account user characteristics. However, it may exhibit bias against edits from new users, temporary accounts, or IP edits due to past experiences. To address this issue, we have developed an alternative Revert Risk Multilingual Model (RRML) that specifically mitigates such biases, however that model requires more processing power and might be slow. Therefore, for anonymous edits - on the 47 languages covered - we recommend using the RRML. For the reminding edits (non-anonymous or not covered by RRML), we recommend to use this model.

Model

This model uses this set of features:

  • Article features:
    • We used the features developed for the Language Agnostic Article Quality Model.
    • We computed the article quality features for the current and parent revision.
    • We measured the quality differences between these revisions.
  • User features:
    • Account "age" (difference between revision timestamp and the user creation date)
    • Number of previous revisions made.
    • Number of users groups.


Performance

Implementation

Model architecture

The model is build using the XgBoost library.

The detailed model training procedure and configuration can be found in this repository.
Output schema
{
 model_name: <model name string>
 model_version: <model version string>
 wiki_db: <wiki code string>,
 revision_id: <revision_id string>,
 output: {
 prediction: <boolean decision result>
 probabilities: {
 true: <probability of being reverted>,
 false: <probability of being NOT reverted>
 }
}
Example input and output

Example input:

curl https://api.wikimedia.org/service/lw/inference/v1/models/revertrisk-language-agnostic:predict -X POST -d '{"rev_id": 1215916855, "lang":"en"}'

Example output:

{
 "model_name":"revertrisk-language-agnostic",
 "model_version":"2",
 "wiki_db":"enwiki",
 "revision_id":123855516,
 "output":{
 "prediction":true,
 "probabilities":{
 "true":0.6868777275085449,
 "false":0.3131222724914551
 }
 }
}

Data

Data pipeline

The model was trained on a dataset collected using the two tables from the Wikimedia Data Lake. We used the MediaWiki History table, and the Wikitext History one. Snapshot dated 2023-05 was used with the observation period from 2022年01月01日 to 2023年01月01日 (12 months). We filtered out revisions created by bots. We used the 70% of the data for training, and 30% testing, using a random split.

The data collection process can by found on this repository.
Training data
We randomly selected the 70% of the data mentioned above.
Test data
We used the remaining 30%.

Licenses


Citation

Cite this model as:

@inproceedings{trokhymovych2023fair,
title={Fair multilingual vandalism detection system for Wikipedia},
author={Trokhymovych, Mykola and Aslam, Muniza and Chou, Ai-Jou and Baeza-Yates, Ricardo and Saez-Trumper, Diego},
booktitle={Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining},
pages={4981--4990},
year={2023}
}

Notice this paper also covers the Multilingual Revert Risk Model.

References

  1. Zia, Leila and Johnson, Isaac and Mansurov, Bahodir and Morgan, Jonathan and Redi, Miriam and Saez-Trumper, Diego and Taraborelli, Dario. 2019. Knowledge Integrity. https://doi.org/10.6084/m9.figshare.7704626
  2. https://www.mediawiki.org/wiki/ORES

AltStyle によって変換されたページ (->オリジナル) /