{"file_date_updated":"2018-03-21T10:29:18Z","abstract":[{"lang":"eng","text":"We report on the construction of the Wikidata Vandalism Corpus WDVC-2015, the first corpus for vandalism in knowledge bases. Our corpus is based on the entire revision history of Wikidata, the knowledge base underlying Wikipedia. Among Wikidata's 24 million manual revisions, we have identified more than 100,000 cases of vandalism. An in-depth corpus analysis lays the groundwork for research and development on automatic vandalism detection in public knowledge bases. Our analysis shows that 58% of the vandalism revisions can be found in the textual portions of Wikidata, and the remainder in structural content, e.g., subject-predicate-object triples. Moreover, we find that some vandals also target Wikidata content whose manipulation may impact content displayed on Wikipedia, revealing potential vulnerabilities. Given today's importance of knowledge bases for information systems, this shows that public knowledge bases must be used with caution."}],"title":"Towards Vandalism Detection in Knowledge Bases: Corpus Construction and Analysis","status":"public","doi":"10.1145/2766462.2767804","_id":"239","ddc":["040"],"page":"831--834","type":"conference","year":"2015","department":[{"_id":"66"}],"publication":"Proceedings of the 38th International ACM Conference on Research and Development in Information Retrieval (SIGIR 15)","author":[{"full_name":"Heindorf, Stefan","last_name":"Heindorf","first_name":"Stefan"},{"last_name":"Potthast","full_name":"Potthast, Martin","first_name":"Martin"},{"full_name":"Stein, Benno","last_name":"Stein","first_name":"Benno"},{"last_name":"Engels","id":"107","full_name":"Engels, Gregor","first_name":"Gregor"}],"citation":{"mla":"Heindorf, Stefan, et al. “Towards Vandalism Detection in Knowledge Bases: Corpus Construction and Analysis.” Proceedings of the 38th International ACM Conference on Research and Development in Information Retrieval (SIGIR 15), 2015, pp. 831--834, doi:10.1145/2766462.2767804.","apa":"Heindorf, S., Potthast, M., Stein, B., & Engels, G. (2015). Towards Vandalism Detection in Knowledge Bases: Corpus Construction and Analysis. In Proceedings of the 38th International ACM Conference on Research and Development in Information Retrieval (SIGIR 15) (pp. 831--834). https://doi.org/10.1145/2766462.2767804","short":"S. Heindorf, M. Potthast, B. Stein, G. Engels, in: Proceedings of the 38th International ACM Conference on Research and Development in Information Retrieval (SIGIR 15), 2015, pp. 831--834.","bibtex":"@inproceedings{Heindorf_Potthast_Stein_Engels_2015, title={Towards Vandalism Detection in Knowledge Bases: Corpus Construction and Analysis}, DOI={10.1145/2766462.2767804}, booktitle={Proceedings of the 38th International ACM Conference on Research and Development in Information Retrieval (SIGIR 15)}, author={Heindorf, Stefan and Potthast, Martin and Stein, Benno and Engels, Gregor}, year={2015}, pages={831--834} }","ama":"Heindorf S, Potthast M, Stein B, Engels G. Towards Vandalism Detection in Knowledge Bases: Corpus Construction and Analysis. In: Proceedings of the 38th International ACM Conference on Research and Development in Information Retrieval (SIGIR 15). ; 2015:831--834. doi:10.1145/2766462.2767804","chicago":"Heindorf, Stefan, Martin Potthast, Benno Stein, and Gregor Engels. “Towards Vandalism Detection in Knowledge Bases: Corpus Construction and Analysis.” In Proceedings of the 38th International ACM Conference on Research and Development in Information Retrieval (SIGIR 15), 831--834, 2015. https://doi.org/10.1145/2766462.2767804.","ieee":"S. Heindorf, M. Potthast, B. Stein, and G. Engels, “Towards Vandalism Detection in Knowledge Bases: Corpus Construction and Analysis,” in Proceedings of the 38th International ACM Conference on Research and Development in Information Retrieval (SIGIR 15), 2015, pp. 831--834."},"user_id":"477","date_created":"2017-10-17T12:41:38Z","language":[{"iso":"eng"}],"file":[{"file_size":735898,"access_level":"closed","file_id":"1499","date_created":"2018-03-21T10:29:18Z","success":1,"creator":"florida","content_type":"application/pdf","date_updated":"2018-03-21T10:29:18Z","relation":"main_file","file_name":"239-p831-heindorf.pdf"}],"has_accepted_license":"1","project":[{"_id":"1","name":"SFB 901"},{"_id":"17","name":"SFB 901 - Subprojekt C5"},{"_id":"4","name":"SFB 901 - Project Area C"}],"date_updated":"2022-01-06T06:56:04Z"}