---
_id: '4349'
abstract:
- lang: eng
  text: Physician Review Websites allow users to evaluate their experiences with health
    services. As these evaluations are regularly contextualized with facts from users’
    private lives, they often accidentally disclose personal information on the Web.
    This poses a serious threat to users’ privacy. In this paper, we report on early
    work in progress on “Text Broom”, a tool to detect privacy breaches in user-generated
    texts. For this purpose, we conceptualize a pipeline which combines methods of
    Natural Language Processing such as Named Entity Recognition, linguistic patterns
    and domain-specific Machine Learning approaches which have the potential to recognize
    privacy violations with wide coverage. A prototypical web application is openly
    accesible.
article_number: '97'
author:
- first_name: Frederik Simon
  full_name: Bäumer, Frederik Simon
  id: '38837'
  last_name: Bäumer
- first_name: Joschka
  full_name: Kersting, Joschka
  id: '58701'
  last_name: Kersting
- first_name: Matthias
  full_name: Orlikowski, Matthias
  id: '72334'
  last_name: Orlikowski
- first_name: Michaela
  full_name: Geierhos, Michaela
  id: '42496'
  last_name: Geierhos
  orcid: 0000-0002-8180-5606
citation:
  ama: 'Bäumer FS, Kersting J, Orlikowski M, Geierhos M. Towards a Multi-Stage Approach
    to Detect Privacy Breaches in Physician Reviews. In: Khalili A, Koutraki M, eds.
    <i>Proceedings of the Posters and Demos Track of the 14th International Conference
    on Semantic Systems Co-Located with the 14th International Conference on Semantic
    Systems (SEMANTiCS 2018)</i>. Vol 2198. CEUR Workshop Proceedings. CEUR-WS.org;
    2018.'
  apa: 'Bäumer, F. S., Kersting, J., Orlikowski, M., &#38; Geierhos, M. (2018). Towards
    a Multi-Stage Approach to Detect Privacy Breaches in Physician Reviews. In A.
    Khalili &#38; M. Koutraki (Eds.), <i>Proceedings of the Posters and Demos Track
    of the 14th International Conference on Semantic Systems co-located with the 14th
    International Conference on Semantic Systems (SEMANTiCS 2018)</i> (Vol. 2198).
    Vienna, Austria: CEUR-WS.org.'
  bibtex: '@inproceedings{Bäumer_Kersting_Orlikowski_Geierhos_2018, series={CEUR Workshop
    Proceedings}, title={Towards a Multi-Stage Approach to Detect Privacy Breaches
    in Physician Reviews}, volume={2198}, number={97}, booktitle={Proceedings of the
    Posters and Demos Track of the 14th International Conference on Semantic Systems
    co-located with the 14th International Conference on Semantic Systems (SEMANTiCS
    2018)}, publisher={CEUR-WS.org}, author={Bäumer, Frederik Simon and Kersting,
    Joschka and Orlikowski, Matthias and Geierhos, Michaela}, editor={Khalili, Ali
    and Koutraki, MariaEditors}, year={2018}, collection={CEUR Workshop Proceedings}
    }'
  chicago: Bäumer, Frederik Simon, Joschka Kersting, Matthias Orlikowski, and Michaela
    Geierhos. “Towards a Multi-Stage Approach to Detect Privacy Breaches in Physician
    Reviews.” In <i>Proceedings of the Posters and Demos Track of the 14th International
    Conference on Semantic Systems Co-Located with the 14th International Conference
    on Semantic Systems (SEMANTiCS 2018)</i>, edited by Ali Khalili and Maria Koutraki,
    Vol. 2198. CEUR Workshop Proceedings. CEUR-WS.org, 2018.
  ieee: F. S. Bäumer, J. Kersting, M. Orlikowski, and M. Geierhos, “Towards a Multi-Stage
    Approach to Detect Privacy Breaches in Physician Reviews,” in <i>Proceedings of
    the Posters and Demos Track of the 14th International Conference on Semantic Systems
    co-located with the 14th International Conference on Semantic Systems (SEMANTiCS
    2018)</i>, Vienna, Austria, 2018, vol. 2198.
  mla: Bäumer, Frederik Simon, et al. “Towards a Multi-Stage Approach to Detect Privacy
    Breaches in Physician Reviews.” <i>Proceedings of the Posters and Demos Track
    of the 14th International Conference on Semantic Systems Co-Located with the 14th
    International Conference on Semantic Systems (SEMANTiCS 2018)</i>, edited by Ali
    Khalili and Maria Koutraki, vol. 2198, 97, CEUR-WS.org, 2018.
  short: 'F.S. Bäumer, J. Kersting, M. Orlikowski, M. Geierhos, in: A. Khalili, M.
    Koutraki (Eds.), Proceedings of the Posters and Demos Track of the 14th International
    Conference on Semantic Systems Co-Located with the 14th International Conference
    on Semantic Systems (SEMANTiCS 2018), CEUR-WS.org, 2018.'
conference:
  end_date: 2018-09-13
  location: Vienna, Austria
  name: 14th International Conference on Semantic Systems (SEMANTiCS 2018)
  start_date: 2018-09-10
date_created: 2018-09-03T12:04:24Z
date_updated: 2022-01-06T07:00:57Z
ddc:
- '000'
department:
- _id: '36'
- _id: '1'
- _id: '579'
editor:
- first_name: Ali
  full_name: Khalili, Ali
  last_name: Khalili
- first_name: Maria
  full_name: Koutraki, Maria
  last_name: Koutraki
file:
- access_level: closed
  content_type: application/pdf
  creator: jkers
  date_created: 2020-09-18T09:34:30Z
  date_updated: 2020-09-18T09:34:30Z
  file_id: '19579'
  file_name: Bäumer et al. (2018), Baeumer2018c.pdf
  file_size: 434664
  relation: main_file
  success: 1
file_date_updated: 2020-09-18T09:34:30Z
has_accepted_license: '1'
intvolume: '      2198'
keyword:
- Detection of Privacy Violations
- Physician Reviews
language:
- iso: eng
main_file_link:
- open_access: '1'
  url: http://ceur-ws.org/Vol-2198/paper_97.pdf
oa: '1'
publication: Proceedings of the Posters and Demos Track of the 14th International
  Conference on Semantic Systems co-located with the 14th International Conference
  on Semantic Systems (SEMANTiCS 2018)
publication_identifier:
  issn:
  - 1613-0073
publication_status: published
publisher: CEUR-WS.org
series_title: CEUR Workshop Proceedings
status: public
title: Towards a Multi-Stage Approach to Detect Privacy Breaches in Physician Reviews
type: conference
user_id: '58701'
volume: 2198
year: '2018'
...
