--- res: bibo_abstract: - This paper presents a system that uses the domain name of a German business website to locate its information pages (e.g. company profile, contact page, imprint) and then identifies business specific information. We therefore concentrate on the extraction of characteristic vocabulary like company names, addresses, contact details, CEOs, etc. Above all, we interpret the HTML structure of documents and analyze some contextual facts to transform the unstructured web pages into structured forms. Our approach is quite robust in variability of the DOM, upgradeable and keeps data up-to-date. The evaluation experiments show high efficiency of information access to the generated data. Hence, the developed technique is adaptive to non-German websites with slight language-specific modifications, and experimental results on real-life websites confirm the feasibility of the approach.@eng bibo_authorlist: - foaf_Person: foaf_givenName: Yeong Su foaf_name: Lee, Yeong Su foaf_surname: Lee - foaf_Person: foaf_givenName: Michaela foaf_name: Geierhos, Michaela foaf_surname: Geierhos foaf_workInfoHomepage: http://www.librecat.org/personId=42496 orcid: 0000-0002-8180-5606 dct_date: 2009^xs_gYear dct_isPartOf: - http://id.crossref.org/issn/0929-0672 dct_language: eng dct_publisher: Centre for Telematics and Information Technology (CTIT), University of Twente@ dct_subject: - company search - information extraction - sublanguage dct_title: Business Specific Online Information Extraction from German Websites@ ...