A Proof-of-Concept of D³ Record Mining using Domain-Dependent Data

Y.S. Lee, M. Geierhos, S.-K. Song, H. Jung, in: C.-C. Chang, Y.E. Gelogo, R.E. Caytiles (Eds.), Software Technology: Prooceedings, International Conference, SoftTech 2012, Cebu, Philippines, May 2012, SERSC, Sandy Bay, Australia, 2012, pp. 134–139.

Conference Paper | Published | English
Author
Lee, Yeong Su; Geierhos, MichaelaLibreCat ; Song, Sa-Kwang; Jung, Hanmin
Editor
Chang, Chin-Chen; Gelogo, Yvette E.; Caytiles, Ronnie E.
Abstract
Our purpose is to perform data record extraction from onlineevent calendars exploiting sublanguage and domain characteristics. We therefore use so-called domain-dependent data (D³) completely based on language-specific key expressions and HTML patterns to recognize every single event given on the investigated web page. One of the most remarkable advantages of our method is that it does not require any additional classification steps based on machine learning algorithms or keyword extraction methods; it is a so-called one-step mining technique. Moreover, another important criteria is that our system is robust to DOM and layout modifications made by web designers. Thus, preliminary experimental results are provided to demonstrate proof-of-concept of such an approach tested on websites in the German opera domain. Furthermore, we could show that our proposed technique outperforms other data record mining applications run on event sites.
Publishing Year
Proceedings Title
Software Technology: Prooceedings, International Conference, SoftTech 2012, Cebu, Philippines, May 2012
forms.conference.field.series_title_volume.label
Advanced Science and Technology Letters (ASTL)
Volume
5
Page
134-139
Conference
1st International Conference on Software Technology (SoftTech 2012)
Conference Location
Cebu, Philippines
Conference Date
2012-05-29 – 2012-05-31
ISSN
LibreCat-ID

Cite this

Lee YS, Geierhos M, Song S-K, Jung H. A Proof-of-Concept of D3 Record Mining using Domain-Dependent Data. In: Chang C-C, Gelogo YE, Caytiles RE, eds. Software Technology: Prooceedings, International Conference, SoftTech 2012, Cebu, Philippines, May 2012. Vol 5. Advanced Science and Technology Letters (ASTL). Sandy Bay, Australia: SERSC; 2012:134-139.
Lee, Y. S., Geierhos, M., Song, S.-K., & Jung, H. (2012). A Proof-of-Concept of D3 Record Mining using Domain-Dependent Data. In C.-C. Chang, Y. E. Gelogo, & R. E. Caytiles (Eds.), Software Technology: Prooceedings, International Conference, SoftTech 2012, Cebu, Philippines, May 2012 (Vol. 5, pp. 134–139). Sandy Bay, Australia: SERSC.
@inproceedings{Lee_Geierhos_Song_Jung_2012, place={Sandy Bay, Australia}, series={Advanced Science and Technology Letters (ASTL)}, title={A Proof-of-Concept of D3 Record Mining using Domain-Dependent Data}, volume={5}, booktitle={Software Technology: Prooceedings, International Conference, SoftTech 2012, Cebu, Philippines, May 2012}, publisher={SERSC}, author={Lee, Yeong Su and Geierhos, Michaela and Song, Sa-Kwang and Jung, Hanmin}, editor={Chang, Chin-Chen and Gelogo, Yvette E. and Caytiles, Ronnie E.Editors}, year={2012}, pages={134–139}, collection={Advanced Science and Technology Letters (ASTL)} }
Lee, Yeong Su, Michaela Geierhos, Sa-Kwang Song, and Hanmin Jung. “A Proof-of-Concept of D3 Record Mining Using Domain-Dependent Data.” In Software Technology: Prooceedings, International Conference, SoftTech 2012, Cebu, Philippines, May 2012, edited by Chin-Chen Chang, Yvette E. Gelogo, and Ronnie E. Caytiles, 5:134–39. Advanced Science and Technology Letters (ASTL). Sandy Bay, Australia: SERSC, 2012.
Y. S. Lee, M. Geierhos, S.-K. Song, and H. Jung, “A Proof-of-Concept of D3 Record Mining using Domain-Dependent Data,” in Software Technology: Prooceedings, International Conference, SoftTech 2012, Cebu, Philippines, May 2012, Cebu, Philippines, 2012, vol. 5, pp. 134–139.
Lee, Yeong Su, et al. “A Proof-of-Concept of D3 Record Mining Using Domain-Dependent Data.” Software Technology: Prooceedings, International Conference, SoftTech 2012, Cebu, Philippines, May 2012, edited by Chin-Chen Chang et al., vol. 5, SERSC, 2012, pp. 134–39.
All files available under the following license(s):
Copyright Statement:
This Item is protected by copyright and/or related rights. [...]

Link(s) to Main File(s)
Access Level
Restricted Closed Access

Export

Marked Publications

Open Data LibreCat

Search this title in

Google Scholar