Efficiency-conscious propositionalization for relational learning
Kybernetika (2004)
- Volume: 40, Issue: 3, page [275]-292
- ISSN: 0023-5954
Access Full Article
topAbstract
topHow to cite
topŽelezný, Filip. "Efficiency-conscious propositionalization for relational learning." Kybernetika 40.3 (2004): [275]-292. <http://eudml.org/doc/33700>.
@article{Železný2004,
abstract = {Systems aiming at discovering interesting knowledge in data, now commonly called data mining systems, are typically employed in finding patterns in a single relational table. Most of mainstream data mining tools are not applicable in the more challenging task of finding knowledge in structured data represented by a multi-relational database. Although a family of methods known as inductive logic programming have been developed to tackle that challenge by immediate means, the idea of adapting structured data into a simpler form digestible by the wealth of AVL systems has been always tempting to data miners. To this end, we present a method based on constructing first-order logic features that conducts this kind of conversion, also known as propositionalization. It incorporates some basic principles suggested in previous research and provides significant enhancements that lead to remarkable improvements in efficiency of the feature-construction process. We begin by motivating the propositionalization task with an illustrative example, review some previous approaches to propositionalization, and formalize the concept of a first-order feature elaborating mainly the points that influence the efficiency of the designed feature-construction algorithm.},
author = {Železný, Filip},
journal = {Kybernetika},
keywords = {machine learning; inductive logic programming; propositionalization; machine learning; inductive logic programming; propositionalization},
language = {eng},
number = {3},
pages = {[275]-292},
publisher = {Institute of Information Theory and Automation AS CR},
title = {Efficiency-conscious propositionalization for relational learning},
url = {http://eudml.org/doc/33700},
volume = {40},
year = {2004},
}
TY - JOUR
AU - Železný, Filip
TI - Efficiency-conscious propositionalization for relational learning
JO - Kybernetika
PY - 2004
PB - Institute of Information Theory and Automation AS CR
VL - 40
IS - 3
SP - [275]
EP - 292
AB - Systems aiming at discovering interesting knowledge in data, now commonly called data mining systems, are typically employed in finding patterns in a single relational table. Most of mainstream data mining tools are not applicable in the more challenging task of finding knowledge in structured data represented by a multi-relational database. Although a family of methods known as inductive logic programming have been developed to tackle that challenge by immediate means, the idea of adapting structured data into a simpler form digestible by the wealth of AVL systems has been always tempting to data miners. To this end, we present a method based on constructing first-order logic features that conducts this kind of conversion, also known as propositionalization. It incorporates some basic principles suggested in previous research and provides significant enhancements that lead to remarkable improvements in efficiency of the feature-construction process. We begin by motivating the propositionalization task with an illustrative example, review some previous approaches to propositionalization, and formalize the concept of a first-order feature elaborating mainly the points that influence the efficiency of the designed feature-construction algorithm.
LA - eng
KW - machine learning; inductive logic programming; propositionalization; machine learning; inductive logic programming; propositionalization
UR - http://eudml.org/doc/33700
ER -
References
top- Agrawal R., Srikant R., Fast algorithms for mining association rules, In: Proc. 20th Internat. Conference Very Large Data Bases, VLDB, Morgan Kaufmann, xxxxxxx 1994 pp. 487–499 (1994)
- Alphonse E., Rouveirol C., Lazy propositionalization for relational learning, In: Proc. 14th European Conference on Artificial Intelligence (ECAI’2000) (W. Horn, ed.), IOS Press 2000, pp. 256–260
- Blatǎk J., Popelínský L., Feature construction with RAP, In: Proc. of the Work-in-Progress Track at the 13th Internat. Conference on Inductive Logic Programming. University of Szeged 2003
- Clark P., Niblett T., 10.1007/BF00116835, Mach. Learning 3 (1989), 261–283 (1989) DOI10.1007/BF00116835
- Džeroski S., Numerical constraints and learnability in inductive logic programming, Ph.D. Thesis. Faculty of Electrical Engineering and Computer Science, University of Ljubljana 1995
- Džeroski S., (eds.) N. Lavrač, Relational Data Mining, Springer–Verlag, Berlin 2001 Zbl1003.68039
- Emde W., Wettschereck D., Relational instance based learning, In: Machine Learning – Proc. 13th Internat. Conference on Machine Learning, Morgan Kaufmann, xxxxxxx 1996, pp. 122–130 (1996)
- Hájek P., Mechanizing Hypothesis Formation, Springer–Verlag, Berlin 1966 Zbl0371.02002
- Kietz J. U., Some lower bounds for the computational complexity of inductive logic programming, In: Machine Learning: ECML-93, Proceedings of the European Conference on Machine Learning, volume 667, Springer–Verlag, Berlin 1993, pp. 115–123 (1993) MR1235394
- Knobbe A. J., Haas, M. de, Siebes A., Propositionalisation and aggregates, In: Proc. Fifth European Conference on Principles of Data Mining and Knowledge Disovery (PKDD). Springer–Verlag, Berlin 2001 Zbl1009.68749
- Kramer S., Lavrač, N., Flach P. A., Propositionalization Approaches to relational data mining, In: Relational Data Mining (N. Lavrač and S. Džeroski, eds.), Springer–Verlag, Berlin 2001
- Krogel M. A., Rawles S., Železný F., Flach P. A., Lavrač, N., Wrobel S., Comparative evaluation of approaches to propositionalization, In: Proc. 13th Internat. Conference on Inductive Logic Programming. Springer–Verlag, Berlin 2003
- Krogel M. A., Wrobel S., Transformation-based learning using multirelational aggregation, In: Proc. 11th Internat. Conference on Inductive Logic Programming (ILP), Springer–Verlag, Berlin 2001, pp. 142–155 Zbl1006.68519
- Lavrač N., Flach P. A., 10.1145/383779.383781, ACM Trans. Comput. Logic 2 (2001), 4, 458–494 DOI10.1145/383779.383781
- Lavrač N., Džeroski S., Inductive Logic Programming: Techniques and Applications, Ellis Horwood, 1993 Zbl0830.68027
- Lavrač N., Železný, F., Flach P. A., RSD: Relational subgroup discovery through first-order feature construction, In: Proc. 12th Internat. Conference on Inductive Logic Programming (ILP). Springer–Verlag, Berlin 2002 Zbl1017.68523
- Liu H., Motoda H., Feature Selection for Knowledge Discovery and Data Mining, Kluwer, Dordrecht 1998 Zbl0908.68127
- Maloberti J., Sebag M., Theta-subsumption in a constraint satisfaction perspective, In: Proc. 11th Internat. Conference on Inductive Logic Programming (ILP) (Lectures Notes in Artificial Intelligence 2157), Springer–Verlag, Berlin 2001, pp. 164–178 Zbl1006.68517MR1906956
- Muggleton S., 10.1007/BF03037227, New Generation Computing, Special issue on Inductive Logic Programming 13 (1995), 3–4, 245–286 (1995) DOI10.1007/BF03037227
- Pfahringer B., Holmes G., Propositionalization through stochastic discrimination, In: Proc. of the Work-in-Progress Track at the 13th Internat. Conference on Inductive Logic Programming. University of Szeged 2003
- Quinlan J. Ross, C4, 5: Programs for Machine Learning. Morgan Kaufmann, xxxxxxx 1992
- Sebag M., Rouveirol C., Tractable induction and classification in first-order logic via stochastic matching, In: Proc. 15th Internat. Joint Conference on Artificial Intelligence, Morgan Kaufmann, xxxxxxx 1997, pp. 888–893 (1997)
- Srinivasan A., Muggleton S. H., Sternberg M. J. E., King R. D., 10.1016/0004-3702(95)00122-0, Artificial Intelligence 85 (1996), 1, 2, 277–299 (1996) DOI10.1016/0004-3702(95)00122-0
- Witten I. H., Frank E., Trigg L., Hall M., Holmes, G., Cunningham, Sally Jo, Weka: Practical Machine Learning Tools and Techniques with Java Implementations, Morgan Kaufmann, xxxxxxx 1999
- Zucker J. D., Ganascia J. G., Representation changes for efficient learning in structural domains, In: Internat. Conference on Machine Learning 1996, pp. 543–551 (1996)
- Železný F., Lavrač, N., Džeroski S., Constraint-based relational subgroup discovery, In: Proc. Multi-Relational Data Mining Workshop at KDD 2003, Washington 2003
NotesEmbed ?
topTo embed these notes on your page include the following JavaScript code on your page where you want the notes to appear.