Fuzzy querying: issues and perspectives

Janusz Kacprzyk; Gabriella Pasi; Peter Vojtáš; Sławomir Zadrożny

Kybernetika (2000)

  • Volume: 36, Issue: 6, page [605]-616
  • ISSN: 0023-5954

Abstract

The term query is widely used in both the database and the information retrieval communities. Basically, a query against a collection of information items (called below, for brevity, an information source) provides a formal description of the items of interest to the user posing the query. An information source is understood here very generally: it may be an archive of multimedia or textual documents, a database, or a knowledge base. In these three examples the information items are documents, records (rows in the relational data model) and facts, respectively. To manage and access an information source, an appropriate system is defined that makes it possible to store, represent and retrieve information items by means of a formal query language. The systems that manage the three kinds of information items mentioned above are information retrieval systems, database management systems and knowledge-based systems, respectively. Query languages of these systems usually refer to some features of the entities represented by the items stored in an information source, e.g., keywords (index terms) in textual documents (document archive), attributes (database) or arguments of facts (knowledge base). Thus, a query may be seen as a set of selection conditions that an information item (its features) must meet to qualify as relevant with respect to the query. Query processing, in turn, may be seen as consisting mainly of matching a query against the items of the information source. This process may be considerably more complex, e.g., in the case of knowledge bases, where a whole chain of matching steps occurs within the reasoning process. Often, a user faces the problem of how to express her or his information requirements in the formal query language supported by a given information system interface.
These formal languages usually require a crisp (precise, unambiguous) specification of a query, whereas for human beings a query is best expressed in natural language, a very powerful but ambiguous and imprecise medium. Thus, adding some flexibility to traditional querying systems seems to be a critical issue for enhancing their effectiveness and efficiency. In this paper, we discuss some recent advances and basic issues related to flexible querying based on the application of fuzzy logic. We focus on two areas corresponding to the type of information source under consideration: information retrieval, in which we primarily deal with archives of textual documents, and database querying. Both areas share an interest in fuzzy (linguistic) queries and flexible matching against items of information. However, they also have their specific features, and these are pointed out in the next sections. The third area, the querying of knowledge bases in a very broad sense, is dealt with in the paper by Peter Vojtáš in this special issue. Specifically, the concept of matching, essential for querying, may be identified to some extent with unification, and that paper discusses the issues related to fuzzy unification. The matching of fuzzy concepts, from a slightly different perspective, is also the subject of the paper by Andrejková in this issue. Another contribution relevant for the flexible querying of knowledge bases is the paper by Ch. Marsala in this issue. Moreover, besides its application to querying itself, the concept of flexibility is usually extended to the representation of the information to be queried. This is particularly evident in the area of information retrieval, in which concepts of fuzzy logic fit very well into advanced indexing schemes for text documents. In the case of database management systems, ideas based on fuzzy logic have led to the development of imprecise/vague data representation models.
These issues are also dealt with in the following sections. This paper is structured in two sections dealing with information retrieval and database querying, respectively. The paper is meant to provide a synthetic description of the research area covered by the papers appearing in this special issue of Kybernetika. The issue comprises extended versions of selected papers presented at the session on fuzzy querying at the FSTA’2000 Conference held in Liptovský Mikuláš (Slovak Republic) in the winter of 2000. We refer to the other papers in this issue, indicating their relevance for the topics discussed here.
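
The contrast the abstract draws between a crisp selection condition and a flexible (fuzzy) one can be illustrated with a minimal sketch. It is not taken from the paper: the `price` attribute, the thresholds 80, 100 and 120, the linear membership function, and all function names are assumptions chosen only for illustration.

```python
from typing import Callable, Dict, List, Tuple

Record = Dict[str, float]


def crisp_cheap(record: Record) -> bool:
    """Crisp condition: the record either satisfies 'price < 100' or not."""
    return record["price"] < 100.0


def fuzzy_cheap(record: Record) -> float:
    """Fuzzy condition: degree in [0, 1] to which the price is 'cheap'.

    Hypothetical membership function: fully satisfied up to 80,
    not satisfied at all from 120, linear in between.
    """
    price = record["price"]
    if price <= 80.0:
        return 1.0
    if price >= 120.0:
        return 0.0
    return (120.0 - price) / 40.0


def fuzzy_query(
    records: List[Record],
    condition: Callable[[Record], float],
    threshold: float = 0.0,
) -> List[Tuple[float, Record]]:
    """Match each record against the condition and rank by matching degree."""
    scored = [(condition(r), r) for r in records]
    matched = [(degree, r) for degree, r in scored if degree > threshold]
    # Sort on the degree only, so records themselves are never compared.
    return sorted(matched, key=lambda pair: pair[0], reverse=True)


records = [{"price": 70.0}, {"price": 100.0}, {"price": 110.0}, {"price": 130.0}]
crisp_result = [r for r in records if crisp_cheap(r)]
fuzzy_result = fuzzy_query(records, fuzzy_cheap)
```

In this sketch the crisp query returns only the record priced at 70, while the fuzzy query also returns the records priced at 100 and 110, ranked by the degree to which they match.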

How to cite

Kacprzyk, Janusz, et al. "Fuzzy querying: issues and perspectives." Kybernetika 36.6 (2000): [605]-616. <http://eudml.org/doc/33506>.

@article{Kacprzyk2000,
author = {Kacprzyk, Janusz and Pasi, Gabriella and Vojtáš, Peter and Zadrożny, Sławomir},
journal = {Kybernetika},
keywords = {information retrieval systems; fuzzy queries},
language = {eng},
number = {6},
pages = {[605]-616},
publisher = {Institute of Information Theory and Automation AS CR},
title = {Fuzzy querying: issues and perspectives},
url = {http://eudml.org/doc/33506},
volume = {36},
year = {2000},
}

TY - JOUR
AU - Kacprzyk, Janusz
AU - Pasi, Gabriella
AU - Vojtáš, Peter
AU - Zadrożny, Sławomir
TI - Fuzzy querying: issues and perspectives
JO - Kybernetika
PY - 2000
PB - Institute of Information Theory and Automation AS CR
VL - 36
IS - 6
SP - [605]
EP - 616
LA - eng
KW - information retrieval systems; fuzzy queries
UR - http://eudml.org/doc/33506
ER -

