Improving Mathematics Retrieval

Kamali, Shahab; Tompa, Frank Wm.

  • Towards a Digital Mathematics Library. Grand Bend, Ontario, Canada, July 8-9th, 2009, Publisher: Masaryk University Press (Brno), pages 37-48

Abstract

Despite the popularity of storing mathematical objects on the web, searching for mathematical expressions is extremely limited. Conventional retrieval systems are inadequate for mathematical expressions because they are not tuned for text with complex structures that include only a few distinct terms. Surprisingly, current approaches to the problem of retrieving mathematical information do not include a formal definition of the similarity between two expressions, and thus fail to find many relevant documents. In this paper, we present steps to advance mathematics retrieval to incorporate best practices from modern information retrieval. We first review encodings of mathematical expressions currently found on the web, and present the results of our efforts to create an experimental testbed. We formally define the similarity between two mathematical expressions and present the problem of searching for similar mathematical expressions.

How to cite

Kamali, Shahab, and Tompa, Frank Wm. "Improving Mathematics Retrieval." Towards a Digital Mathematics Library. Grand Bend, Ontario, Canada, July 8-9th, 2009. Brno: Masaryk University Press, 2009. 37-48. <http://eudml.org/doc/221111>.

@inProceedings{Kamali2009,
abstract = {Despite the popularity of storing mathematical objects on the web, searching for mathematical expressions is extremely limited. Conventional retrieval systems are inadequate for mathematical expressions because they are not tuned for text with complex structures that include only a few distinct terms. Surprisingly, current approaches to the problem of retrieving mathematical information do not include a formal definition of the similarity between two expressions, and thus fail to find many relevant documents. In this paper, we present steps to advance mathematics retrieval to incorporate best practices from modern information retrieval. We first review encodings of mathematical expressions currently found on the web, and present the results of our efforts to create an experimental testbed. We formally define the similarity between two mathematical expressions and present the problem of searching for similar mathematical expressions.},
author = {Kamali, Shahab and Tompa, Frank Wm.},
booktitle = {Towards a Digital Mathematics Library. Grand Bend, Ontario, Canada, July 8-9th, 2009},
keywords = {MathML},
location = {Brno},
pages = {37--48},
publisher = {Masaryk University Press},
title = {Improving Mathematics Retrieval},
url = {http://eudml.org/doc/221111},
year = {2009},
}

TY - CLSWK
AU - Kamali, Shahab
AU - Tompa, Frank Wm.
TI - Improving Mathematics Retrieval
T2 - Towards a Digital Mathematics Library. Grand Bend, Ontario, Canada, July 8-9th, 2009
PY - 2009
CY - Brno
PB - Masaryk University Press
SP - 37
EP - 48
AB - Despite the popularity of storing mathematical objects on the web, searching for mathematical expressions is extremely limited. Conventional retrieval systems are inadequate for mathematical expressions because they are not tuned for text with complex structures that include only a few distinct terms. Surprisingly, current approaches to the problem of retrieving mathematical information do not include a formal definition of the similarity between two expressions, and thus fail to find many relevant documents. In this paper, we present steps to advance mathematics retrieval to incorporate best practices from modern information retrieval. We first review encodings of mathematical expressions currently found on the web, and present the results of our efforts to create an experimental testbed. We formally define the similarity between two mathematical expressions and present the problem of searching for similar mathematical expressions.
KW - MathML
UR - http://eudml.org/doc/221111
ER -

References

  1. http://db.uwaterloo.ca/mathretrieval
  2. http://www.wikipedia.org
  3. http://www.wolfram.com
  4. Maple Learning Guide. Maplesoft, a division of Waterloo Maple Inc., 2003.
  5. Mathematica 6. Wolfram Research Documentation Center, 2008. Zbl1147.30002
  6. Bancerek, G., Information retrieval and rendering with MML Query. In Proc. of MKM 2006, Lecture Notes in Artificial Intelligence 4108, pages 266–279. Springer Verlag, 2006. Zbl1188.68125
  7. Caprotti, O., Carlisle, D., Cohen, A., The OpenMath standard. The OpenMath Esprit Consortium, 2002.
  8. Carlisle, D., Ion, P., Miner, R., Mathematical Markup Language (MathML) version 3.0. W3C Working Draft, 2007.
  9. Einwohner, T. H., Fateman, R. J., Searching techniques for integral tables. In International Symposium on Symbolic and Algebraic Computation, pages 133–139, 1995. Zbl0922.68041
  10. Grimm, J., Tralics, a LaTeX to XML translator. INRIA, Institut National de Recherche en Informatique et en Automatique, 2008.
  11. Guidi, F., Schena, I., A query language for a metadata framework about mathematical resources. In Asperti et al., pages 105–118, 2003. Zbl1022.68616
  12. Halldórsson, M., Tanaka, K., Approximation and special cases of common subtrees and editing distance. In Proc. 7th Ann. Int. Symp. on Algorithms and Computation, Lecture Notes in Comput. Sci. 1178, Springer-Verlag, 1996. MR1615179
  13. Kohlhase, M., Şucan, I. A., A search engine for mathematical formulae. In Artificial Intelligence and Symbolic Computation, LNCS, pages 241–253, 2006. Zbl1156.68306
  14. Miller, B., Youssef, A., Technical aspects of the digital library of mathematical functions. Ann. Math. Artificial Intelligence, 2002. MR1990417
  15. Munavalli, R., Miner, R., MathFind: a math-aware search engine. In SIGIR, page 735, 2006.
  16. Youssef, A., Search of mathematical contents: Issues and methods. IASSE, 2005.
  17. Youssef, A., Methods of relevance ranking and hit-content generation in math search. Calculemus/MKM, 2007. Zbl1202.68161
