Search:
Lehrstuhl  |  Institut  |  Fakultät  |  LMU
print

Similarity Estimation using Bayes Ensembles

Published at 22nd International Conference on Scientific and Statistical Database Management (SSDBM)

Conference Date: June 30th to July 2nd of 2010
Conference Location: Heidelberg, Germany
Conference Title: 22nd International Conference on Scientific and Statistical Database Management (SSDBM), Heidelberg, Germany, 2010
Conference Chairs: Andreas Reuter, Michael Gertz Conference Co-Chairs: Tony Hey, Bertram Ludäscher

Abstract

Similarity search and data mining often rely on distance or similarity functions in order to provide meaningful results and semantically meaningful patterns. However, standard distance measures like Lp-norms are often not capable to accurately mirror the expected similarity between two objects. To bridge the so-called semantic gap between feature representation and object similarity, the distance function has to be adjusted to the current application context or user. In this paper, we propose a new probabilistic framework for estimating a similarity value based on a Bayesian setting. In our framework, distance comparisons are modeled based on distribution functions on the difference vectors. To combine these functions, a similarity score is computed by an Ensemble of weak Bayesian learners for each dimension in the feature space. To find independent dimensions of maximum meaning, we apply a space transformation based on eigenvalue decomposition. In our experiments, we demonstrate that our new method shows promising results compared to related Mahalanobis learners on several test data sets w.r.t. nearest-neighbor classification and precision-recall-graphs.

Copyright Notes

Tobias Emrich, Franz Graf, Hans-Peter Kriegel, Matthias Schubert, Marisa Thoma
"Similarity Estimation using Bayes Ensembles", 22nd International Conference on Scientific and Statistical Database Management (SSDBM), Heidelberg, Germany, 2010.

M. Gertz and B. Ludäscher (Eds.): SSDBM 2010, LNCS 6187, pp. 537–554, 2010.
© Springer-Verlag Berlin Heidelberg 2010

DOI: 10.1007/978-3-642-13818-8_37

Documents

This is the author’s version of the work. It is posted here by permission of Springer for your personal use. Not for redistribution.

Paper pdf.gif
Poster pdf.gif
Talk pdf.gif

BibTex

@INPROCEEDINGS{EmrGraKriSchetal10a,
  AUTHOR    = {Emrich, Tobias and Graf, Franz and Kriegel, Hans-Peter
              and Schubert, Matthias and Thoma, Marisa},
  TITLE     = {Similarity Estimation using Bayes Ensembles},
  BOOKTITLE = {Proceedings of the 22nd International Conference on
              Scientific and Statistical Database Management (SSDBM),
              Heidelberg, Germany},
  VOLUME    = {6187},
  PAGES     = {537–-554},
  YEAR      = {2010},
  DOI       = {10.1007/978-3-642-13818-8_37}
}

logo: Supported by BMWi

blank