Skip to main content

Learning Distance Measures

  • Reference work entry
  • 79 Accesses

Synonyms

Flexible metric computation; Adaptive metric techniques

Definition

Many problems in data mining (e.g., classification, clustering, information retrieval) are concerned with the discovery of homogeneous groups of data according to a certain similarity (or distance) measure. The distance measure in use strongly affects the nature of the patterns (clusters, classes, or retrieved images) emerging from the given data. Typically, any chosen fixed distance measure, such as Euclidean or Manhattan distance, does not capture the underlying structure of the data, and fails to find meaningful patterns which correspond to the user’s preferences. To address this issue, techniques have been developed that learn from the data how to compute dissimilarities between pairs of objects. Since objects are commonly represented as vectors of measurements in a given feature space, distances between two objects are computed in terms of the dissimilarity between their corresponding feature components....

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   2,500.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Recommended Reading

  1. Bellman R. Adaptive Control Processes. Princeton University Press, 1961.

    Google Scholar 

  2. Blansch A., Ganarski P., and Korczak J. Maclaw: a modular approach for clustering with local attribute weighting. Pattern Recognit. Lett., 27(11):1299–1306, 2006.

    Article  Google Scholar 

  3. Domeniconi C., Gunopulos D., and Peng J. Large margin nearest neighbor classifiers. IEEE Trans. Neural Netw., 16:899–909, 2005.

    Article  Google Scholar 

  4. Domeniconi C., Gunopulos D., Yan S., Ma B., Al-Razgan M., and Papadopoulos D. Locally adaptive metrics for clustering high dimensional data. Data Mining Knowl. Discov. J., 14:63–97, 2007.

    Article  MathSciNet  Google Scholar 

  5. Domeniconi C., Peng J., and Gunopulos D. Locally adaptive metric nearest neighbor classification. IEEE Trans. Pattern Anal. Mach. Intell., 24:1281–1285, 2002.

    Article  Google Scholar 

  6. Friedman J. Flexible metric nearest neighbor classification. In Tech. Report, Dept. of Statistics, Stanford University, 1994.

    Google Scholar 

  7. Friedman J. and Meulman J. Clustering Objects On Subsets of Attributes. Technical Report, Stanford University, 2002.

    Google Scholar 

  8. Frigui H. and Nasraoui O. Unsupervised learning of prototypes and attribute weights. Pattern Recognit., 37(3):943–952, 2004.

    Article  Google Scholar 

  9. Hartigan J.A. Direct clustering of a data matrix. J. Am. Stat. Assoc., 67(337):123–129, 1972.

    Article  Google Scholar 

  10. Hastie T. and Tibshirani R. Discriminant adaptive nearest neighbor classification. IEEE Trans. Pattern Anal. Machine Intell., 18:607–615, 1996.

    Article  Google Scholar 

  11. Jain A., Mutty M., and Flyn P. Data clustering: a review. ACM Comput. Surv., 31(3), 1999.

    Google Scholar 

  12. Modha D. and Spangler S. Feature weighting in K-means clustering. Mach. Learn., 52(3):217–237, 2003.

    Article  MATH  Google Scholar 

  13. Shawe-Taylor J. and Fiege N. Pietzuch P. Kernel Methods for Pattern Analysis. Cambridge University Press, London, 2004.

    Google Scholar 

  14. Xing E., Ng A., Jordan M., and Russell S. Distance metric learning, with application to clustering with side-information. Advances in NIPS, vol. 15, 2003.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer Science+Business Media, LLC

About this entry

Cite this entry

Domeniconi, C. (2009). Learning Distance Measures. In: LIU, L., ÖZSU, M.T. (eds) Encyclopedia of Database Systems. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-39940-9_614

Download citation

Publish with us

Policies and ethics