Abstract
Solving many scientific problems requires effective regression and/or classification models for large high-dimensional datasets. Experts from these problem domains (e.g. biologists, chemists, financial analysts) have insights into the domain that can help in developing powerful models, but they need a modelling framework that lets them use these insights. Data visualisation is an effective technique for presenting data and eliciting feedback from the experts. A single global regression model can rarely capture the full behavioural variability of a huge multi-dimensional dataset. Instead, local regression models, each focused on a separate area of input space, often work better, since the behaviour of different areas may vary. Classical local models such as Mixture of Experts segment the input space automatically, which is not always effective and does not involve domain experts in guiding a meaningful segmentation of the input space. In this paper we address this issue by allowing domain experts to interactively segment the input space using data visualisation. The resulting segmentation is then used to develop effective local regression models.
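The idea of fitting one regression model per expert-defined segment can be illustrated with a minimal sketch. It is not the method of the paper: it assumes plain least-squares linear models, and the segment labels (which in the paper would come from interactive visualisation) are replaced here by a hypothetical hand-written threshold on a toy one-dimensional dataset. The function names `fit_local_linear_models` and `predict_local` are illustrative only.

```python
import numpy as np


def fit_local_linear_models(X, y, segment_labels):
    """Fit one least-squares linear model (with intercept) per segment."""
    models = {}
    for label in np.unique(segment_labels):
        mask = segment_labels == label
        # Append a column of ones so each local model has its own intercept.
        design = np.hstack([X[mask], np.ones((mask.sum(), 1))])
        weights, *_ = np.linalg.lstsq(design, y[mask], rcond=None)
        models[int(label)] = weights
    return models


def predict_local(models, X, segment_labels):
    """Predict each point with the model of the segment it belongs to."""
    design = np.hstack([X, np.ones((X.shape[0], 1))])
    preds = np.full(X.shape[0], np.nan)  # NaN where no model exists
    for label, weights in models.items():
        mask = segment_labels == label
        preds[mask] = design[mask] @ weights
    return preds


# Toy data (assumption): the target behaves differently on each side of x = 0.
rng = np.random.default_rng(0)
X = rng.uniform(-1.0, 1.0, size=(200, 1))
y = np.where(X[:, 0] < 0, -2.0 * X[:, 0], 3.0 * X[:, 0] + 1.0)

# Stand-in for the expert's segmentation: two regions split at x = 0.
labels = (X[:, 0] >= 0).astype(int)

local_models = fit_local_linear_models(X, y, labels)
local_mse = np.mean((predict_local(local_models, X, labels) - y) ** 2)

# A single global model is the same procedure with every point in one segment.
one_segment = np.zeros(len(X), dtype=int)
global_models = fit_local_linear_models(X, y, one_segment)
global_mse = np.mean((predict_local(global_models, X, one_segment) - y) ** 2)

print(f"local MSE:  {local_mse:.3e}")
print(f"global MSE: {global_mse:.3e}")
```

On this toy problem the per-segment models recover the two linear regimes, while the single global model cannot, which is the motivation the abstract gives for local regression. Any regressor could take the place of the linear model within each segment.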
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Maniyar, D.M., Nabney, I.T. (2005). Guiding Local Regression Using Visualisation. In: Winkler, J., Niranjan, M., Lawrence, N. (eds) Deterministic and Statistical Methods in Machine Learning. DSMML 2004. Lecture Notes in Computer Science, vol 3635. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11559887_6
DOI: https://doi.org/10.1007/11559887_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29073-5
Online ISBN: 978-3-540-31728-9
eBook Packages: Computer Science, Computer Science (R0)