Abstract
The steadily growing amounts of digital neuroscientific data demands for a reliable, systematic, and computationally effective retrieval algorithm. In this paper, we present Neuron-Miner, which is a tool for fast and accurate reference-based retrieval within neuron image databases. The proposed algorithm is established upon hashing (search and retrieval) technique by employing multiple unsupervised random trees, collectively called as Hashing Forests (HF). The HF are trained to parse the neuromorphological space hierarchically and preserve the inherent neuron neighborhoods while encoding with compact binary codewords. We further introduce the inverse-coding formulation within HF to effectively mitigate pairwise neuron similarity comparisons, thus allowing scalability to massive databases with little additional time overhead. The proposed hashing tool has superior approximation of the true neuromorphological neighborhood with better retrieval and ranking performance in comparison to existing generalized hashing methods. This is exhaustively validated by quantifying the results over 31266 neuron reconstructions from Neuromorpho.org dataset curated from 147 different archives. We envisage that finding and ranking similar neurons through reference-based querying via Neuron Miner would assist neuroscientists in objectively understanding the relationship between neuronal structure and function for applications in comparative anatomy or diagnosis.











Similar content being viewed by others
References
Albalate, A., & Suendermann, D. (2009). A combination approach to cluster validation based on statistical quantiles. In 2009. IJCBS’09. International Joint Conference on (pp. 549-555) Bioinformatics, Systems Biology and Intelligent Computing: IEEE.
Ascoli, G.A., Donohue, D.E., & Halavi, M. (2007). Neuromorpho. Org: a central resource for neuronal morphologies. The Journal of Neuroscience, 27(35), 9247–9251.
Costa, M., Ostrovsky, A.D., Manton, J.D., Prohaska, S., & Jefferis, G.S. (2014). NBLAST: Rapid, sensitive comparison of neuronal structure and construction of neuron family databases. bioRxiv, p.006346.
Costa, L.D.F., Zawadzki, K., Miazaki, M., Viana, M.P., & Taraskin, S. (2010). Unveiling the neuromorphological space. Frontiers in Computational Neuroscience, 4, 150.
Desgraupes, B. (2013). Clustering indices. University of Paris Ouest-Lab Modal’X, 1, 34.
Gionis, A., Indyk, P., & Motwani, R. (1999). Similarity search in high dimensions via hashing. In VLDB 99(6), p. 518-529. Vancouver.
He, X., Cai, D., Yan, S., & Zhang, H.J. (2005). Neighborhood preserving embedding. In 2005. ICCV 2005. Tenth IEEE International Conference on (Vol. 2, pp. 1208-1213) Computer Vision: IEEE.
Hotelling, H. (1933). Analysis of a complex of statistical variables into principal components. Journal of educational psychology, 24(6), 417.
Joly, A., & Buisson, O. (2011). Random maximum margin hashing. In 2011 IEEE Conference on (pp. 873-880) Computer Vision and Pattern Recognition (CVPR): IEEE.
Kendall, M.G. (1948). Rank correlation methods. Biometrika, 44(1/2), 298.
Kovács, F., Legány, C., & Babos, A. (2005). Cluster validity measurement techniques. In Proceedings of the 6th International Symposium of Hungarian Researchers on Computational Intelligence (pp. 18–19). Budapest.
Literature Search Main Results (2015). Available at: http://neuromorpho.org/neuroMorpho/LS_queryStatus.jsp, (Accessed: 09 February 2016).
Liu, X., Huang, L., Deng, C., Lu, J., & Lang, B. (2015). Multi-View Complementary hash tables for nearest neighbor search. In Proceedings of the IEEE International Conference on Computer Vision (pp. 1107–1115).
Louppe, G. (2014). Understanding random forests: From theory to practice.arXiv preprint arXiv:1407.7502.
Menze, B.H., Kelm, B.M., Splitthoff, D.N., Koethe, U., & Hamprecht, F.A. (2011). On oblique random forests. In Machine Learning and Knowledge Discovery in Databases (pp. 453-469): Springer Berlin Heidelberg.
Mesbah, S., Conjeti, S., Kumaraswamy, A., Rautenberg, P., Navab, N., & Katouzian, A. (2015). Hashing Forests for Morphological Search and Retrieval in Neuroscientific Image Databases. In Medical Image Computing and Computer-Assisted Intervention–MICCAI 2015 (pp. 135-143): Springer International Publishing.
Overview of L-Measure (2015). Available at: http://cng.gmu.edu:8080/Lm/help/index.htm, (Accessed: 09 February 2016).
Polavaram, S., Gillette, T.A., Parekh, R., & Ascoli, G.A. (2014). Statistical analysis and data mining of digital reconstructions of dendritic morphologies. Frontiers in Neuroanatomy, 8, 138.
Rautenberg, P.L., Grothe, B., & Felmy, F. (2009). Quantification of the three dimensional morphology of coincidence detector neurons in the medial superior olive of gerbils during late postnatal development. Journal of Comparative Neurology, 517(3), 385– 396.
Rautenberg, P.L., Kumaraswamy, A., Tejero-Cantero, A., Doblander, C., Norouzian, M.R., Kai, K., Jacobsen, H.A., Ai, H., Wachtler, T., & Ikeno, H. (2014). Neurondepot: keeping your colleagues in sync by combining modern cloud storage services, the local file system, and simple web applications. Frontiers in Neuroinformatics, 8, 55.
Scorcioni, R., Polavaram, S., & Ascoli, G.A. (2008). L-measure: a web-accessible tool for the analysis, comparison and search of digital reconstructions of neuronal morphologies. Nature protocols, 3(5), 866–876.
Search by Morphometry (2015). Available at: http://neuromorpho.org/neuroMorpho/MorphometrySearch.jsp.
Slaney, M., & Casey, M. (2008). Locality-sensitive hashing for finding nearest neighbors [lecture notes]. IEEE Signal Processing Magazine, 25(2), 128–131.
Scikit-learn: machine learning in Python – scikit-learn 0.16.1 documentation (2015) Available at: http://scikit-learn.org/stable/ (Accessed: 25 August 2015).
Wan, Y., Long, F., Qu, L., Xiao, H., Hawrylycz, M., Myers, E.W., & Peng, H. (2015). BlastNeuron for automated comparison, retrieval and clustering of 3D neuron morphologies. Neuroinformatics, 13(4), 487–499.
Wang, J., Liu, W., Kumar, S., & Chang, S.F. (2016). Learning to hash for indexing big Data–A survey. Proceedings of the IEEE, 104(1), 34–57.
Weiss, Y., Fergus, R., & Torralba, A. (2012). Multidimensional spectral hashing. In Computer Vision–ECCV 2012 (pp. 340–353): Springer Berlin Heidelberg.
Yu, G., & Yuan, J. (2014). Scalable forest hashing for fast similarity search. In 2014 IEEE International Conference on (pp. 1-6) Multimedia and Expo (ICME): IEEE.
Zhang, D., Wang, J., Cai, D., & Lu, J. (2010). Self-taught hashing for fast similarity search. In proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval (pp. 18-25): ACM.
Acknowledgments
We thank Ajayrama Kumaraswamy, Computational Neuroscience Department Biology II, Ludwigs Maximillian Universität München, Germany for insightful discussion in the early conception of this work. We thank the assistance of Bastien Saquet of Max Plank Digital Library, München, Germany in maintaining the web-service. We would like to thank the Max Plank Digital Library, München, Germany for providing computing resources for hosting the Neuron-Miner software and making it publicly accessible.
Conflict of interests
We have no conflict of interest to declare.
Author information
Authors and Affiliations
Corresponding author
Additional information
S. Conjeti and S. Mesbah contributed equally towards the work.
Electronic supplementary material
Below is the link to the electronic supplementary material.
(WMV 3.49 MB)
Rights and permissions
About this article
Cite this article
Conjeti, S., Mesbah, S., Negahdar, M. et al. Neuron-Miner: An Advanced Tool for Morphological Search and Retrieval in Neuroscientific Image Databases. Neuroinform 14, 369–385 (2016). https://doi.org/10.1007/s12021-016-9300-2
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12021-016-9300-2