Skip to main content

Counting on Uncertainty: Obtaining Fish Counts from Machine Learning Decisions

  • Chapter
  • First Online:
Fish4Knowledge: Collecting and Analyzing Massive Coral Reef Fish Video Data

Part of the book series: Intelligent Systems Reference Library ((ISRL,volume 104))

  • 950 Accesses

Abstract

Most questions in the Fish4Knowledge project are related to the ability to count fish, fish species or events using video analysis Chap. 2. Automatic video analysis however brings uncertainty due to False Positive/False Negative classifications, which makes determining the counts based on automatic video analysis difficult. Automatic video analysis software is often able to express a measure of uncertainty for a single detection/recognition using a similarity score, indicating how certain the software is about a single decision. Logistic Regression allows us to combine these similarity scores to compute an estimated count, outperforming the estimates of the underlying machine learning methods used for the original video analysis. We show this works both for the two-class and multi-class problem. Furthermore, we identify potential pitfalls where ensuring a correct sampling procedure is essential. The error in the estimated species counts using Logistic Regression was on average around 15 on a set of 11,585 fish images, while the machine methods had an average error of 800. The key to understanding this huge improvement in accuracy is that we use Logistic Regression over the individual classifications to estimate the size of the whole population, rather than the species classification of any individual observation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    In our experiments, we observed that both the logit kernel, described here, and the probit kernel (Bliss 1934) obtain almost similar results.

References

  • Bliss, C. 1934. The method of probits. Science 79(2037): 38–39.

    Article  Google Scholar 

  • Chan, A. B., Z.-S. Liang., and N. Vasconcelos. 2008. Privacy preserving crowd monitoring: Counting people without people models or tracking. In IEEE conference on computer vision and pattern recognition, 2008. CVPR 2008, 1–7. New York: IEEE.

    Google Scholar 

  • Hay, A. 1988. The derivation of global estimates from a confusion matrix. International Journal of Remote Sensing 9(8): 1395–1398.

    Article  Google Scholar 

  • Huang, P.X., B.J. Boom, and R.B. Fisher. 2014. GMM improves the reject option in hierarchical classification for fish recognition. In Proceedings of the workshop on applications of computer vision 2014: 371–376.

    Google Scholar 

  • Jupp, D.L.B. 1989. The stability of global estimates from confusion matrices. International Journal of Remote Sensing 10(9): 1563–1569.

    Article  Google Scholar 

  • Lemeshow, S., and D.W. Hosmer. 1982. A review of goodness of fit statistics for use in the development of logistic regression models. American Journal of Epidemiology 115(1): 92–106.

    Google Scholar 

  • Lempitsky, V. S., and A. Zisserman. 2010. Learning to count objects in images. In NIPS, vol. 1, p. 2.

    Google Scholar 

  • Lip, C., and D. Ramli. 2012. Comparative study on feature, score and decision level fusion schemes for robust multibiometric systems. In Frontiers in computer education, vol. 133, ed. S. Sambath, and E. Zhu, 941–948., Advances in Intelligent and Soft Computing Springer: Berlin.

    Chapter  Google Scholar 

  • Platt, J.C. 1999. Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods. In Advances in large margin classifiers. Citeseer.

    Google Scholar 

  • Saerens, M., P. Latinne, and C. Decaestecker. 2002. Adjusting the outputs of a classifier to new a priori probabilities: a simple procedure. Neural Computation 14(1): 21–41.

    Article  MATH  Google Scholar 

  • Vucetic, S., and Z. Obradovic, 2001. Classification on data with biased class distribution. In Machine learning: ECML 2001, 527–538. Heidelberg: Springer.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Bastiaan J. Boom .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this chapter

Cite this chapter

Boom, B.J. (2016). Counting on Uncertainty: Obtaining Fish Counts from Machine Learning Decisions. In: Fisher, R., Chen-Burger, YH., Giordano, D., Hardman, L., Lin, FP. (eds) Fish4Knowledge: Collecting and Analyzing Massive Coral Reef Fish Video Data. Intelligent Systems Reference Library, vol 104. Springer, Cham. https://doi.org/10.1007/978-3-319-30208-9_15

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-30208-9_15

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-30206-5

  • Online ISBN: 978-3-319-30208-9

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics