Skip to main content

Mining the Potential Relationships Between Cancer Cases and Industrial Pollution Based on High-Influence Ordered-Pair Patterns

  • Conference paper
  • First Online:
Advanced Data Mining and Applications (ADMA 2022)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 13725))

Included in the following conference series:

  • 1190 Accesses

Abstract

Co-location pattern mining aims to discover the relationships between spatial features. Traditional co-location patterns are based on clique relationships and only consider the prevalence of patterns. However, pollution sources and cancer cases do not satisfy the clique relationship, and users focus on the influence of pollution sources on cancer cases. Therefore, we propose high-influence ordered-pair patterns to study their relationships. First, we measure the influence of pollution sources on cancer cases. Then, to efficiently mine high-influence ordered-pair patterns, we propose a basic algorithm with two pruning strategies and an optimizing algorithm based on participating instances. Extensive experiments on real and synthetic datasets show that our mining results are more reasonable than existing algorithms and can provide guidance for cancer prevention. Moreover, our algorithm is also highly efficient and scalable.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    https://github.com/juanjuanShu/paper/blob/main/supplement.pdf.

  2. 2.

    https://wryjc.cnemc.cn/gkpt/mainZxjc/530000.

References

  1. Yao, X., Peng, L., Yang, L., Chi, T.: A fast space-saving algorithm for maximal co-location pattern mining. Expert Syst. Appl. 63(C), 310–323 (2016)

    Article  Google Scholar 

  2. Bao, X., Wang, L.: A clique-based approach for co-location pattern mining. Inf. Sci. 490, 244–264 (2019)

    Article  Google Scholar 

  3. Bing, Z., Zuqiang, M., Liangliang, S., Hongli, L.I., Computer, S.O.: Adaptive k neighbor algorithm based on local density and purity. J. Guangxi Acad. Sci. 33(1), 19–24 (2017)

    Google Scholar 

  4. García-Pérez, J., et al.: Residential proximity to industrial pollution sources and colorectal cancer risk: a multicase-control study (MCC-Spain). Environ. Int. 144, 106055 (2020)

    Google Scholar 

  5. Lei, L., Wang, L., Zeng, Y., Zeng, L.: Discovering high influence co-location patterns from spatial data sets. In: ICBK, pp. 137–144. IEEE (2019)

    Google Scholar 

  6. Ramis, R., Diggle, P., Cambra, K., López-Abente, G.: Prostate cancer and industrial pollution. Environ. Int. 37(3), 577–585 (2011)

    Article  Google Scholar 

  7. Terrell, K.A., St Julien, G.: Air pollution is linked to higher cancer rates among black or impoverished communities in Louisiana. Environ. Res. Lett. 17(1), 014033 (2022)

    Article  Google Scholar 

  8. Wang, L., Zhou, L., Lu, J., Yip, J.: An order-clique-based approach for mining maximal co-locations. Inf. Sci. 179(19), 3370–3382 (2009)

    Article  MATH  Google Scholar 

  9. Yang, P., Wang, L., Wang, X., Zhou, L.: SCPM-CR: a novel method for spatial co-location pattern mining with coupling relation consideration. IEEE Trans. Knowl. Data Eng. (2021). https://doi.org/10.1109/TKDE.2021.3060119

    Article  Google Scholar 

  10. Yoo, J.S., Shekhar, S.: A joinless approach for mining spatial colocation patterns. IEEE Trans. Knowl. Data Eng. 18(10), 1323–1337 (2006)

    Article  Google Scholar 

  11. Zambon, P., et al.: Sarcoma risk and dioxin emissions from incinerators and industrial plants: a population-based case-control study (Italy). Environ. Health 6(1), 19 (2007)

    Article  Google Scholar 

Download references

Acknowledgements

This work is supported by the National Natural Science Foundation of China (61966036, 62062066), and Yunnan University Postgraduate Technological Innovation Project(2021Y175).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Lizhen Wang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Shu, J., Wang, L., Yang, P., Tran, V. (2022). Mining the Potential Relationships Between Cancer Cases and Industrial Pollution Based on High-Influence Ordered-Pair Patterns. In: Chen, W., Yao, L., Cai, T., Pan, S., Shen, T., Li, X. (eds) Advanced Data Mining and Applications. ADMA 2022. Lecture Notes in Computer Science(), vol 13725. Springer, Cham. https://doi.org/10.1007/978-3-031-22064-7_3

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-22064-7_3

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-22063-0

  • Online ISBN: 978-3-031-22064-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics