Skip to main content

Counterfactual Retrieval for Augmentation and Decisions

  • Conference paper
  • First Online:
Machine Learning for Cyber Security (ML4CS 2020)

Part of the book series: Lecture Notes in Computer Science ((LNSC,volume 12487))

Included in the following conference series:

  • 1172 Accesses

Abstract

The optimal situation to make a decision is to have all variables in grasp. This however, almost never occurs. There has been research on counterfactuals as a way to provide more explainable systems and models. In furtherance of this research, this paper proposes CORFAD, Counterfactual Retrieval for Augmentation and Decisions. We explore user generated counterfactual tweets and by aggregating counterfactual statements that relate to pre-determined keywords, CORFAD simplifies data analysis by suggesting variables towards which future actions might have the greater or lesser effects towards a defined goal. This has the dual purpose of making synthetic counterfactual data generation more focused and less likely to generate non-useful explanations, while also able to stand alone to assist decision makers. This paper uses as test case, Counterfactual Statements connected with the Tesla Model 3 to explore insights that can guide decision-making in situations where multiple variables are possible and exist.

This work was supported by the National Natural Science Foundation of China [71901150] and China Postdoctoral Science Foundation Grant [2019M663083], Guangdong Province Postgraduate Education Innovation Plan (2019SFKC46).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 99.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 129.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    A fair idea of 1 GB of text data is reading 1000 books, each of 600 pages, and containing 300 words per page.

  2. 2.

    https://github.com/Kc2fresh/Counterfactuals.

  3. 3.

    https://colab.research.google.com.

  4. 4.

    https://huggingface.co.

  5. 5.

    https://pytorch.org.

  6. 6.

    This hinges on improvements made on the underlying model, which achieves an F1 score of 86.9% on the SemEval Post-Evaluation Leader-board.

References

  1. Nutt, P.C.: Models for decision making in organizations and some contextual variables which stipulate optimal use. Acad. Manag. Rev. 1(2), 84–98 (1976). https://doi.org/10.5465/1976.4408670

    Article  Google Scholar 

  2. Bottou, L., et al.: Counterfactual reasoning and learning systems: the example of computational advertising. J. Mach. Learn. Res. 14(1), 3207–3260 (2013)

    MathSciNet  MATH  Google Scholar 

  3. Pearl, J.: Causal and counterfactual inference. In: The Handbook of Rationality, pp. 1–41 (2018)

    Google Scholar 

  4. Son, Y., et al.: Recognizing counterfactual thinking in social media texts. In: ACL (2017). https://doi.org/10.18653/v1/P17-2103

  5. Whitty, M.T.: Liar, liar! An examination of how open, supportive and honest people are in chat rooms. Comput. Hum. Behav. 18(4), 343–352 (2002)

    Article  Google Scholar 

  6. Hendricks, L. A., Hu, R., Darrell, T., Akata, Z.: Generating counterfactual explanations with natural language. arXiv preprint arXiv:1806.09809. (2018)

  7. Ramaravind, K.M., Amit, S., Chenhao T.: Explaining machine learning classifiers through diverse counterfactual explanations. In Conference on Fairness, Accountability, and Transparency, 27–30 January (2020). https://doi.org/10.1145/3351095.3372850

  8. Lewis, D.: Counterfactuals. John Wiley & Sons, Hoboken (2013)

    MATH  Google Scholar 

  9. Brill, E.: A simple rule-based part of speech tagger. In: Proceedings of the Third Conference on Applied Natural Language Processing, pp. 152–155. Association for Computational Linguistics (1992)

    Google Scholar 

  10. Manning, C.D., Manning, C.D., Schütze, H.: Foundations of Statistical Natural Language Processing. MIT Press, Cambridge (1999)

    MATH  Google Scholar 

  11. Vaswani, A., et al.: Attention is all you need. In: Advances in Neural Information Processing Systems, pp. 5998–6008 (2017)

    Google Scholar 

  12. Raffel, C., et al.: Exploring the limits of transfer learning with a unified text-to-text transformer. arXiv preprint arXiv:1910.10683 (2019)

  13. Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding arXiv preprint: 1810.04805. (2018). https://doi.org/10.18653/v1/N19-1423

  14. Peters, M.E., Ruder, S., Smith, N.A.: To Tune or not to tune? Adapting pretrained representations to diverse tasks. In: ACL (2019). https://doi.org/10.18653/v1/W19-4302

  15. Yang, X., Obadinma, S., Zhao, H., Zhang, Q., Matwin, S., Zhu, X.: SemEval-2020 Task 5: counterfactual recognition. In: Proceedings of the 14th International Workshop on Semantic Evaluation (SemEval-2020) (2020)

    Google Scholar 

  16. Russell, M.A.: Mining the Social Web: Data Mining Facebook, Twitter, LinkedIn, Google+, GitHub, and More. O’Reilly Media, Inc. (2013). https://doi.org/10.1080/15536548.2015.1046287

  17. Nwaike, K., Jiao, L. : Counterfactual detection meets transfer learning. In: Modelling Causal Reasoning in Language: Detecting Counterfactuals at SemEval-2020 Task [5] (2020, accepted)

    Google Scholar 

  18. Agerri, R., Artola, X., Beloki, Z., Rigau, G., Soroa, A.: Big data for natural language processing: a streaming approach. Knowl.-Based Syst. 79, 36–42 (2015)

    Article  Google Scholar 

  19. Liu, Y., et al.: RoBERTa: a robustly optimized BERT pretraining approach. arXiv preprint 1907.11692 (2019)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Nwaike Kelechi .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Kelechi, N., Geng, S. (2020). Counterfactual Retrieval for Augmentation and Decisions. In: Chen, X., Yan, H., Yan, Q., Zhang, X. (eds) Machine Learning for Cyber Security. ML4CS 2020. Lecture Notes in Computer Science(), vol 12487. Springer, Cham. https://doi.org/10.1007/978-3-030-62460-6_30

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-62460-6_30

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-62459-0

  • Online ISBN: 978-3-030-62460-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics