Skip to main content

Towards Sentiment Analysis on German Literature

  • Conference paper
  • First Online:
KI 2017: Advances in Artificial Intelligence (KI 2017)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 10505))

Abstract

Sentiment Analysis is a Natural Language Processing-task that is relevant in a number of contexts, including the analysis of literature. We report on ongoing research towards enabling, for the first time, sentence-level Sentiment Analysis in the domain of German novels. We create a labelled dataset from sentences extracted from German novels and, by adapting existing sentiment classifiers, reach promising F1-scores of 0.67 for binary polarity classification.

Und sie lebten glücklich bis ans Ende ihrer Tage.    (German fairy tales)

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    Ternary labels are transformed into binary labels by omission of the neutral class (0).

  2. 2.

    We use the words “nicht” (not), “kein” (no), “ohne” (without), “nie” (never), “niemals” (never), “nirgends” (nowhere), “niemand” (nobody), and “keiner”(nobody) as negation markers.

  3. 3.

    https://code.google.com/archive/p/word2vec/.

  4. 4.

    https://github.com/yoonkim/CNN_sentence.

  5. 5.

    https://textgrid.de/digitale-bibliothek.

  6. 6.

    https://www.dmir.org/datasets/german_novel_dataset.

  7. 7.

    We also evaluated other selection schemes, but found that random selection yielded too many unemotional sentences, while \(r=e\) preferred very long ones.

  8. 8.

    Available on http://dmir.org/senticrowd/senticrowd. Login is possible with both “Microworkers-ID” and “Kampagnen-ID” set to “demo” in the upper form.

  9. 9.

    http://www.microworkers.com and https://www.crowdflower.com.

  10. 10.

    http://scikit-learn.org/0.17/modules/generated/sklearn.metrics.f1_score.

  11. 11.

    http://www.wildml.com/2015/11/understanding.

References

  1. Bergstra, J., Bengio, Y.: Random search for hyper-parameter optimization. JMLR 13, 281–305 (2012)

    MathSciNet  MATH  Google Scholar 

  2. Elsner, M.: Abstract representations of plot struture. LiLT 12 (2015)

    Google Scholar 

  3. Joachims, T.: Text categorization with support vector machines: learning with many relevant features. In: Nédellec, C., Rouveirol, C. (eds.) ECML 1998. LNCS, vol. 1398, pp. 137–142. Springer, Heidelberg (1998). doi:10.1007/BFb0026683

    Chapter  Google Scholar 

  4. Jockers, M.L.: A novel method for detecting plot, June 2014

    Google Scholar 

  5. Kalchbrenner, N., Grefenstette, E., Blunsom, P.: A convolutional neural network for modelling sentences. In: Proceedings of the 52nd ACL, pp. 212–217

    Google Scholar 

  6. Kim, Y.: Convolutional neural networks for sentence classification. In: Proceedings of the 2014 Conference on EMNLP, pp. 1746–1751 (2014)

    Google Scholar 

  7. Le, Q.V., Mikolov, T.: Distributed representations of sentences and documents. In: ICML, vol. 14, pp. 188–1196 (2014)

    Google Scholar 

  8. LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86, 2278–2324 (1998)

    Article  Google Scholar 

  9. Mesnil, G., Mikolov, T., Ranzato, M., Bengio, Y.: Ensemble of generative and discriminative techniques for sentiment analysis of movie reviews (2014). arXiv:1412.5335

  10. Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space (2013). arXiv:1301.3781

  11. Mikolov, T., Sutskever, I., Chen, K., Corrado, G., Dean, J.: Distributed representations of words and phrases and their compositionality. In: NIPS (2013)

    Google Scholar 

  12. Mohammad, S.: From once upon a time to happily ever after: tracking emotions in novels and fairy tales. In: LaTeCH 2011, pp. 105–114. Association for Computational Linguistics, Stroudsburg, PA (2011)

    Google Scholar 

  13. Mohammad, S.M., Kiritchenko, S., Zhu, X.: Nrc-canada: building the state-of-the-art in sentiment analysis of tweets. In: SemEval-2013 (2013). arXiv:1308.6242

  14. Mohammad, S.M., Turney, P.D.: Crowdsourcing a word-emotion association lexicon. Comput. Intell. 29(3), 436–465 (2013)

    Article  MathSciNet  Google Scholar 

  15. Nakov, P., Kozareva, Z., Ritter, A., Rosenthal, S., Stoyanov, V., Wilson, T.: SemEval-2013 task 2: Sentiment analysis in twitter (2013)

    Google Scholar 

  16. Plutchik, R.: A general psychoevolutionary theory of emotion. Theor. Emotion 1, 3–31 (1980)

    Article  Google Scholar 

  17. Radford, A., Jozefowicz, R., Sutskever, I.: Learning to generate reviews and discovering sentiment (2017). arXiv:1704.01444

  18. Řehůřek, R., Sojka, P.: Software framework for topic modelling with large corpora. In: Proceedings of the LREC 2010 Workshop on New Challenges for NLP Frameworks, pp. 45–50. ELRA, Valletta, Malta, May 2010

    Google Scholar 

  19. Schmid, H.: Probabilistic part-of-speech tagging using decision trees. In: New Methods in Language Processing, p. 154 (2013)

    Google Scholar 

  20. Socher, R., Perelygin, A., Wu, J.Y., Chuang, J., Manning, C.D., Ng, A.Y., Potts, C., et al.: Recursive deep models for semantic compositionality over a sentiment treebank. In: Proceedings of the EMNLP, vol. 1631, p. 1642 (2013)

    Google Scholar 

  21. Zehe, A.: Sentiment Analysis on German Novels. Master’s thesis (2017)

    Google Scholar 

  22. Zehe, A., Becker, M., Hettinger, L., Hotho, A., Reger, I., Jannidis, F.: Prediction of happy endings in german novels. In: Cellier, P., Charnois, T., Hotho, A., Matwin, S., Moens, M.F., Toussaint, Y. (eds.) DMNLP@PKDD/ECML, pp. 9–16, July 2016

    Google Scholar 

  23. Zhang, Y., Wallace, B.: A sensitivity analysis of (and practitioners’ guide to) convolutional neural networks for sentence classification (2015). arXiv:1510.03820

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Albin Zehe .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Zehe, A., Becker, M., Jannidis, F., Hotho, A. (2017). Towards Sentiment Analysis on German Literature. In: Kern-Isberner, G., Fürnkranz, J., Thimm, M. (eds) KI 2017: Advances in Artificial Intelligence. KI 2017. Lecture Notes in Computer Science(), vol 10505. Springer, Cham. https://doi.org/10.1007/978-3-319-67190-1_36

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-67190-1_36

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-67189-5

  • Online ISBN: 978-3-319-67190-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics