Abstract
Plagiarism and copyright infringement are major problems in academic and corporate environments. Importance of source code authorship attribution arises as it is the starting point of detection for plagiarism, copyright infringement and law suit prosecution etc. There have been many research regard to this topic. Majority of these researches are based on various algorithms which compute similarity amongst source code files. However, for this Paper we have proposed Deep Neural Network (DNN) based technique to be used for source code authorship attribution. Results proved that DNN based author identification brings promising results once compared the accuracy against previously published research.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Frantzeskou, G., Stamatatos, E., Gritzalis, E., Katsikas, S.: Source Code Author Identification Based on N-gram Author Profiles. In: Maglogiannis, I., Karpouzis, K., Bramer, M. (eds.) Artificial Intelligence Applications and Innovations. IFIP, vol. 204, pp. 508–515. Springer, Boston (2006)
Larochelle, H., Bengio, Y., Louradour, J., Lamblin, P.: Exploring Strategies for Training Deep Neural Networks. Journal of Machine Learning Research 10, 1–40 (2009)
Hinton, G.H.: Reducing the dimensionality of data with neural networks. Science, 504–507 (2006)
Lange, R., Spiros, M.: Using code metric histograms and genetic algorithms to perform author identification for software forensics. In: 9th Annual Conference on Genetic and Evolutionary Computation, London, pp. 2082–2089 (2007)
Burrows, S., Tahaghoghi, S.: Source Code Authorship Attribution using N-Grams. In: Wu, A. (ed.) Source Code Authorship Attribution using N-Grams, Melbourne, Melbou Australiarne, pp. 32–39 (2007)
Elenbogen, B., Seliya, N.: Detecting outsourced student programming assignments, pp. 50–57 (January 2008)
Shevertalov, M., Kothari, J., Stehle, E., Mancoridis, S.: On the Use of Discretized Source Code Metrics for Author Identification. In: Proceedings of the 2009 1st International Symposium on Search Based Software Engineering, Washington, DC, USA, pp. 69–78 (2009)
Hinton, G., Deng, L., Yu, D., Dahl, G., Mohamed, A.-R., Jaitly, N., Senior, A., Vanhoucke, V.: Deep Neural Networks for Acoustic Modeling in Speech Recognition. IEEE Signal Processing Magazine, 82–97 (November 2012)
Smolensky, P.: Information processing in dynamical systems: Foundations of harmony theory. In: Parallel Distributed Processing Explorations in the Microstructure of Cognition, pp. 194–281 (1986)
Hinton, G.: To recognize shapes, first learn to generate images. Progress in Brain Research 165(3), 535–547 (2007)
Bergstra, J., Bengio, Y.: Random Search for Hyper-Parameter Optimization. The Journal of Machine Learning Research, 281–305 (2012)
Lange, R., Mancoridis, S.: Using code metric histograms and genetic algorithms to perform author identification for software forensics. In: Proceedings of the 9th Annual Conference on Genetic and Evolutionary Computation, GECCO 2007 (2007)
Bandara, U., Wijayarathna, G.: A machine learning based tool for source code plagiarism detection. International Journal of Machine Learning and Computing 1(4), 337–343 (2011)
Bandara, U., Wijayarathna, G.: Source code author identification with unsupervised feature learning. Pattern Recognition Letters 34(3), 330–334 (2013)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bandara, U., Wijayarathna, G. (2013). Deep Neural Networks for Source Code Author Identification. In: Lee, M., Hirose, A., Hou, ZG., Kil, R.M. (eds) Neural Information Processing. ICONIP 2013. Lecture Notes in Computer Science, vol 8227. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-42042-9_46
Download citation
DOI: https://doi.org/10.1007/978-3-642-42042-9_46
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-42041-2
Online ISBN: 978-3-642-42042-9
eBook Packages: Computer ScienceComputer Science (R0)