skip to main content
10.1145/3209281.3209367acmotherconferencesArticle/Chapter ViewAbstractPublication Pagesdg-oConference Proceedingsconference-collections
research-article

Text mining as a transparency enabler to support decision making in a people management process

Published: 30 May 2018 Publication History

Abstract

This paper discusses a case study that was performed using mining techniques to analyze data pertinent to a people management process of a Federal University. This process consists in observing documents containing data concerning the organizational environment, the duties required by the position, the course program carried out by the employee, and whether they have direct or indirect correlation. Currently, this correlation evaluation is performed subjectively and there are no instruments that can indicate the degree of similarity between the information. We use text mining techniques to automatically identify correlation through textual representation approaches and syntactic and semantic modeling, which retrieve terms and dimension their respective meanings. To obtain the degree of similarity between the respective documents, the measure of the cosines similarity was used. The results showed that the documents evaluated as correlated by the domain expert presented a degree of similarity consistent with the automatic evaluation. For the uncorrelated cases, it was perceived that the degree of high similarity was influenced by the comprehensiveness of the organizational environment common to all documents. After investigation and identification of the appropriate environment specification, the grades obtained represented the evaluation correctly. The proposed approach contributes to the speed of process judgment, as well as to promote formulations of criticism about the content of political qualifications. In addition, it enhanced processes and information transparency by tracking and publicizing all steps. Lastly, we present a simulation for a course recommendation task, considering position profiles and organizational environment.

References

[1]
Marco Baroni, Georgiana Dinu, and Germán Kruszewski. 2014. Don't count, predict! A systematic comparison of context-counting vs. context-predicting semantic vectors. In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Vol. 1. 238--247.
[2]
Marizaura Reis de Souza Organizadora Camões, Maria Júlia Organizadora Pantoja, and Sandro Trescastro Organizador Bergue. 2010. GestÃo de pessoas: bases teóricas e experiências no setor público. (2010).
[3]
AC Costa and CG da Silva Júnior. 2010. Similaridade entre Documentos de EspecificaçÃo de Requisitos de Software Utilizando o Modelo Vetorial de RecuperaçÃo de InformaçÃo. V Connepi-2010 1 (2010). http://connepi.ifal.edu.br/ocs/index.php/connepi/CONNEPI2010/paper/view/1359
[4]
Presidência da República. 2005. Lei no 11.091, de 12 de Janeiro de 2005. Dispõe sobre a estruturaçÃo do Plano de Carreira dos Cargos Técnico-Administrativos em EducaçÃo, no âmbito das Instituições Federais de Ensino vinculadas ao Ministério da EducaçÃo, e dá outras providências. DOU (2005), 1--10. Retrieved January 09, 2018 from http://pesquisa.in.gov.br/imprensa/jsp/visualiza/index.jsp?data=13/01/2005&jornal=1&pagina=1&totalArquivos=72
[5]
Ronen Feldman and James Sanger. 2007. The text mining handbook: advanced approaches in analyzing unstructured data. Cambridge university press.
[6]
Sonali Vijay Gaikwad, Archana Chaugule, and Pramod Patil. 2014. Text mining methods and techniques. International Journal of Computer Applications 85, 17 (2014).
[7]
SALTON Gerard and J McGILL Michael. 1983. Introduction to modern information retrieval. (1983).
[8]
Jiawei Han, Jian Pei, and Micheline Kamber. 2011. Data mining: concepts and techniques. Elsevier.
[9]
Zellig Sabbettai Harris. 1968. Mathematical structures of language. (1968).
[10]
Nathan Hartmann, Erick Fonseca, Christopher Shulby, Marcos Treviso, Jessica Rodrigues, and Sandra Aluisio. 2017. Portuguese Word Embeddings: Evaluating on Word Analogies and Natural Language Tasks. arXiv preprint arXiv:1708.06025 (2017).
[11]
Andreas Hotho, Andreas Nürnberger, and Gerhard Paaß. 2005. A brief survey of text mining. In Ldv Forum, Vol. 20. Citeseer, 19--62.
[12]
Tomas Mikolov, Ilya Sutskever, Kai Chen, Greg S Corrado, and Jeff Dean. 2013. Distributed representations of words and phrases and their compositionality. In Advances in neural information processing systems. 3111--3119.
[13]
Solange O Rezende, Ricardo M Marcacini, and Maria F Moura. 2011. O uso da mineraçÃo de textos para extraçÃo e organizaçÃo nÃo supervisionada de conhecimento. Revista de Sistemas de InformaçÃo da FSMA 7 (2011), 7--21.
[14]
Gerard Salton and Christopher Buckley. 1988. Term-weighting approaches in automatic text retrieval. Information processing & management 24, 5 (1988), 513--523.
[15]
Gerard Salton, Anita Wong, and Chung-Shu Yang. 1975. A vector space model for automatic indexing. Commun. ACM 18, 11 (1975), 613--620.
[16]
Breno Santana Santos. 2015. Análise comparativa de algoritmos de mineraçÃo de texto aplicados a históricos de contas públicas. (2015).
[17]
Elcelina Silva. 2017. AplicaçÃo de text mining na deteçÃo de evidência de fraude em documentos de texto. In Information Systems and Technologies (CISTI), 2017 12th Iberian Conference on. IEEE, 1--6.
[18]
Pascal Soucy and Guy W Mineau. 2005. Beyond TFIDF weighting for text categorization in the vector space model. In IJCAI, Vol. 5. 1130--1135.
[19]
JG de Souza. 2011. Uma aplicaçÃo de mineraçÃo de texto para descoberta de características psicológicas de indivíduos. UFS, Itabaiana (2011).
[20]
José Manuel Ferrández Vicente, José Ramón Álvarez-Sánchez, Félix De la Paz López, Javier Toledo Moreo, and Hojjat Adeli. 2017. Biomedical Applications Based on Natural and Artificial Computing: International Work-Conference on the Interplay Between Natural and Artificial Computation, IWINAC 2017, Corunna, Spain, June 19-23, 2017, Proceedings. Vol. 10338. Springer.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
dg.o '18: Proceedings of the 19th Annual International Conference on Digital Government Research: Governance in the Data Age
May 2018
889 pages
ISBN:9781450365260
DOI:10.1145/3209281
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 30 May 2018

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. governance processes
  2. similarity measure
  3. text minning

Qualifiers

  • Research-article

Funding Sources

  • UNIRIO

Conference

dg.o '18

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 109
    Total Downloads
  • Downloads (Last 12 months)4
  • Downloads (Last 6 weeks)0
Reflects downloads up to 07 Mar 2025

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media