Abstract
This paper presents the work of building a knowledge base for the domain of economic mobility for older workers. To extract high-quality entities and relations that are important to the specific domain, domain specificity scores for entities and relations are designed and applied. To assist human-in-the-loop ontology construction, a novel topic modeling method, named “description guided topic modeling”, is developed. It clusters domain entities based on their embedding and organizes those clusters according to descriptions of potential topics important to the domain. To demonstrate feasibility, these methods are applied to a collection of knowledge sources related to economic mobility for older workers. These methods are further tested through a case study on one specific barrier for economic mobility, i.e., limited broadband access for older workers, to show the potential of these methods.
Funding for this research was partially provided by CWI Labs, a wholly-owned subsidiary of the Center for Workforce Inclusion, a national nonprofit organization.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Policy brief: the impact of covid-19 on older persons (2020). unsdg.un.org/sites/default/files/2020-05/Policy-Brief-The-Impact-of-COVID-19-on-Older-Persons.pdf
Akinola, S.: Covid-19 has worsened ageism. here’s how to help older adults thrive. (2020). www.weforum.org/agenda/2020/10/covid-19-has-worsened-ageism-here-s-how-to-help-older-adults-thrive//
Alexopoulos, P.: Building a large knowledge graph for the recruitment domain with text kernel’s ontology. www.textkernel.com/newsroom/building-a-large-knowledge-graph-for-the-recruitment-domain-with-textkernels-ontology/
Bojanowski, P., Grave, E., Joulin, A., Mikolov, T.: Enriching word vectors with subword information. arXiv preprint arXiv:1607.04606 (2016)
Deane, P.: A nonparametric method for extraction of candidate phrasal terms. In: Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL’05), pp. 605–613. Association for Computational Linguistics, Ann Arbor, Michigan. 1219840.1219915 (2005). www.aclweb.org/anthology/P05-1075
Doan, A.: Human-in-the-loop data analysis: a personal perspective, pp. 1–6, 3209900.3209913 (2018)
Ehrlinger, L., Woss, W.: Towards a definition of knowledge graphs. In: SEMANTiCS (2016)
Goger, A.: For millions of low-income seniors, coronavirus is a food-security issue (2020). www.brookings.edu/blog/the-avenue/2020/03/16/for-millions-of-low-income-seniors-coronavirus-is-a-food-security-issue/
Gruber, T.: Toward principles for the design of ontologies used for knowledge sharing. Int. J. Hum. Comput. Stud. 43, 907–928 (1995)
Hall, D., Jurafsky, D., Manning, C.D.: Studying the history of ideas using topic models. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing. EMNLP ’08, Association for Computational Linguistics, pp. 363–371. USA (2008)
Kamdar, M.R., Hamamsy, T., Shelton, S., Vala, A., Eftimov, T., Zou, J., Tamang, S.: A knowledge graph-based approach for exploring the u.s. opioid epidemic (2019). arxiv.org/abs/1905.11513
Kejriwal, M., Szekely, P.: Knowledge graphs for social good: an entity-centric search engine for the human trafficking domain. IEEE Tran. Big Data, 1 (2017). TBDATA.2017.2763164
Kejriwal, M., Shao, R., Szekely, P.: Expert-guided entity extraction using expressive rules. pp. 1353–1356(072019). 3331184.3331392
Kim, S.N., Baldwin, T., Kan, M.Y.: Extracting domain-specific words - a statistical approach. In: Proceedings of the Australasian Language Technology Association Workshop 2009, pp. 94–98. Sydney, Australia (2009). www.aclweb.org/anthology/U09-1013
Larocca Neto, J., Alexandre, N., Santos, D., Kaestner, C., Freitas, A.: Document clustering and text summarization. In: Proceedings of 4th International Conference Practical Applications of Knowledge Discovery and Data Mining (PADD-2000), pp. 41–55. London (2000)
Meng, Y., Huang, J., Wang, G., Wang, Z., Zhang, C., Zhang, Y., Han, J.: Discriminative topic mining via category-name guided text embedding. In: Proceedings of The Web Conference 2020. WWW’20, Association for Computing Machinery, pp. 2121–2132. New York, NY, USA. 3366423.3380278 (2020)
Mikolov, T., Chen, K., Corrado, G.S., Dean, J.: Efficient estimation of word representations in vector space (2013). arXiv:1301.3781
Mintz, M., Bills, S., Snow, R., Jurafsky, D.: Distant supervision for relation extraction without labeled data. In: Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP. Association for Computational Linguistics, pp. 1003–1011. Suntec, Singapore (2009). www.aclweb.org/anthology/P09-1113
Noy, N., Mcguinness, D.: Ontology development 101: a guide to creating your first ontology. Knowl. Syst. Laboratory 32 (2001)
Paulheim, H.: Knowledge graph refinement: a survey of approaches and evaluation methods. Semantic Web 8, 489–508 (2016)
Pennington, J., Socher, R., Manning, C.D.: Glove: Global vectors for word representation. In: Empirical Methods in Natural Language Processing (EMNLP), pp. 1532–1543 (2014). www.aclweb.org/anthology/D14-1162
Sahu, S.K., Christopoulou, F., Miwa, M., Ananiadou, S.: Inter-sentence relation extraction with document-level graph convolutional neural network. In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, pp. 4309–4316. Florence, Italy, P19–1423 (2019). www.aclweb.org/anthology/P19-1423
Shang, J., Liu, J., Jiang, M., Ren, X., Voss, C.R., Han, J.: Automated phrase mining from massive text corpora. CoRR arXiv:1702.04457 (2017). arXiv:1702.04457
CWI Labs, Giving Tech Labs, and X4Impact. Limited broadband access as a barrier to economic mobility of older americans knowledge extraction and data analysis (2021). giving.tech/wp-content/uploads/2021/01/CWI-Labs-Giving-Tech-Labs-X4Impact-White-Paper-Limited-Broadband-Access-as-a-Barrier-to-Economic-Mobility-of-Older-Americans.pdf
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 Springer Nature Switzerland AG
About this paper
Cite this paper
Li, Y., Zakhozhyi, V., Fu, Y., He-Yueya, J., Pardeshi, V., Salazar, L.J. (2022). Building Knowledge Base for the Domain of Economic Mobility of Older Workers. In: Nicosia, G., et al. Machine Learning, Optimization, and Data Science. LOD 2021. Lecture Notes in Computer Science(), vol 13164. Springer, Cham. https://doi.org/10.1007/978-3-030-95470-3_19
Download citation
DOI: https://doi.org/10.1007/978-3-030-95470-3_19
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-95469-7
Online ISBN: 978-3-030-95470-3
eBook Packages: Computer ScienceComputer Science (R0)