JEAPC: A Joint Extraction Model of Action Sequence from Chinese Instructions for Home Service Robot

Wang, Bin; Wang, Haoyu; Li, Xianshan; Zhao, Fenda

doi:10.1007/978-981-97-7707-5_44

ORCID: Xianshan Li, orcid.org/0000-0003-0101-3973; Fenda Zhao, orcid.org/0000-0001-9085-9633

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 14883)

Included in the following conference series:

International Conference on Web Information Systems and Applications

Abstract

Language interaction between humans and robots is one of the critical issues in the field of home service robots. In particular, irrelevant information in feature vectors interferes with the extraction task during Chinese instruction parsing. Moreover, the relations between feature vectors at different time steps affect the accuracy of action sequence extraction. In this paper, overlapping action entities in Chinese instructions are labeled with a span-based scheme, and a Joint Extraction Model of Action Sequences with Partition Coding (JEAPC) is proposed for Chinese instructions. JEAPC consists of four modules: BERT, a partition encoder, and two decoders. BERT is used to obtain the feature vector of each Chinese character in an instruction. The partition encoder is composed of an entity gate and an action gate, which classify the features in each vector and filter out irrelevant features through multi-dimensional vector operations. Furthermore, adversarial training is employed to improve the robustness of JEAPC. Extensive experiments are conducted on a self-built Chinese instructions dataset (FCI) and three entity and relation extraction datasets (CoNLL04, ADE, and SciERC). The experimental results show that JEAPC can accurately generate action sequences from Chinese instructions and achieves the best results among several competitive approaches.
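
The span-based labeling mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the instruction, the label names (`LOCATION`, `OBJECT`), the `max_len` limit, and the helper functions `enumerate_spans` and `label_spans` are all hypothetical, chosen only to show how enumerating character spans lets two overlapping entities each carry their own label, which a single tag per character could not express.

```python
from typing import Dict, List, Tuple

# (start, end) over characters, end exclusive
Span = Tuple[int, int]


def enumerate_spans(text: str, max_len: int) -> List[Span]:
    """Return every character span of length 1..max_len."""
    spans = []
    for start in range(len(text)):
        last_end = min(start + max_len, len(text))
        for end in range(start + 1, last_end + 1):
            spans.append((start, end))
    return spans


def label_spans(text: str, gold: Dict[Span, str], max_len: int) -> Dict[Span, str]:
    """Give each candidate span its gold label, or "O" if it is not an entity."""
    return {span: gold.get(span, "O") for span in enumerate_spans(text, max_len)}


# Hypothetical instruction and labels, for illustration only.
instruction = "拿桌上的杯子"
gold = {
    (1, 3): "LOCATION",  # 桌上
    (1, 6): "OBJECT",    # 桌上的杯子, overlaps the span above
}

labels = label_spans(instruction, gold, max_len=5)
entities = {instruction[s:e]: tag for (s, e), tag in labels.items() if tag != "O"}
print(entities)
```

Because every candidate span is classified independently, the nested span 桌上 and the longer span 桌上的杯子 are both retained. The cost is that the number of candidates grows with the instruction length times `max_len`, which is why a length cap is assumed here.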

Acknowledgement

This work was supported in part by the Natural Science Foundation of Xinjiang Uygur Autonomous Region, China under Grant No. 2022D01A59, the National Natural Science Foundation of China under Grant No. U20A20167, the Key Research Foundation of Integration of Industry and Education and the Development of New Business Studies Research Center, the Innovation Capability Improvement Plan Project of Hebei Province under Grant No. 22567637H, and the Hebei Province Central Leading Local Science and Technology Development Project under Grant No. 246Z1817G.

Author information

Authors and Affiliations

School of Information Science and Engineering, Yanshan University, Qinhuangdao, 066004, China
Haoyu Wang, Xianshan Li & Fenda Zhao
School of Information Science and Engineering, Xinjiang University of Science and Technology, Korla, 841000, China
Fenda Zhao
Key Laboratory for Software Engineering of Hebei Province, Yanshan University, Qinhuangdao, 066004, China
Xianshan Li & Fenda Zhao
Hebei Telecom Co., Ltd., Shijiazhuang, 050035, China
Bin Wang

Corresponding author

Correspondence to Fenda Zhao.

Editor information

Editors and Affiliations

East China Normal University, Shanghai, China
Cheqing Jin
Guangzhou University, Guangzhou, China
Shiyu Yang
Northwestern Polytechnical University, Xi'an Shaanxi, China
Xuequn Shang
Tongji University, Shanghai, China
Haofen Wang
Tsinghua University, Beijing, China
Yong Zhang

About this paper

Cite this paper

Wang, B., Wang, H., Li, X., Zhao, F. (2024). JEAPC: A Joint Extraction Model of Action Sequence from Chinese Instructions for Home Service Robot. In: Jin, C., Yang, S., Shang, X., Wang, H., Zhang, Y. (eds) Web Information Systems and Applications. WISA 2024. Lecture Notes in Computer Science, vol 14883. Springer, Singapore. https://doi.org/10.1007/978-981-97-7707-5_44

DOI: https://doi.org/10.1007/978-981-97-7707-5_44
Published: 11 September 2024
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-7706-8
Online ISBN: 978-981-97-7707-5
eBook Packages: Computer Science, Computer Science (R0)
