Skip to main content

JEAPC: A Joint Extraction Model of Action Sequence from Chinese Instructions for Home Service Robot

  • Conference paper
  • First Online:
Web Information Systems and Applications (WISA 2024)

Abstract

The language interaction between humans and robots is one of the critical issues in the field of home service robots. In particular, irrelevant information in feature vectors interferes with the extraction task during Chinese instruction parsing. Moreover, the relations between feature vectors of different time steps affect the accuracy of action sequence extraction. In this paper, overlapping action entities in Chinese instructions are labeled through span-based mode, and a Joint Extraction Model of Action Sequences with Partition Coding(JEAPC) is proposed for Chinese instructions. The JEAPC is divided into four modules: BERT, partition encoder, and two decoders. BERT is utilized to obtain the feature vector of each Chinese character in the instructions. The partition encoder is composed of an entity gate, and an action gate, in which the features in the vector are classified, and the irrelevant features are filtered through multi-dimensional vector operations. Furthermore, adversarial training is employed to improve the robustness of JEAPC. Extensive experiments are conducted on a self-built Chinese instructions dataset(FCI) and three entity and relation extraction datasets (CoNLL04, ADE, and SciERC). The experimental results show that the JEAPC can accurately generate action sequences from Chinese instructions and obtain optimal results compared to several competitive approaches.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Martins, P.H., Custódio, L., Ventura, R.: A deep learning approach for understanding natural language commands for mobile service robots. arXiv preprint arXiv:1807.03053 (2018)

  2. Ishikawa, S., Sugiura, K.: Target-dependent UNITER: a transformer-based multimodal language comprehension model for domestic service robots. IEEE Robot. Autom. Lett. 6(4), 8401–8408 (2021)

    Article  Google Scholar 

  3. Chen, H., Tan, H., Kuntz, A., Bansal, M., Alterovitz, R.: Enabling robots to understand incomplete natural language instructions using commonsense reasoning. In: 2020 IEEE International Conference on Robotics and Automation (ICRA), Paris, France, pp. 1963–1969. IEEE (2020)

    Google Scholar 

  4. Li, X., et al.: Entity-relation extraction as multi-turn question answering. In: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, Florence, Italy, pp. 1340–1350. Association for Computational Linguistics (2019)

    Google Scholar 

  5. Eberts, M., Ulges, A.: Span-based joint entity and relation extraction with transformer pre-training. In: Proceedings of the 24th European Conference on Artificial Intelligence (ECAI) - Including 10th Conference on Prestigious Applications of Artificial Intelligence (PAIS), vol. 325, pp. 2006–2013. Santiago de Compostela, Spain (2020)

    Google Scholar 

  6. Zhao, S., Cai, Z., Chen, H., Wang, Y., Liu, F., Liu, A.: Adversarial training based lattice LSTM for Chinese clinical named entity recognition. J. Biomed. Inform. 99, 103290 (2019)

    Google Scholar 

  7. Misra, D.K., Sung, J., Lee, K., Saxena, A.: Tell me dave: context-sensitive grounding of natural language to manipulation instructions. Int. J. Robot. Res. 35(1–3), 281–300 (2016)

    Article  Google Scholar 

  8. Zhang, S., Jiang, J., He, Z., Zhao, X., Fang, J.: A novel slot-gated model combined with a key verb context feature for task request understanding by service robots. IEEE Access 7, 105937–105947 (2019)

    Article  Google Scholar 

  9. Mensio, M., Bastianelli, E., Tiddi, I., Rizzo, G.: Mitigating bias in deep nets with knowledge bases: the case of natural language understanding for robots. In: Proceedings of the AAAI 2020 Spring Symposium on Combining Machine Learning and Knowledge Engineering in Practice (AAAI-MAKE), Palo Alto, CA, USA, vol. 2600, p. 20 (2020)

    Google Scholar 

  10. Sharma, S., Gupta, J., Tuli, S., Paul, R.: Goalnet: inferring conjunctive goal predicates from human plan demonstrations for robot instruction following. arXiv preprint arXiv:2205.07081 (2022)

  11. Zhao, S., Hu, M., Cai, Z., Chen, H., Liu, F.: Dynamic modeling cross-and self-lattice attention network for Chinese NER. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 35, no. 16, pp. 14515–14523. AAAI Press (2021)

    Google Scholar 

  12. Wei, Z., Su, J., Wang, Y., Tian, Y., Chang, Y.: A novel cascade binary tagging framework for relational triple extraction. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 1476–1488. Association for Computational Linguistics, Online (2020)

    Google Scholar 

  13. Wang, Y., Yu, B., Zhang, Y., Liu, T., Zhu, H., Sun, L.: Tplinker: single-stage joint extraction of entities and relations through token pair linking. In: Proceedings of the 28th International Conference on Computational Linguistics (COLING), Barcelona, Spain (Online), pp. 1572–1582. International Committee on Computational Linguistics (2020)

    Google Scholar 

  14. Yan, Z., Zhang, C., Fu, J., Zhang, Q., Wei, Z.: A partition filter network for joint entity and relation extraction. In: Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP), Online and Punta Cana, Dominican Republic, pp. 185–197. Association for Computational Linguistics (2021)

    Google Scholar 

  15. Devlin, J., Chang, M., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT 2019), Volume 1 (Long and Short Papers), Minneapolis, MN, USA, pp. 4171–4186. Association for Computational Linguistics (2019)

    Google Scholar 

  16. Joshi, M., Chen, D., Liu, Y., Weld, D.S., Zettlemoyer, L., Levy, O.: Spanbert: improving pre-training by representing and predicting spans. Trans. Assoc. Comput. Linguist. 8, 64–77 (2020)

    Article  Google Scholar 

  17. Li, Y., Wang, C., Lin, Y., Lin, Y., Chang, L.: Span-based relational graph transformer network for aspect-opinion pair extraction. Knowl. Inf. Syst. 64(5), 1305–1322 (2022)

    Article  Google Scholar 

  18. Li, Z., Song, M., Zhu, Y., Zhang, L.: Chinese nested named entity recognition based on boundary prompt. In: Yuan, L., Yang, S., Li, R., Kanoulas, E., Zhao, X. (eds.) WISA 2023. LNCS, vol. 14094, pp. 331–343. Springer, Singapore (2023). https://doi.org/10.1007/978-981-99-6222-8_28

  19. Nong, W., Zhang, T., Yang, S., Hu, N., He, X.: HfGCN: hierarchical fused GCN for joint entity and relation extraction. In: 2021 IEEE International Conference on Big Knowledge (ICBK), Auckland, New Zealand, pp. 307–314. IEEE (2021)

    Google Scholar 

  20. Tran, T., Kavuluru, R.: Neural metric learning for fast end-to-end relation extraction. arXiv preprint arXiv:1905.07458 (2019)

  21. Luan, Y., Wadden, D., He, L., Shah, A., Ostendorf, M., Hajishirzi, H.: A general framework for information extraction using dynamic span graphs. In: Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), Minneapolis, Minnesota, pp. 3036–3046. Association for Computational Linguistics (2019)

    Google Scholar 

  22. Miyato, T., Dai, A.M., Goodfellow, I.J.: Adversarial training methods for semi-supervised text classification. In: 5th International Conference on Learning Representations (ICLR), Toulon, France. OpenReview.net (2017)

    Google Scholar 

  23. Roth, D., Yih, W.: A linear programming formulation for global inference in natural language tasks. In: Proceedings of the Eighth Conference on Computational Natural Language Learning (CoNLL-2004) at HLT-NAACL 2004, Boston, Massachusetts, USA, pp. 1–8. Association for Computational Linguistics (2004)

    Google Scholar 

  24. Gurulingappa, H., Rajput, A.M., Roberts, A., Fluck, J., Hofmann-Apitius, M., Toldo, L.: Development of a benchmark corpus to support the automatic extraction of drug-related adverse effects from medical case reports. J. Biomed. Inform. 45(5), 885–892 (2012)

    Article  Google Scholar 

  25. Luan, Y., He, L., Ostendorf, M., Hajishirzi, H.: Multi-task identification of entities, relations, and coreference for scientific knowledge graph construction. In: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP), Brussels, Belgium, pp. 3219–3232. Association for Computational Linguistics (2018)

    Google Scholar 

Download references

Acknowledgement

This work was supported in part by the Natural Science Foundation of Xinjiang Uygur Autonomous Region, China Grant No. 2022D01A59, National Natural Science Foundation of China under Grand No. U20A20167, Key Research Foundation of Integration of Industry and Education and the Development of New Business Studies Research Center, Innovation Capability Improvement Plan Project of Hebei Province under Grand No. 22567637H, Hebei Province Central Leading Local Science and Technology Development Project, 246Z1817G.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Fenda Zhao .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Wang, B., Wang, H., Li, X., Zhao, F. (2024). JEAPC: A Joint Extraction Model of Action Sequence from Chinese Instructions for Home Service Robot. In: Jin, C., Yang, S., Shang, X., Wang, H., Zhang, Y. (eds) Web Information Systems and Applications. WISA 2024. Lecture Notes in Computer Science, vol 14883. Springer, Singapore. https://doi.org/10.1007/978-981-97-7707-5_44

Download citation

  • DOI: https://doi.org/10.1007/978-981-97-7707-5_44

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-97-7706-8

  • Online ISBN: 978-981-97-7707-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics