Conferences >2024 5th International Confer...

Dual-Agent Multi-Hop Reasoning Based on Reward Shaping and Action Dropout

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

The dual-agent multi-hop reasoning method addresses the issue of target entity omission in single-agent systems caused by increased reasoning path length. However, this m...Show More

Metadata

Abstract:

The dual-agent multi-hop reasoning method addresses the issue of target entity omission in single-agent systems caused by increased reasoning path length. However, this method still has the following challenges: First, even if the agent finds the correct answer within the knowledge graph, it may misclassify it as incorrect due to training set limitations, known as the false negative target problem. Second, agents may be misled by erroneous paths, leading to the creation of false positive paths. To tackle these issues, this paper proposes a dual-agent multi-hop reasoning method based on Reward Shaping and Action Dropout (RADAM). This approach improves the model in two ways: (1) by introducing an embedding-based single-hop reasoning model, TransE, to optimize the reward function and reduce false negatives; and (2) by incorporating a random masking mechanism to diminish the agent's sensitivity to spurious paths, thereby reducing false positive paths. Experimental results demonstrate that RADAM achieves more accurate and efficient answer retrieval across most benchmark datasets compared to baseline algorithms, with ablation experiments further confirming the effectiveness and synergy of reward shaping and action dropout.

Published in: 2024 5th International Conference on Machine Learning and Computer Application (ICMLCA)

Date of Conference: 18-20 October 2024

Date Added to IEEE Xplore: 21 November 2024

ISBN Information:

DOI: 10.1109/ICMLCA63499.2024.10754169

Conference Location: Hangzhou, China

Contents

References is not available for this document.

Dual-Agent Multi-Hop Reasoning Based on Reward Shaping and Action Dropout

Abstract:

Metadata

Abstract:

References

IEEE Account

Purchase Details

Profile Information

Need Help?

Dual-Agent Multi-Hop Reasoning Based on Reward Shaping and Action Dropout

Alerts

Abstract:

Metadata

Abstract:

References

IEEE Account

Purchase Details

Profile Information

Need Help?