Skip to main content
Log in

Distributed matroid-constrained submodular maximization for multi-robot exploration: theory and practice

  • Published:
Autonomous Robots Aims and scope Submit manuscript

Abstract

This work addresses the problem of efficient online exploration and mapping using multi-robot teams via a new distributed algorithm for multi-robot exploration, distributed sequential greedy assignment (DSGA), which is based on sequential greedy assignment (SGA). While SGA permits bounds on suboptimality, robots must execute planning steps sequentially. Rather than plan for each robot sequentially as in SGA, DSGA assigns plans to subsets of robots using a fixed number of sequential planning rounds. DSGA retains the same suboptimality bounds as SGA with the addition of a term that describes the additional suboptimality incurred when assigning multiple plans at once. We use this result to extend a single-robot planner based on Monte-Carlo tree search to the multi-robot domain and evaluate the resulting planner in simulated exploration of a confined and cluttered environment. The experimental results show that for teams of 4–32 robots suboptimality due to redundant sensor information introduced in the distributed planning rounds remains small in practice given only two or three distributed planning rounds while providing a 2–8 times speedup over SGA. We also incorporate aerial robots with inter-robot collision constraints and non-trivial dynamics and address subsequent impacts on safety and optimality. Real-time simulation and experimental results for teams of quadrotors demonstrate online planning for multi-robot exploration and indicate that collision constraints have limited impacts on exploration performance.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

Notes

  1. Although this paper addresses multi-robot exploration, this result applies generally to informative planning problems and general monotone submodular maximization with partition matroid constraints (aside from notation and problem specialization).

  2. A previous version of this paper (Corah and Michael 2017) included a corollary to Theorem 2 that mistakenly applied submodularity to \(\psi \) in an attempt to provide bounds that could be evaluated at different stages of the planning process. Although this bound does not hold, we note that none of the rest of this work depends on this result. However, unlike as for mutual information, some common submodular functions such as set and area coverage have higher-order properties that are related to submodularity and are amenable to similar bounds. We investigate related bounds and algorithms in a following work (Corah and Michael 2018).

  3. In the implementation, robots are required to come to a stop and trajectories are then temporally extended as necessary. Although these conditions can be relaxed, such as to invariant sets, there do not appear to be clear or significant benefits from doing so.

References

  • Atanasov, N.A., Le Ny, J., Daniilidis, K., & Pappas, G.J. (2015). Decentralized active information acquisition: Theory and application to multi-robot SLAM. In Proceedings of the IEEE international conference on robotics and automation, Seattle, WA.

  • Barbosa, RdP., Ene, A., Nguyen, H.L., & Ward, J. (2016). A new framework for distributed submodular maximization. In Proceedings of the IEEE annual symposium on foundations of computer science, New Brunswick, NJ.

  • Best, G., Cliff, O. M., Patten, T., Mettu, R. R., & Fitch, R. (2016). Decentralised Monte Carlo tree search for active perception. In Algorithmic foundation robotics. San Francisco, CA

  • Browne, C., Powley, E., Whitehouse, D., Lucas, S., Cowling, P. I., Rohlfshagen, P., et al. (2012). A survey of Monte Carlo tree search methods. IEEE Transactions on Computational Intelligence and AI in Games, 4(1), 1–43.

    Article  Google Scholar 

  • Calinescu, G., Chekuri, C., Pal, M., & Vondrák, J. (2011). Maximizing a monotone submodular function subject to a matroid constraint. SIAM Journal on Computing, 40(6), 1740–1766.

    Article  MathSciNet  MATH  Google Scholar 

  • Charrow, B. (2015). Information-theoretic active perception for multi-robot teams. Ph.D. thesis, University of Pennsylvania.

  • Charrow, B., Kumar, V., & Michael, N. (2014). Approximate representations for multi-robot control policies that maximize mutual information. Autonomous Robots, 37(4), 383–400.

    Article  Google Scholar 

  • Charrow, B., Kahn, G., Patil, S., Liu, S., Goldberg, K., Abbeel, P., Michael, N., & Kumar, V. (2015a). Information-theoretic planning with trajectory optimization for dense 3D mapping. In Proceedings of robotics: science and systems, Rome, Italy.

  • Charrow, B., Liu, S., Kumar, V., & Michael, N. (2015b). Information-theoretic mapping using Cauchy-Schwarz quadratic mutual information. In Proceedings 1990 IEEE international conference on robotics and automation, Seattle, WA.

  • Chaslot, G. (2010). Monte-Carlo tree search. Ph.D. thesis, Universiteit Maastricht.

  • Chekuri, C., & Martin, P. (2005). A recursive greedy algorithm for walks in directed graphs. In Proceedings of the IEEE annual symposium on foundations of computer science, pp 245–253.

  • Choi, H. L., Brunet, L., & How, J. P. (2009). Consensus-based decentralized auctions for robust task allocation. IEEE Transactions on Robotics, 25(4), 912–926.

    Article  Google Scholar 

  • Corah, M., & Michael, N. (2017). Efficient online multi-robot exploration via distributed sequential greedy assignment. In Proceedings of robotics: science and system, Cambridge, MA.

  • Corah, M., & Michael, N. (2018). Distributed submodular maximization on partition matroids for planning on large sensor networks. In Proceedings of the IEEE conference on decision and control. Miami, FL (submitted for publication).

  • Cover, T. M., & Thomas, J. A. (2012). Elements of information theory. New York, NY: Wiley.

    MATH  Google Scholar 

  • Elfes, A. (1989). Using occupancy grids for mobile robot perception and navigation. IEEE Computer Society, 22(6), 46–57.

    Article  Google Scholar 

  • Filmus, Y., & Ward, J. (2012). A tight combinatorial algorithm for submodular maximization subject to a matroid constraint. In Proceedings of the IEEE annual symposium on foundations of computer science, New Brunswick, NJ.

  • Gharan, S.O., & Vondrák, J. (2011). Submodular maximization by simulated annealing. In Proceedings of the symposium on discrete algorithms, Philadelphia, PA.

  • Gharesifard, B., & Smith, S.L. (2017). Distributed submodular maximization with limited information. IEEE Transactions on Control of Network Systems. https://doi.org/10.1109/TCNS.2017.2740625.

  • Goundan, P. R., & Schulz, A. S. (2007). Revisiting the greedy approach to submodular set function maximization. Optim Online, 1984, 1–25.

    Google Scholar 

  • Grimsman, D., Ali, M.S., Hespanha, P., & Marden, J.R. (2017). Impact of information in greedy submodular maximization. In Proceedings of the IEEE conference on decision and control, Melbourne, Australia.

  • Jadidi, M.G., Miro, J.V., & Dissanayake, G. (2015). Mutual information-based exploration on continuous occupancy maps. In Proceedings of the IEEE/RSJ international conference on intelligent robots and systems, Hamburg, Germany.

  • Jorgensen, S., Chen, R.H., Milam, M.B., & Pavone, M. (2017). The matroid team surviving orienteers problem: Constrained routing of heterogeneous teams with risky traversal. In Proceedings of the IEEE/RSJ international conference on intelligent robots and systems, Vancouver, Canada.

  • Julian, B. J., Karaman, S., & Rus, D. (2014). On mutual information-based control of range sensing robots for mapping applications. The International Journal of Robotics Research, 33(10), 1357–1392.

    Article  Google Scholar 

  • Krause, A., & Guestrin, C.E. (2005). Near-optimal nonmyopic value of information in graphical models. In Proceedings of the conference on uncertainty in artificial intelligence, Edinburgh, Scotland.

  • Krause, A., Singh, A., & Guestrin, C. (2008). Near-optimal sensor placements in Gaussian processes: Theory, efficient algorithms and empirical studies. Journal of Machine Learning Research, 9, 235–284.

    MATH  Google Scholar 

  • Ladner, R. E., & Fischer, M. J. (1980). Parallel prefix computation. Journal of the ACM (JACM), 27(4), 831–838.

    Article  MathSciNet  MATH  Google Scholar 

  • Lauri, M., & Ritala, R. (2016). Planning for robotic exploration based on forward simulation. Robotics and Autonomous Systems, 83, 15–31.

    Article  Google Scholar 

  • Mahony, R., Kumar, V., & Corke, P. (2012). Multirotor aerial vehicles: Modeling, estimation, and control of quadrotor. IEEE Robotics and Automation Magazine, 19(3), 20–32.

    Article  Google Scholar 

  • Mirzasoleiman, B., Karbasi, A., Sarkar, R., & Krause, A. (2013). Distributed submodular maximization: Identifying representative elements in massive data. In C. J. C. Burges, L. Bottou, M. Welling, Z. Ghahramani & K. Q. Weinberger (Eds.), Procdeedings of the advances in neural information processing systems (Vol. 26, pp. 2049–2057). Stateline, Nevada: Curran Associates, Inc. http://papers.nips.cc/HrBpaper/5039-distributed-submodular-maximization-identifying-reHrBpresentative-elements-in-massivedata.pdf.

  • Nelson, E., & Michael, N. (2015). Information-theoretic occupancy grid compression for high-speed information-based exploration. In Proceedings of the IEEE/RSJ international conference on intelligent robots and systems, Hamburg, Germany.

  • Nemhauser, G. L., & Wolsey, L. A. (1978). Best algorithms for approximating the maximum of a submodular set function. Mathematics of Operations Research, 3(3), 177–188.

    Article  MathSciNet  MATH  Google Scholar 

  • Nemhauser, G. L., Wolsey, L. A., & Fisher, M. L. (1978a). An analysis of approximations for maximizing submodular set functions-I. Mathematics Program, 14(1), 265–294.

    Article  MathSciNet  MATH  Google Scholar 

  • Nemhauser, G. L., Wolsey, L. A., & Fisher, M. L. (1978b). An analysis of approximations for maximizing submodular set functions-II. Polyhedral Combinatorics, 8, 73–87.

    Article  MathSciNet  MATH  Google Scholar 

  • Patten, T. (2017). Active object classification from 3D range data with mobile robots. Ph.D. thesis, The University of Sydney.

  • Quigley, M., Gerkey, B., Conley, K., Faust, J., Foote, T., Leibs, J., et al. (2009). ROS: An open-source robot operating system. In ICRA workshop on open source software, Kobe, Japan

  • Rawlings, J. B., & Muske, K. R. (1993). The stability of constrained receding horizon control. IEEE Transactions on Automatic Control, 38(10), 1512–1516.

    Article  MathSciNet  MATH  Google Scholar 

  • Regev, T., & Indelman, V. (2016). Multi-robot decentralized belief space planning in unknown environments via efficient re-evaluation of impacted paths. In Proceedings of the IEEE/RSJ international conference on intelligent robots and systems, Daejeon, Korea.

  • Singh, A., Krause, A., Guestrin, C., & Kaiser, W. J. (2009). Efficient informative sensing using multiple robots. Journal of Artificial Intelligence Research, 34, 707–755.

    Article  MathSciNet  MATH  Google Scholar 

  • Tabib, W., Corah, M., Michael, N., & Whittaker, R. (2016). Computationally efficient information-theoretic exploration of pits and caves. In Proceedings of the IEEE/RSJ international conference on intelligent robots and systems, Daejeon, Korea.

  • Williams, J.L. (2007). Information theoretic sensor management. Ph.D. thesis, Massachusetts Institute of Technology.

  • Williams, R.K., Gasparri, A., & Ulivi, G. (2017). Decentralized matroid optimization for topology constraints in multi-robot allocation problems. In Proceedings of the IEEE international conference on robotics and automation, Singapore.

  • Yamauchi, B. (1997). A frontier-based approach for autonomous exploration. In Proceedings of the international symposium on computer intelligence in robotics and automation, Monterey, CA.

  • Zhou, T., Ouyang, H., Chang, Y., Bilmes, J., & Guestrin, C. (2017). Scaling submodular maximization via pruned submodularity graphs. Proceedings of Machine Learning Research, 54, 316–324.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Micah Corah.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

The authors gratefully acknowledge the support of ARL Grant W911NF-08-2-0004.

This is one of several papers published in Autonomous Robots comprising the “Special Issue on Robotics Science and Systems”.

Appendices

Appendix A: Proof of Theorem 2

Proof

The proof of the suboptimality bound relating DSGA to SGA incorporates suboptimality of the single-robot planner and is similar to (Singh et al. 2009) or (Atanasov et al. 2015). We obtain the following by monotonicity and by rearranging the resulting telescoping sum

$$\begin{aligned} I(M;Y^*)&\le I(M;Y^*) + \sum _{i=1}^{n_r} I\left( M;Y^d_{i}|Y^d_{1:i-1}, Y^*_{i+1:n_r}\right) \nonumber \\&= I(M;Y^d) + \sum _{i=1}^{n_r} I\left( M;Y^*_{i}|Y^d_{1:i-1}, Y^*_{i+1:n_r}\right) . \end{aligned}$$
(25)

By submodularity

$$\begin{aligned} I\left( M;Y^*_{i}|Y^d_{1:i-1}, Y^*_{i+1:n_r}\right) \le I\left( M;Y^*_{i} | Y^d_{1:i-1}\right) . \end{aligned}$$

Without loss of generality, assume that agent indices correspond to the selection order and rewrite in terms of the planning rounds such that \( I(M;Y^*_{i} | Y^d_{1:i-1}) = I(M;Y^*_{D_{j,k}}|Y^d_{D_{j,1:k-1}\cup F_{j-1}}) \) and note that that although \(Y^*\) is formally a set, the mapping from elements to robots can be obtained by the intersections \(Y^*\cap \mathcal {Y}_i\) given that the sets \(\mathcal {Y}_i\) are disjoint. Then, by submodularity,

$$\begin{aligned} I(M;Y^*_{i} | Y^d_{1:i-1}) \le I\left( M;Y^*_{D_{j,k}}|Y^d_{F_{j-1}}\right) . \end{aligned}$$

By (17) and the greedy maximization step in Algorithm 1

$$\begin{aligned} I\left( M;Y^*_{D_{i,j}}|Y^d_{F_{i-1}}\right) \le \eta I\left( M;Y^d_{D_{i,j}}|Y^d_{F_{i-1}}\right) . \end{aligned}$$
(26)

Substitute (26) and preceding inequalities into (25) to obtain

$$\begin{aligned} I(M;Y^*) \le I(M;Y^d) + \eta \sum _{i=1}^{n_d} \sum _{j=1}^{|D_i|} I\left( M;Y^d_{D_{i,j}}|Y^d_{F_{i-1}}\right) . \end{aligned}$$
(27)

The distributed objective can be rewritten as a sum so that \( I(M;Y^d)= \sum _{i=1}^{n_d} \sum _{j=1}^{|D_i|} I(M;Y^d_{D_{i,j}}|Y^d_{D_{i,1:j-1}\cup F_{i-1}}) \) and substituted into (27) to obtain

$$\begin{aligned} I(M;Y^*)\le & {} (1+\eta )I(M;Y^d) + \eta \sum _{i=1}^{n_d} \sum _{j=1}^{|D_i|} \left( I\left( M;Y^d_{D_{i,j}}|Y^d_{F_{i-1}}\right) \right. \nonumber \\&\left. -I\left( M;Y^d_{D_{i,j}}|Y^d_{D_{i,1:j-1}\cup F_{i-1}}\right) \right) \end{aligned}$$
(28)

which expresses the suboptimality in terms of decreases in the conditional mutual information from when results are obtained from the planner to when they are assigned. By rewriting mutual information in terms of entropies we can rearrange to obtain the following

$$\begin{aligned} I(M;Y_1) - I(M;Y_1|Y_2) = I(Y_1;Y_2) - I(Y_1;Y_2|M). \end{aligned}$$

If \(Y_1\) and \(Y_2\) are conditionally independent given M, then the mutual information, \(I(Y_1;Y_2|M) = 0\) and so \( I(M;Y_1) - I(M;Y_1|Y_2) = I(Y_1;Y_2) \). By substitution into (28) we can obtain the slightly more concise and final expression for the suboptimality in terms of the mutual information between observations

$$\begin{aligned} I(M;Y^*)&\le (1+\eta )I(M;Y^d) \nonumber \\&\quad + \eta \sum _{i=1}^{n_d} \sum _{j=1}^{|D_i|} I(Y^d_{D_{i,j}};Y^d_{D_{i,1:j-1}}|Y^d_{F_{i-1}}). \end{aligned}$$
(29)

\(\square \)

Appendix B: Proof of Theorem 3

Proof

Equation (21) follows from Theorem 1 by Grimsman et al. (2017) which proves a \(k+1\) bound where k is the clique cover number of a directed acyclic graph associated with the planner structure. In this directed graph, each robot represents a vertex, and the graph has a directed edge (ab) between robots \(a,b\in \mathcal {R}\) if b maximizes its objective (15) conditional on the sequence of observations selected by a. Here, a clique, which is a complete subgraph, is analogous to a set of robots that plan sequentially given the choices by all prior robots in the clique. For Algorithm 1, any set of robots \(A \subseteq \mathcal {R}\) with at most one robot from each planning round (\(|A\cap D_i| \le 1\) for \(i \in \{1,\ldots ,n_d\}\)) forms a clique in the associated directed graph. A clique cover of size \(\lceil n_r/n_d \rceil =\max _{i\in \{1,\ldots ,n_d\}} |D_i|\) can be obtained by selecting cliques with a single robot (as available) from each planning round (\(D_1,\ldots ,D_{n_d}\)) without replacement. Then, (21) follows by substitution of (17) to obtain a factor of \(\eta \). \(\square \)

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Corah, M., Michael, N. Distributed matroid-constrained submodular maximization for multi-robot exploration: theory and practice. Auton Robot 43, 485–501 (2019). https://doi.org/10.1007/s10514-018-9778-6

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10514-018-9778-6

Keywords

Navigation