Discovering dispatching rules from data using imitation learning: A case study for the job-shop problem

Ingimundardottir, Helga; Runarsson, Thomas Philip

doi:10.1007/s10951-017-0534-0

Discovering dispatching rules from data using imitation learning: A case study for the job-shop problem

Published: 13 June 2017

Volume 21, pages 413–428, (2018)
Cite this article

Journal of Scheduling Aims and scope Submit manuscript

1227 Accesses
Explore all metrics

Abstract

Dispatching rules can be automatically generated from scheduling data. This paper will demonstrate that the key to learning an effective dispatching rule is through the careful construction of the training data, $\{\mathbf {x}_i(k),y_i(k)\}_{k=1}^K\in {\mathscr {D}}$, where (i) features of partially constructed schedules $\mathbf {x}_i$ should necessarily reflect the induced data distribution ${\mathscr {D}}$ for when the rule is applied. This is achieved by updating the learned model in an active imitation learning fashion; (ii) $y_i$ is labelled optimally using a MIP solver; and (iii) data need to be balanced, as the set is unbalanced with respect to the dispatching step k. Using the guidelines set by our framework the design of custom dispatching rules, for a particular scheduling application, will become more effective. In the study presented three different distributions of the job-shop will be considered. The machine learning approach considered is based on preference learning, i.e. which dispatch (post-decision state) is preferable to another.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Integration of Dispatch Rules for JSSP: A Learning Approach

Experimental Proposal with Mallows Distribution Applied to the Mixed No-Idle Permutation Flowshop Scheduling Problem

Real-world flexible resource profile scheduling with multiple criteria: learning scalarization functions for MIP and heuristic approaches

Article 03 July 2017

Discover the latest articles and news from researchers in related subjects, suggested using machine learning.

Notes

Dispatch and time step are used interchangeably.
There can be several optimal solutions available for each problem instance. However, it is deemed sufficient to inspect only one optimal trajectory per problem instance as there are $N_{\text {train}}=300$ independent instances, which give the training data variety.
$\beta _0=1$ and $\beta _i=0,\forall i>0$.

References

Andresen, M., Engelhardt, F., & Werner, F. (2010). LiSA—A Library of Scheduling Algorithms (version 3.0) [software]. http://www.math.ovgu.de/Lisa.html.
Burke, E. K., Gendreau, M., Hyde, M., Kendall, G., Ochoa, G., Ozcan, E., et al. (2013). Hyper-heuristics: A survey of the state of the art. Journal of the Operational Research Society, 64(12), 1695–1724.
Article Google Scholar
Burke, E., Petrovic, S., & Qu, R. (2006). Case-based heuristic selection for timetabling problems. Journal of Scheduling, 9, 115–132.
Article Google Scholar
Cesa-Bianchi, N., & Lugosi, G. (2006). Prediction, learning, and games, Chap. 4. Cambridge: Cambridge University Press.
Chang, K., Krishnamurthy, A., Agarwal, A., III, Daume, H., & Langford, J. (2015). Learning to search better than your teacher. In Proceedings of the 32nd international conference on machine learning, pp. 2058–2066.
Chen, T., Rajendran, C., & Wu, C. W. (2013). Advanced dispatching rules for large-scale manufacturing systems. The International Journal of Advanced Manufacturing Technology, 67(1–4), 1–3.
Article Google Scholar
Fan, R. E., Chang, K. W., Hsieh, C. J., Wang, X. R., & Lin, C. J. (2008). LIBLINEAR: A library for large linear classification. Journal of Machine Learning Research, 9, 1871–1874.
Google Scholar
Garey, M. R., Johnson, D. S., & Sethi, R. (1976). The complexity of flowshop and jobshop scheduling. Mathematics of Operations Research, 1(2), 117–129.
Article Google Scholar
Gomes, C. P., & Selman, B. (2001). Algorithm portfolios. Artificial Intelligence, 126(1–2), 43–62.
Article Google Scholar
Gurobi Optimization, Inc. (2014). Gurobi optimization (version 6.0.0) [software]. http://www.gurobi.com/.
Hannan, J. (1957). Approximation to bayes risk in repeated play. Contributions to the Theory of Games, 3, 97–139.
Google Scholar
Haupt, R. (1989). A survey of priority rule-based scheduling. OR Spectrum, 11, 3–16.
Article Google Scholar
Ingimundardottir, H., & Runarsson, T. P. (2011). Supervised learning linear priority dispatch rules for job-shop scheduling. In: Learning and intelligent optimization, Lecture Notes in Computer Science (Vol. 6683, pp. 263–277). Berlin: Springer
Ingimundardottir, H., & Runarsson, T. (2014). Evolutionary learning of weighted linear composite dispatching rules for scheduling. In International conference on evolutionary computation theory and applications. SCITEPRESS.
Ingimundardottir, H., & Runarsson, T. P. (2015). Generating training data for learning linear composite dispatching rules for scheduling. In Learning and intelligent optimization, Lecture Notes in Computer Science (Vol. 8994, pp. 236–248). Berlin: Springer.
Jayamohan, M., & Rajendran, C. (2004). Development and analysis of cost-based dispatching rules for job shop scheduling. European Journal of Operational Research, 157(2), 307–321.
Article Google Scholar
Judah, K., Fern, A., & Dietterich, T. G. (2012). Active imitation learning via reduction to I.I.D. active learning. CoRR abs/1210.4876.
Kim, B., & Pineau, J. (2013). Maximum mean discrepancy imitation learning. In Robotics: Science and systems.
Korytkowski, P., Rymaszewski, S., & Wiśniewski, T. (2013). Ant colony optimization for job shop scheduling using multi-attribute dispatching rules. The International Journal of Advanced Manufacturing Technology, 1(67), 231–241.
Article Google Scholar
Li, X., & Olafsson, S. (2005). Discovering dispatching rules using data mining. Journal of Scheduling, 8, 515–527.
Article Google Scholar
Lu, M. S., & Romanowski, R. (2013). Multicontextual dispatching rules for job shops with dynamic job arrival. The International Journal of Advanced Manufacturing Technology, 67(1–4), 19–33.
Article Google Scholar
Malik, A. M., Russell, T., Chase, M., & Beek, P. (2008). Learning heuristics for basic block instruction scheduling. Journal of Heuristics, 14(6), 549–569.
Article Google Scholar
Meeran, S., & Morshed, M. (2012). A hybrid genetic tabu search algorithm for solving job shop scheduling problems: a case study. Journal of intelligent manufacturing, 23(4), 1063–1078.
Article Google Scholar
Mönch, L., Fowler, J. W., & Mason, S. J. (2013). Production planning and control for semiconductor wafer fabrication facilities. In Operations Research/Computer Science Interfaces Series, Vol. 52, chap. 4. Berlin: Springer
Olafsson, S., & Li, X. (2010). Learning effective new single machine dispatching rules from optimal scheduling data. International Journal of Production Economics, 128(1), 118–126.
Article Google Scholar
Panwalkar, S. S., & Iskander, W. (1977). A survey of scheduling rules. Operations Research, 25(1), 45–61.
Article Google Scholar
Pickardt, C. W., Hildebrandt, T., Branke, J., Heger, J., & Scholz-Reiter, B. (2013). Evolutionary generation of dispatching rule sets for complex dynamic scheduling problems. International Journal of Production Economics, 145(1), 67–77.
Article Google Scholar
Pinedo, M. L. (2008). Scheduling: Theory, Algorithms, and Systems (3rd ed.). Berlin: Springer.
Google Scholar
Rice, J. R. (1976). The algorithm selection problem. Advances in Computers, 15, 65–118.
Article Google Scholar
Ross, S., & Bagnell, D. (2010). Efficient reductions for imitation learning. In Proceedings of the thirteenth international conference on artificial intelligence and statistics, Vol. 9, pp. 661–668.
Ross, S., Gordon, G. J., & Bagnell, D. (2011). A reduction of imitation learning and structured prediction to no-regret online learning. In Proceedings of the fourteenth international conference on artificial intelligence and statistics, Vol. 15, pp. 627–635. Journal of Machine Learning Research—Workshop and Conference Proceedings.
Ross, S., Melik-Barkhudarov, N., Shankar, K., Wendel, A., Dey, D., Bagnell, J., et al. (2013). Learning monocular reactive UAV control in cluttered natural environments. In IEEE international conference on robotics and automation, pp. 1765–1772.
Runarsson, T. (2006). Ordinal regression in evolutionary computation. In Parallel problem solving from nature—PPSN IX, Lecture Notes in Computer Science (Vol. 4193, pp. 1048–1057). Berlin: Springer.
Runarsson, T. P., Schoenauer, M., & Sebag, M. (2012). Pilot, rollout and monte carlo tree search methods for job shop scheduling. In Learning and intelligent optimization, Lecture Notes in Computer Science, pp. 160–174. Berlin: Springer.
Russell, T., Malik, A. M., Chase, M., & van Beek, P. (2009). Learning heuristics for the superblock instruction scheduling problem. IEEE Transactions on Knowledge and Data Engineering, 21(10), 1489–1502.
Xu, L., Hutter, F., Hoos, H., & Leyton-Brown, K. (2007). SATzilla-07: The design and analysis of an algorithm portfolio for SAT. In Principles and practice of constraint programming.
Yu, J. M., Doh, H. H., Kim, J. S., Kwon, Y. J., Lee, D. H., & Nam, S. H. (2013). Input sequencing and scheduling for a reconfigurable manufacturing system with a limited number of fixtures. The International Journal of Advanced Manufacturing Technology, 67(1–4), 157–169.
Article Google Scholar

Download references

Author information

Authors and Affiliations

University of Iceland, School of Engineering and Natural Sciences, Reykjavík, Iceland
Helga Ingimundardottir & Thomas Philip Runarsson

Authors

Helga Ingimundardottir
View author publications
You can also search for this author inPubMed Google Scholar
Thomas Philip Runarsson
View author publications
You can also search for this author inPubMed Google Scholar

Corresponding author

Correspondence to Helga Ingimundardottir.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Ingimundardottir, H., Runarsson, T.P. Discovering dispatching rules from data using imitation learning: A case study for the job-shop problem. J Sched 21, 413–428 (2018). https://doi.org/10.1007/s10951-017-0534-0

Download citation

Published: 13 June 2017
Issue Date: August 2018
DOI: https://doi.org/10.1007/s10951-017-0534-0

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Discovering dispatching rules from data using imitation learning: A case study for the job-shop problem

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Integration of Dispatch Rules for JSSP: A Learning Approach

Experimental Proposal with Mallows Distribution Applied to the Mixed No-Idle Permutation Flowshop Scheduling Problem

Real-world flexible resource profile scheduling with multiple criteria: learning scalarization functions for MIP and heuristic approaches

Explore related subjects

Notes

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now