Abstract
Digital presses have consistently improved their speed in the past ten years. Meanwhile, the need for document personalization and customization has increased. As a consequence of these two facts, the traditional RIP (Raster Image Processing) process has become a highly demanding computational step in the print workflow. Print Service Providers (PSP) are now using multiple RIP engines and parallelization strategies to speed up the whole ripping process which is currently based on a per-page basis. Nevertheless, these strategies are not optimized in terms of ensuring the best Return On Investment (ROI) for the RIP engines. Depending on the input document jobs characteristics, the ripping step may not achieve the print-engine speed creating a unwanted bottleneck. The aim of this paper is to present a way to improve the ROI of PSPs proposing a profiling strategy which enables the optimal usage of RIPs for specific jobs features ensuring that jobs are always consumed at least at engine speed. The profiling strategy is based on a per-page analysis of input Portable Document Format (PDF) jobs identifying their key components. This work introduces a PDF Profiler tool aimed at extracting information from jobs and some metrics to predict a job ripping cost based on its profile. This information is extremely useful to rasterize jobs in a clever way. The computational cost estimated using the information extracted by the PDF Profiler and the proposed metrics is useful for the print jobs queue management to improve the allocated RIPs load balance, resulting in a higher throughput for the ripping step. Experiments have been carried out in order to evaluate the PDF Profiler, the proposed metrics and their impact in the print jobs queue management.
Similar content being viewed by others
References
Adobe Systems (2003) PDF Reference, 4th edn. Adobe Systems Incorporated, San Jose
Davis P, deBronkart D (2000) PPML (Personalized Print Markup Language): a new XML-based industry standard print language. In: XML Europe 2000, pp 1–14, Paris, France. International Digital Enterprise Alliance
Déjean H, Meunier J-L (2006) A system for converting PDF documents into structured XML format. In: DAS’06: Proceedings of the 7th International Workshop on Document Analysis Systems, Nelson, New Zealand. LNCS, vol 3872. Springer, Berlin, pp 129–140
Extensible Markup Language (XML Home Page) (2009) Extracted from http://www.w3.org/XML, 30th November, 2009
Getov V, Hummel SF, Mintchev S (1998) High performance parallel programming in Java: exploiting native libraries. Concurr Pract Exp 10(11):863–872
ImageMagick Home Page (2009) Extracted from http://www.imagemagick.org, 30th November 2009
LAM/MPI Home Page (2009) Extracted from http://www.lam-mpi.org/, 30th November 2009
MPI Home Page (2009) Extracted from http://www.mpi-forum.org/, 30th November 2009
MPICH Home Page (2009) Extracted from http://www-unix.mcs.anl.gov/mpi/mpich/, 30th November 2009
MPJ Express Home Page (2009) Extracted from http://mpj-express.org/, 30th November 2009
Nunes T, Giannetti F, Kolberg M, Nemetz R, Cabeda A, Fernandes LG (2009) Job profiling in high performance printing. In: ACM DocEng’09: Proceedings of the 9th ACM Symposium on Document Engineering, Munich, Germany. ACM, New York, pp 109–118
Nunes T, Raeder M, Kolberg M, Fernandes LG, Cabeda A, Giannetti F (2009) High performance printing: increasing personalized documents rendering through PPML jobs profiling and scheduling. In: IEEE CSE’09: Proceedings of the 12th IEEE International Conference on Computational Science and Engineering, Vancouver, Canada. IEEE Comput Soc, Los Alamitos, pp 285–291
Nunes T, Fernandes LG, Giannetti F, Cabeda A, Raeder M, Bedin G (2007) An improved parallel XSL-FO rendering for personalized documents. In: Euro PVM/MPI’07: Proceedings of the 14th European PVM/MPI Users Group Meeting. Recent advances in parallel virtual machine and message passing interface, Paris, France. LNCS, vol 4757. Springer, Berlin, pp 56–63
Nunes T, Giannetti F, Fernandes LG, Timmers R, Raeder M, Castro M (2006) High performance XSL-FO rendering for variable data printing. In: ACM SAC’06: Proceedings of the 21st ACM Symposium on Applied Computing, Dijon, France. ACM, New York, pp 811–817
PDFBox Home Page (2009) Extracted from http://www.pdfbox.org, 20th March 2009
Purvis L, Harrington S, O’Sullivan B, Freuder EC (2003) Creating personalized documents: an optimization approach. In: ACM Doc-Eng’03: Proceedings of the 2003 ACM Symposium on Document Engineering, Grenoble, France. ACM, New York, pp 68–77
Yuan F, Liu B, Yu G (2005) A study on information extraction from PDF files. In: ICMLC’05: Proceedings of the 4th International Conference Advances in Machine Learning and Cybernetics. LNCS, vol 3930. Springer, Berlin, pp 258–267
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Fernandes, L.G., Nunes, T., Kolberg, M. et al. Job profiling and queue management in high performance printing. Comput Sci Res Dev 27, 147–166 (2012). https://doi.org/10.1007/s00450-010-0134-0
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00450-010-0134-0