Text and Layout Information Extraction from Document Files of Various Formats Based on the Analysis of Page Description Language | IEEE Conference Publication | IEEE Xplore