Modeling documents for structure recognition using generalized N-grams | IEEE Conference Publication | IEEE Xplore