Improving Speech-Based End-of-Turn Detection Via Cross-Modal Representation Learning with Punctuated Text Data | IEEE Conference Publication | IEEE Xplore