
SBERT-WK: A Sentence Embedding Method by Dissecting BERT-Based Word Models


Abstract:

Sentence embedding is an important research topic in natural language processing (NLP) since it can transfer knowledge to downstream tasks. Meanwhile, a contextualized word representation, called BERT, achieves state-of-the-art performance in many NLP tasks. Yet, generating a high-quality sentence representation from BERT-based word models remains an open problem. Previous studies have shown that different layers of BERT capture different linguistic properties. This allows us to fuse information across layers to find better sentence representations. In this work, we study the layer-wise pattern of the word representations of deep contextualized models. Then, we propose a new sentence embedding method by dissecting BERT-based word models through geometric analysis of the space spanned by the word representations. It is called the SBERT-WK method. No further training is required in SBERT-WK. We evaluate SBERT-WK on semantic textual similarity and downstream supervised tasks. Furthermore, ten sentence-level probing tasks are presented for detailed linguistic analysis. Experiments show that SBERT-WK achieves state-of-the-art performance. Our code is publicly available.
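
The abstract only sketches the idea at a high level. As an illustration of what layer-wise fusion of BERT word representations looks like in practice, here is a minimal sketch using the Hugging Face transformers library. It simply averages each token's hidden states across layers and then mean-pools over tokens; it does not reproduce the SBERT-WK weighting derived from the paper's geometric analysis, and the model name "bert-base-uncased" is an assumption chosen for demonstration.

    # Minimal sketch: fusing BERT word representations across layers into a
    # sentence embedding. This is NOT the SBERT-WK weighting scheme; it only
    # illustrates the general idea of combining information across layers.
    import torch
    from transformers import AutoModel, AutoTokenizer

    MODEL_NAME = "bert-base-uncased"  # assumed model, for illustration only

    tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
    model = AutoModel.from_pretrained(MODEL_NAME, output_hidden_states=True)
    model.eval()

    def sentence_embedding(sentence: str) -> torch.Tensor:
        inputs = tokenizer(sentence, return_tensors="pt")
        with torch.no_grad():
            outputs = model(**inputs)
        # outputs.hidden_states: tuple of (num_layers + 1) tensors, each of
        # shape (1, seq_len, hidden_dim), including the embedding layer.
        hidden_states = torch.stack(outputs.hidden_states, dim=0)
        # Fuse across layers (uniform average here; SBERT-WK instead derives
        # per-word weights from the geometry of the layer-wise representations).
        word_vectors = hidden_states.mean(dim=0).squeeze(0)  # (seq_len, hidden_dim)
        # Pool over tokens to obtain a single sentence vector.
        return word_vectors.mean(dim=0)  # (hidden_dim,)

    if __name__ == "__main__":
        emb = sentence_embedding("Sentence embedding transfers knowledge to downstream tasks.")
        print(emb.shape)  # torch.Size([768]) for bert-base-uncased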
Page(s): 2146 - 2157
Date of Publication: 14 July 2020

