Authors:
Subhadra Vadlamannati¹ and Ryan Solgi²
Affiliations:
¹ Mercer Island High School, 9100 SE 42nd St, Mercer Island, U.S.A.
² Department of Electrical and Computer Engineering, University of California Santa Barbara, Santa Barbara, U.S.A.
Keyword(s):
Neural Networks, Machine Learning, Natural Language Processing, ALIGN, Tensor-Train Decomposition, Vision-Language Modelling.
Abstract:
The transformer architecture has revolutionized Natural Language Processing (NLP) and other machine-learning tasks due to its unprecedented accuracy. However, the extensive memory and parameter requirements of transformers often hinder their practical application. In this work, we study the effect of tensor-train decomposition on compressing transformer vision-language neural networks, namely BERT and ViT, while improving their accuracy. We focus on both embedding-layer compression and partial tensorization of neural networks (PTNN) through an algorithmic approach. Our novel PTNN approach significantly improves the accuracy of existing models by up to 5%, all without the need for post-training adjustments, breaking new ground in the field of tensor decomposition.
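For readers unfamiliar with the technique, the sketch below illustrates the standard TT-SVD procedure that tensor-train compression builds on. It is a generic NumPy illustration, not the authors' implementation; the embedding-table dimensions, the reshape into a 6-way tensor, the rank cap, and the function name are all assumptions made for the example.

```python
import numpy as np

def tt_svd(tensor, max_rank):
    """Generic TT-SVD sketch (not the paper's code): factor a d-way
    tensor into 3-way cores G_k of shape (r_{k-1}, n_k, r_k), with
    each successive SVD truncated to at most max_rank."""
    shape, d = tensor.shape, tensor.ndim
    cores, r_prev = [], 1
    mat = tensor.reshape(shape[0], -1)
    for k in range(d - 1):
        mat = mat.reshape(r_prev * shape[k], -1)
        u, s, vt = np.linalg.svd(mat, full_matrices=False)
        r = min(max_rank, s.size)                  # truncate the TT rank
        cores.append(u[:, :r].reshape(r_prev, shape[k], r))
        mat = s[:r, None] * vt[:r]                 # carry remainder forward
        r_prev = r
    cores.append(mat.reshape(r_prev, shape[-1], 1))
    return cores

# Toy check: view a hypothetical 4096 x 512 embedding table as a
# 6-way tensor (16, 16, 16, 8, 8, 8) and compress it.
emb = np.random.randn(4096, 512)
cores = tt_svd(emb.reshape(16, 16, 16, 8, 8, 8), max_rank=32)

# Contract the cores back together and measure reconstruction error.
rec = cores[0]
for g in cores[1:]:
    rec = np.tensordot(rec, g, axes=([-1], [0]))
rec = rec.reshape(emb.shape)
print("relative error:", np.linalg.norm(rec - emb) / np.linalg.norm(emb))
```

Storing the small 3-way cores in place of the full table is what yields the parameter savings: each core holds at most max_rank × n_k × max_rank entries, so the total grows linearly rather than multiplicatively in the tensor dimensions.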