Paper: Partial Tensorized Transformers for Natural Language Processing

Authors: Subhadra Vadlamannati 1 and Ryan Solgi 2

Affiliations: 1 Mercer Island High School, 9100 SE 42nd St, Mercer Island, U.S.A.; 2 Department of Electrical and Computer Engineering, University of California Santa Barbara, Santa Barbara, U.S.A.

Keyword(s): Neural Networks, Machine Learning, Natural Language Processing, ALIGN, Tensor-Train Decomposition, Vision-Language Modelling.

Abstract: The transformer architecture has revolutionized Natural Language Processing (NLP) and other machine-learning tasks thanks to its unprecedented accuracy. However, the extensive memory and parameter requirements of transformer models often hinder their practical application. In this work, we study the use of tensor-train decomposition both to compress transformer-based vision-language neural networks, namely BERT and ViT, and to improve their accuracy. We focus on embedding-layer compression and on partial tensorization of neural networks (PTNN) through an algorithmic approach. Our novel PTNN approach significantly improves the accuracy of existing models by up to 5%, without the need for post-training adjustments, breaking new ground in the field of tensor decomposition.
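
To give a rough sense of why tensor-train (TT) decomposition can compress an embedding layer, the sketch below counts the parameters of a dense embedding matrix and of a TT-matrix with cores of shape (r[k-1], m[k], n[k], r[k]). It is an illustration under stated assumptions, not the authors' configuration: the vocabulary padding to 32768, the factorizations (32, 32, 32) and (8, 8, 12), the TT-ranks (1, 16, 16, 1), and the helper name `tt_matrix_params` are all hypothetical choices made for this example.

```python
from math import prod


def tt_matrix_params(row_factors, col_factors, ranks):
    """Count parameters of a TT-matrix.

    Core k has shape (ranks[k], row_factors[k], col_factors[k], ranks[k + 1]).
    Hypothetical helper for illustration; not taken from the paper.
    """
    d = len(row_factors)
    if len(col_factors) != d:
        raise ValueError("row_factors and col_factors must have equal length")
    if len(ranks) != d + 1 or ranks[0] != 1 or ranks[-1] != 1:
        raise ValueError("ranks must have length d + 1 with boundary ranks of 1")
    return sum(
        ranks[k] * row_factors[k] * col_factors[k] * ranks[k + 1]
        for k in range(d)
    )


# Assumed shapes: a BERT-base-sized embedding (30522 x 768), with the
# vocabulary padded to 32768 so that it factors evenly.
vocab_factors = (32, 32, 32)   # 32 * 32 * 32 = 32768 rows (padded vocabulary)
hidden_factors = (8, 8, 12)    # 8 * 8 * 12 = 768 columns (hidden size)
ranks = (1, 16, 16, 1)         # assumed TT-ranks

dense_params = prod(vocab_factors) * prod(hidden_factors)
tt_params = tt_matrix_params(vocab_factors, hidden_factors, ranks)

print(f"dense embedding parameters: {dense_params}")
print(f"TT-matrix parameters:       {tt_params}")
print(f"compression ratio:          {dense_params / tt_params:.1f}x")
```

The parameter count grows with the square of the TT-rank for interior cores, so the rank is the main knob trading compression against how closely the original matrix can be approximated. The count says nothing about accuracy, which has to be measured on the task.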

CC BY-NC-ND 4.0


Paper citation in several formats:
Vadlamannati, S. and Solgi, R. (2024). Partial Tensorized Transformers for Natural Language Processing. In Proceedings of the 16th International Conference on Agents and Artificial Intelligence - Volume 3: ICAART; ISBN 978-989-758-680-4; ISSN 2184-433X, SciTePress, pages 543-547. DOI: 10.5220/0012366500003636

@conference{icaart24,
author={Subhadra Vadlamannati and Ryan Solgi},
title={Partial Tensorized Transformers for Natural Language Processing},
booktitle={Proceedings of the 16th International Conference on Agents and Artificial Intelligence - Volume 3: ICAART},
year={2024},
pages={543-547},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0012366500003636},
isbn={978-989-758-680-4},
issn={2184-433X},
}

TY - CONF
JO - Proceedings of the 16th International Conference on Agents and Artificial Intelligence - Volume 3: ICAART
TI - Partial Tensorized Transformers for Natural Language Processing
SN - 978-989-758-680-4
IS - 2184-433X
AU - Vadlamannati, S.
AU - Solgi, R.
PY - 2024
SP - 543
EP - 547
DO - 10.5220/0012366500003636
PB - SciTePress