Publications | Uladzislau Yorsh

On Difficulties of Attention Factorization through Shared Memory

Uladzislau Yorsh, Martin Holeňa, Ondřej Bojar, David Herel

May 2024 At ICLR 2024 Tiny Papers

Our findings challenge the conventional thinking on the models that use external learnable memory, like Luna or Memory Augmented Transformer, to reduce the computational complexity. We reveal that interfacing with the memory directly through an attention operation is suboptimal, and that the models’ performance may be considerably improved by filtering the input signal before communicating with it.

ICLR 2024 Tiny Paper

Text-to-Ontology Mapping via Natural Language Processing Models

Uladzislau Yorsh, Alexander S. Behr, Norbert Kockmann, Martin Holeňa

September 2022 In ITAT 2022

We have started to automatize the process of an automatic assignment of a relevant ontology to an input article, and attack the problem by utilizing state-of-the-art NLP tools and neural networks. We assess the quality by visializing the latent space of annotation and text embeddings and sampling examples of mappings between text fragments and ontology annotations.

Accepted to ITAT 2022

Linear Self-Attention Approximation via Trainable Feedforward Kernel

Uladzislau Yorsh, Alexander Kovalenko

September 2022 At ICANN 2022

We employ the kernelized formulation of an attention computation in Transformer, and evaluate the kernel implementation as FFNN on the subset of the Long Range Arena.

DOI Accepted to ICANN 2022

SimpleTRON: Simple Transformer with O(N) Complexity

Uladzislau Yorsh, Alexander Kovalenko, Vojtěch Vančura, Daniel Vašata, Pavel Kordík, Tomáš Mikolov

June 2022

We have suggested, that a Transformer attention module can be implemented without a nonlinearity between query and key multiplication, and evaluated our findings on the Long Range Arena subset.

Preprint