multihead-attention
TransformerX is a python library for building transformer-based models using ready-to-use layers.
This package is a Tensorflow2/Keras implementation for Graph Attention Network embeddings and also provides a Trainable layer for Multihead Graph Attention.