Abstract: Transformer-based models, such as Bidirectional Encoder Representations from Transformers (BERT), cannot process long sequences because their self-attention operation scales quadratically ...
Abstract: Text classification (TC) has gained substantial importance across diverse domains due to the increasing proliferation of internet-based and digital technologies. Organizations worldwide ...