News

Transformers combined with convolutional encoders have been recently used for hand gesture recognition (HGR) using micro-Doppler signatures. In this letter, we propose a vision-transformer-based ...
Abstract: Image captioning develops a relationship between visual and text information to generate a sequence of words as captions. Transformers perform machine translation and language comprehension ...