News

Using a Transformer encoder is advantageous because it allows for “bi-directional information aggregation” as compared to the LSTM used in the original pointer networks. The decoder is conditioned on ...