結果 : cnn transformer based encoder decoder model for nepali image captioning