Fine Tuning OCR Free Document Understanding Transformer for Image-to-Text Captioning
Keywords - Deep Learning, Multi-Modal, Image-to-Text, Multiheaded Attention, Encoder-Decoder, PyTorch, Hugging Face, GPU, SGD, AdamW, Levenshtein Distance, Cross Entropy Loss, Attention Mask