deep-learning - Chadrick Blog

Jul 30, 2022 paper review: "Graph Attention Networks"

May 26, 2022 pytorch implementation of sinusoidal position encoding

May 25, 2022 relu, gelu, swish, mish activation function comparison

Jan 15, 2022 paper review: "Donut: Document Understanding Transformer without OCR"

Jan 11, 2022 paper review: “BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension”

Nov 23, 2021 paper summary: "DocFormer: End-to-End Transformer for Document Understanding"

Nov 19, 2021 paper review: "LayoutLMV2: Multi-Modal Pre-training for Visually-Rich Document Understanding"

Nov 10, 2021 paper summary: "BROS: A Pre-trained Language Model Focusing on Text and Layout for Better Key Information Extraction from Documents"

Sep 27, 2021 paper summary: “Perceiver IO: A General Architecture for Structured Inputs & Outputs”

Sep 27, 2021 paper summary: "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows"