April 24, 2026
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Notes
Bidirectional pre-training via masked language modeling. Defined the pre-train/fine-tune paradigm.
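The masking procedure from the paper can be sketched as follows. This is a minimal illustration, not the reference implementation: BERT selects roughly 15% of token positions, and of those replaces 80% with `[MASK]`, 10% with a random vocabulary token, and leaves 10% unchanged; the loss is computed only at the selected positions. The function name, toy vocabulary, and tokens here are illustrative.

```python
import random

MASK = "[MASK]"
TOY_VOCAB = ["the", "cat", "dog", "sat", "on", "mat"]  # illustrative vocabulary

def mask_for_mlm(tokens, mask_prob=0.15, rng=None):
    """Apply BERT-style MLM corruption to a token list.

    Returns (corrupted, labels): labels[i] is the original token at
    positions selected for prediction, and None elsewhere (positions
    with None are excluded from the training loss).
    """
    rng = rng or random.Random(0)
    corrupted, labels = [], []
    for tok in tokens:
        if rng.random() < mask_prob:          # ~15% of positions selected
            labels.append(tok)                # model must recover the original
            r = rng.random()
            if r < 0.8:
                corrupted.append(MASK)        # 80%: replace with [MASK]
            elif r < 0.9:
                corrupted.append(rng.choice(TOY_VOCAB))  # 10%: random token
            else:
                corrupted.append(tok)         # 10%: keep unchanged
        else:
            labels.append(None)               # not selected: no loss here
            corrupted.append(tok)
    return corrupted, labels
```

Keeping 10% of selected tokens unchanged matters because `[MASK]` never appears at fine-tuning time; the model cannot rely on the mask token alone to know which positions to predict.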