Skip to main content

BERT: Pre-training of Deep Bidirectional Transformers (Devlin et al 2018)

Draft notes on BERT: Pre-training of Deep Bidirectional Transformers (Devli.

Draft — not yet written.

Overview

Key Ideas

Further Reading