v21’s Twitter Archive
—№ 61,615
RT @artetxem: Who said that training GPT-2 or BERT was expensive? "We use 512 Nvidia V100 GPUs [...] Upon the submission of this paper, tr…
Retweet
2019 Oct 1