Heads up: Transformer layers + Functional API -> missing trainable weights
|
|
0
|
609
|
July 14, 2020
|
Question about all_head_size under BertSelfAttention
|
|
0
|
368
|
July 13, 2020
|
Update TensorFlow version to 2.2
|
|
0
|
387
|
July 13, 2020
|
Why multiplying the output of T5 by some scalar before LM head?
|
|
0
|
279
|
July 13, 2020
|
How to yield hidden_states from a saved, fine-tuned (distil)bert model?
|
|
2
|
405
|
July 12, 2020
|
How to get NER pipeline output to match with spacy's output?
|
|
3
|
2090
|
July 12, 2020
|
Multi GPU fintuning BART
|
|
3
|
1655
|
July 11, 2020
|
High Level Overview Blogpost
|
|
1
|
347
|
July 8, 2020
|
Migration guide from v2.X to v3.X for the tokenizer API
|
|
0
|
758
|
July 7, 2020
|
Transformers v3.0.0 is out!
|
|
0
|
1941
|
July 7, 2020
|
About the Transformers category
|
|
1
|
248
|
July 7, 2020
|