About the Flax/JAX Projects category
|
|
3
|
2110
|
July 1, 2022
|
Train the Best Sentence Embedding Model Ever with 1B Training Pairs
|
|
36
|
23208
|
July 2, 2023
|
How to implement learnable position embed?
|
|
0
|
617
|
July 16, 2023
|
Train REINFORCE with JAX
|
|
0
|
553
|
July 15, 2023
|
Unsupervised Code-Code Translation based on TransCoder
|
|
11
|
2768
|
June 28, 2023
|
About training flax transformers: The design choice to use targets variable from external scope vs. give params as argument to loss_fn
|
|
0
|
515
|
June 27, 2023
|
Pretrain T5 for Arabic
|
|
17
|
2628
|
June 11, 2023
|
[Open-to-the-community] Community week using JAX/Flax for NLP & CV :jax:
|
|
52
|
19291
|
April 25, 2023
|
PreTrain RoBERTa/T5 from scratch for Programming Languages
|
|
27
|
3437
|
April 16, 2023
|
msgpack.exceptions.ExtraData: unpack(b) received extra data
|
|
0
|
1464
|
April 15, 2023
|
Issue loading a FlaxHybridCLIP trained model
|
|
0
|
618
|
April 14, 2023
|
T5x Model Checkpoint Surgery
|
|
0
|
907
|
April 13, 2023
|
BigBirDNA - Pretraining BigBird on DNA sequences
|
|
20
|
3785
|
March 21, 2023
|
Stable Diffusion on Tpu using Colab
|
|
1
|
1749
|
March 1, 2023
|
Develop robust examples for model parallel training on TPU's
|
|
0
|
873
|
January 15, 2023
|
Calling python run_t5_mlm_flax.py when running on multiple GPU
|
|
1
|
1036
|
December 2, 2022
|
Padding for T5-flax pre-training on protein sequences
|
|
0
|
762
|
November 29, 2022
|
How do I construct a function to inference?
|
|
0
|
1360
|
September 13, 2022
|
Fine-tuning BERT-based language model to overcome gender-bias
|
|
6
|
2599
|
September 11, 2022
|
Is resize_token_embeddings available to the FlaxPreTrainedModel?
|
|
1
|
1700
|
August 25, 2022
|
How to get word embeddings for Flax model?
|
|
1
|
1011
|
August 25, 2022
|
PreTrain T5 from scratch in Bengali
|
|
5
|
2181
|
July 26, 2022
|
PreTrain ProteinBERT from scratch
|
|
5
|
2248
|
July 6, 2022
|
Covid19 adverse event detection
|
|
21
|
1922
|
May 10, 2022
|
Where are the jax jit annotations in flax models?
|
|
0
|
1459
|
May 2, 2022
|
StyleGAN2 for medical datasets
|
|
21
|
2886
|
May 1, 2022
|
Fine-tune CLIP on satellite images+captions
|
|
14
|
4924
|
April 6, 2022
|
Pretrain GPT-Neo for Open Source GitHub Copilot Model
|
|
54
|
23716
|
January 18, 2022
|
Training scripts
|
|
1
|
2982
|
January 5, 2022
|
Image captioning for French with pre-trained vision and text model
|
|
6
|
2139
|
January 4, 2022
|