About the Flax/JAX Projects category
|
|
3
|
2111
|
July 1, 2022
|
Train the Best Sentence Embedding Model Ever with 1B Training Pairs
|
|
36
|
23341
|
July 2, 2023
|
How to implement learnable position embed?
|
|
0
|
618
|
July 16, 2023
|
Train REINFORCE with JAX
|
|
0
|
559
|
July 15, 2023
|
Unsupervised Code-Code Translation based on TransCoder
|
|
11
|
2780
|
June 28, 2023
|
About training flax transformers: The design choice to use targets variable from external scope vs. give params as argument to loss_fn
|
|
0
|
515
|
June 27, 2023
|
Pretrain T5 for Arabic
|
|
17
|
2633
|
June 11, 2023
|
[Open-to-the-community] Community week using JAX/Flax for NLP & CV :jax:
|
|
52
|
19334
|
April 25, 2023
|
PreTrain RoBERTa/T5 from scratch for Programming Languages
|
|
27
|
3439
|
April 16, 2023
|
msgpack.exceptions.ExtraData: unpack(b) received extra data
|
|
0
|
1470
|
April 15, 2023
|
Issue loading a FlaxHybridCLIP trained model
|
|
0
|
620
|
April 14, 2023
|
T5x Model Checkpoint Surgery
|
|
0
|
907
|
April 13, 2023
|
BigBirDNA - Pretraining BigBird on DNA sequences
|
|
20
|
3786
|
March 21, 2023
|
Stable Diffusion on Tpu using Colab
|
|
1
|
1762
|
March 1, 2023
|
Develop robust examples for model parallel training on TPU's
|
|
0
|
875
|
January 15, 2023
|
Calling python run_t5_mlm_flax.py when running on multiple GPU
|
|
1
|
1038
|
December 2, 2022
|
Padding for T5-flax pre-training on protein sequences
|
|
0
|
763
|
November 29, 2022
|
How do I construct a function to inference?
|
|
0
|
1366
|
September 13, 2022
|
Fine-tuning BERT-based language model to overcome gender-bias
|
|
6
|
2603
|
September 11, 2022
|
Is resize_token_embeddings available to the FlaxPreTrainedModel?
|
|
1
|
1713
|
August 25, 2022
|
How to get word embeddings for Flax model?
|
|
1
|
1012
|
August 25, 2022
|
PreTrain T5 from scratch in Bengali
|
|
5
|
2183
|
July 26, 2022
|
PreTrain ProteinBERT from scratch
|
|
5
|
2250
|
July 6, 2022
|
Covid19 adverse event detection
|
|
21
|
1922
|
May 10, 2022
|
Where are the jax jit annotations in flax models?
|
|
0
|
1461
|
May 2, 2022
|
StyleGAN2 for medical datasets
|
|
21
|
2891
|
May 1, 2022
|
Fine-tune CLIP on satellite images+captions
|
|
14
|
4935
|
April 6, 2022
|
Pretrain GPT-Neo for Open Source GitHub Copilot Model
|
|
54
|
23729
|
January 18, 2022
|
Training scripts
|
|
1
|
2999
|
January 5, 2022
|
Image captioning for French with pre-trained vision and text model
|
|
6
|
2140
|
January 4, 2022
|