| Topic | Replies | Views | Activity |
|---|---|---|---|
| ContractNLI-based NDA Risk Analyzer using RoBERTa + Chunking – Looking for Feedback | 5 | 25 | November 23, 2025 |
| DistilBERT reaches 76% accuracy but still predicts “believable” for impossible/fantasy excuses — why? | 3 | 19 | November 23, 2025 |
| Train instance segmentation model with dinov3 backbone | 1 | 28 | November 22, 2025 |
| Search query autocomplete from the queries I have in my data | 1 | 1688 | November 21, 2025 |
| How to sample from the validation set when using Trainer? | 5 | 1965 | November 21, 2025 |
| Evaluate subset of data during training | 6 | 5895 | November 21, 2025 |
| NeuroTrace – GPT-2 Small Residual Attack & Defence Framework (IOI Task) | 0 | 16 | November 21, 2025 |
| Passing Inputs Longer Than 512 Tokens After Pretraining a T5 Model: Is It Safe? | 3 | 21 | November 20, 2025 |
| [LLaVA-1.5] Validating Logic for Token-Level KV Cache Extraction | 3 | 18 | November 20, 2025 |
| Evaluation of expert router logits simultaneous to generation | 4 | 23 | November 19, 2025 |
| AetherMind-KD-Student (184M) Compact, fast, and robust NLI model distilled from DeBERTa-v3 | 2 | 11 | November 15, 2025 |
| Fine-tuning a custom module but do not use LoRA | 1 | 29 | November 14, 2025 |
| Inconsistent output between flash attention and eager | 3 | 35 | November 14, 2025 |
| Num_return_sequences > num_beams | 3 | 14 | November 13, 2025 |
| Debugging inf/NaN Loss in Multi-Process Optuna/PyTorch Lightning HPO in Colab | 3 | 17 | November 13, 2025 |
| Why does using `TextIteratorStreamer` result in so many empty outputs? | 6 | 33 | November 11, 2025 |
| Creating language model only Lora Config | 3 | 35 | November 10, 2025 |
| IndexError: index -1 is out of bounds for dimension 0 with size 0 | 3 | 37 | November 7, 2025 |
| How to use Qwen3-VL generate() with num_return_sequences > 1? | 3 | 34 | November 6, 2025 |
| How can I get a list of word segmentation results for non-English string? | 14 | 42 | November 6, 2025 |
| PEFT with SFTTrainer unexpected 'resume_from_checkpoint' | 2 | 28 | November 6, 2025 |
| Model fine-tuning not respecting <\|endoftext\|> stop tokens during training | 1 | 22 | November 4, 2025 |
| Additional_chat_templates does not exist on "main" | 5 | 235 | November 3, 2025 |
| [Research/Discussion] Depth-agnostic stability for residual models (no extra norms, no tuning). Is this useful to you? | 0 | 9 | November 3, 2025 |
| Xcode Can't Find swift-transformers Package | 1 | 21 | November 2, 2025 |
| AutoTokenizer 404 error issue | 3 | 141 | November 2, 2025 |
| Doing inference with FSDP during training affects checkpointing | 3 | 620 | November 1, 2025 |
| Trainer being very slow to init training setting group_by_length to True | 4 | 372 | October 29, 2025 |
| Unable to Run Sentence Transformer Text embedding in Docker | 2 | 617 | October 29, 2025 |
| Training with Trainer really slow | 1 | 1698 | October 27, 2025 |