Confidence Scores / Self-Training for Wav2Vec2 / CTC models With LM (PyCTCDecode)
|
|
1
|
2886
|
April 21, 2022
|
More expressive attention with negative weights
|
|
1
|
288
|
November 14, 2024
|
How to fine tune fine tune GitHub Copilot?
|
|
3
|
3621
|
June 24, 2022
|
Translation model to 100+ Languages
|
|
4
|
1742
|
January 25, 2025
|
Using mixup on RoBERTa
|
|
7
|
2273
|
December 18, 2024
|
XLSR-Wav2Vec2 with punctuation
|
|
1
|
1386
|
October 12, 2022
|
How can I replicate the research paper?
|
|
1
|
694
|
December 22, 2023
|
Text to Speech Alignment with Transformers
|
|
2
|
5443
|
April 20, 2022
|
Merry Christmas & We have released "Awesome-Neuro-Symbolic-Learning-with-LLM"
|
|
0
|
89
|
December 26, 2024
|
New seq2seq tool: search hparam space with run_eval.py
|
|
5
|
347
|
September 17, 2020
|
Using NLP for People On Low Income in the UK
|
|
0
|
836
|
September 24, 2021
|
[Call for participation] Interactive Grounded Language Understanding in a Collaborative Environment (IGLU) Competition@NeurIPS2021
|
|
0
|
727
|
September 9, 2021
|
Understanding what went wrong in attention
|
|
5
|
1647
|
July 31, 2020
|
New: Distributed GPU Platform
|
|
2
|
655
|
November 8, 2023
|
[Suggestions and Guidance]Finetuning Bert models for Next word Prediction
|
|
4
|
4877
|
January 26, 2022
|
A Complete Survey on Generative AI (AIGC): Is ChatGPT from GPT-4 to GPT-5 All You Need?
|
|
3
|
2996
|
September 23, 2024
|
What are some recommended pretrained models for extracting semantic feature on single sentence?
|
|
4
|
1490
|
December 14, 2020
|
Runtime Identity Drift in LLMs — Can We Stabilize Without Memory?
|
|
3
|
169
|
April 28, 2025
|
Embeddings from the Decoder only model
|
|
5
|
1289
|
March 26, 2025
|
AI-Human Co-Creation: Seeking Collaborators for an Ethical AI Development Framework
|
|
2
|
90
|
May 20, 2025
|
Coral USB Edge TPU coprocessor
|
|
0
|
479
|
April 9, 2024
|
Why are segment and position embeddings so large?
|
|
2
|
1543
|
August 2, 2020
|
Shortformer: Better Language Modeling using Shorter Inputs
|
|
0
|
467
|
December 31, 2020
|
Similarity search with combined image and text?
|
|
6
|
3132
|
June 24, 2022
|
Model that can generate both text and image as output
|
|
5
|
1058
|
December 31, 2024
|
Seq2Seq Distillation: Methodology Questions
|
|
7
|
2741
|
August 7, 2023
|
Dealing with Imbalanced Datasets?
|
|
1
|
5437
|
March 11, 2021
|
Using transformers (BERT, RoBERTa) without embedding layer
|
|
8
|
4128
|
December 16, 2020
|
How to get the probabilities of each class when we use T5 or Flan
|
|
0
|
389
|
January 23, 2024
|
Pre-training with Lamb optimizer
|
|
7
|
4308
|
December 28, 2020
|
Introduce Our New Paper "OS Agents: A Survey on MLLM-based Agents for General Computing Devices Use"
|
|
1
|
265
|
March 21, 2025
|
Carrying Gradients Through Generate
|
|
5
|
2704
|
January 29, 2023
|
How to correctly cite Hugging Face Transformer model_doc
|
|
1
|
7924
|
October 25, 2023
|
Strategies for Enhancing LLM's Understanding of a Complex Novel for Improved Question Answering
|
|
1
|
1301
|
January 19, 2024
|
Evaluation metrics for BERT-like LMs
|
|
4
|
4599
|
December 6, 2024
|
Combinatorial Optimization with LLMs/Transformers
|
|
7
|
2009
|
February 8, 2025
|
GPT-2 in DNA data
|
|
1
|
1269
|
August 6, 2023
|
State of the art technique for initializing Embedding Matrix?
|
|
3
|
5009
|
July 19, 2020
|
Few-shot learning vs Fine-Tuning
|
|
0
|
1782
|
May 26, 2023
|
`nan` training loss but eval loss does improve over time
|
|
5
|
3998
|
October 10, 2022
|
Open source psychology project using HF sentence transformers
|
|
6
|
364
|
January 3, 2025
|
Modern NLP for "Economics of Innovation" (Open Research Project using Patent Data)
|
|
4
|
757
|
July 14, 2020
|
Text similarity not by cosine similarity
|
|
3
|
4694
|
April 12, 2022
|
The Tree Oil Painting: AI Unlocks a 10-Year Art Mystery
|
|
0
|
50
|
March 17, 2025
|
Question about maximum number of tokens
|
|
1
|
6147
|
February 9, 2021
|
Encoder-Decoder vs Decoder Only Architecture Models
|
|
0
|
1515
|
December 18, 2022
|
Continue pre-training GPT2
|
|
1
|
591
|
March 10, 2025
|
Understanding Technical Drawings
|
|
2
|
465
|
January 22, 2025
|
How I fine-tune BART for summarization using large texts?
|
|
3
|
3983
|
September 28, 2020
|
Is there a open source implementation of "Deep Learning Based Page Layout Analyze"?
|
|
5
|
1793
|
May 21, 2024
|