Search code examples
How to know which words are encoded with unknown tokens in HuggingFace BertTokenizer?...


huggingface-transformershuggingface-tokenizers

Read More
Llama-2 7B-hf repeats context of question directly from input prompt, cuts off with newlines...


pythonartificial-intelligencehuggingface-transformerslarge-language-model

Read More
Continual pre-training vs. Fine-tuning a language model with MLM...


deep-learningnlphuggingface-transformersbert-language-modelpre-trained-model

Read More
Deepspeed : AttributeError: 'DummyOptim' object has no attribute 'step'...


pythonhuggingface-transformerslarge-language-modelhuggingface-trainerdeepspeed

Read More
Size mismatch for embed_out.weight: copying a param with shape torch.Size([0]) from checkpoint - Hug...


pytorchhuggingface-transformerslarge-language-modelhuggingface

Read More
How to fix no token found error while downloading hugging face?...


python-3.xpytorchhuggingface-transformers

Read More
Different embeddings for same sentences with torch transformer...


pythonpytorchhuggingface-transformersbert-language-model

Read More
I load a float32 Hugging Face model, cast it to float16, and save it. How can I load it as float16?...


pythonmachine-learninghuggingface-transformershuggingfacehalf-precision-float

Read More
How to fix the learning-rate for Huggingface´s Trainer?...


machine-learningdeep-learninghuggingface-transformershuggingface-trainerlearning-rate

Read More
Alternative to Receptive field in Transformers and what factors impact it...


nlphuggingface-transformersreceptive-field

Read More
Why aren't my metrics showing in SageMaker (CloudWatch)?...


pytorchhuggingface-transformersamazon-sagemakerhuggingfaceamazon-sagemaker-studio

Read More
Huggingface - Finetuning in Tensorflow with custom datasets...


tensorflowhuggingface-transformerstransfer-learninghuggingface-tokenizersfine-tuning

Read More
Fine-tuning model's classifier layer with new label...


pytorchhuggingface-transformers

Read More
AutoTokenizer.from_pretrained took forever to load...


pythonhuggingface-transformershuggingface-tokenizers

Read More
Error RuntimeError: CUDA error: operation not supported when tried to locate something into CUDA...


pythonpytorchhuggingface-transformers

Read More
How to use adapter transformers with a Huggingface Pipeline...


pythonmachine-learningnlphuggingface-transformersadapter-transformers

Read More
How can i solve ImportError: Using the `Trainer` with `PyTorch` requires `accelerate>=0.20.1` whe...


pythonnlpimporterrorhuggingface-transformershuggingface

Read More
Embedding of LLM vs custom embeddings...


huggingface-transformersembeddinglarge-language-modelhuggingface-tokenizersretrieval-augmented-generation

Read More
HuggingFace pipeline - Debug prompt...


pythonnlphuggingface-transformers

Read More
Speeding up load time of LLMs...


huggingface-transformerslarge-language-modelquantization

Read More
Running out of RAM when finetuning model...


machine-learningpytorchartificial-intelligencehuggingface-transformers

Read More
How to fix error `OSError: <model> does not appear to have a file named config.json.` when loa...


pytorchnlphuggingface-transformerslarge-language-modelpeft

Read More
presidio transformers package not available, despite being installed...


pythonhuggingface-transformerspiipresidio

Read More
AttributeError: 'TrainingArguments' object has no attribute 'model_init_kwargs'...


pythonnlphuggingface-transformerslarge-language-modelpeft

Read More
Is it possible to load huggingface model which does not have config.json file?...


pythonmachine-learningcomputer-visionhuggingface-transformers

Read More
How to deal with stack expects each tensor to be equal size eror while fine tuning GPT-2 model?...


pythontensorflowartificial-intelligencehuggingface-transformersgpt-2

Read More
Using torchrun with AWS sagemaker estimator on multi-GPU node...


amazon-web-servicespytorchhuggingface-transformersamazon-sagemakerdistributed-computing

Read More
What does "permutation invariant" mean in the context of transformers doing language model...


pytorchhuggingface-transformers

Read More
How to tune LLM to give full length and detailed answers...


pythonmachine-learninghuggingface-transformerslarge-language-modelnlp-question-answering

Read More
Can I dynamically add or remove LoRA weights in the transformer library like diffusers...


pythonhuggingface-transformershuggingfacepeft

Read More
BackNext