Search code examples
How to disable Neptune callback in transformers trainer runs?...

pythonpytorchcallbackhuggingface-transformersneptune

Read More
ImportError: Using the `Trainer` with `PyTorch` requires `accelerate>=0.26.0` but I have version ...

importpytorchhuggingface-transformersaccelerate

Read More
Huggingface Transformers not getting imported in VS Code...

pythonpython-importvisual-studio-codehuggingface-transformers

Read More
ValueError: Unrecognized model in ./MRPC/. Should have a `model_type` key in its config.json, or con...

pythontensorflowhuggingface-transformersbert-language-modelonnx

Read More
How to get text and image embedding of same dimension using Huggingface CLIP...

python-3.xtensorflow2.0huggingface-transformers

Read More
Tokenizer.from_file() HUGGINFACE : Exception: data did not match any variant of untagged enum ModelW...

jsonnlphuggingface-transformershuggingface-tokenizershuggingface

Read More
OSError: meta-llama/Llama-2-7b-chat-hf is not a local folder...

pythonhuggingface-transformershuggingfacellama

Read More
stucking at downloading shards for loading LLM model from huggingface...

pythonnlphuggingface-transformers

Read More
using pipelines with a local model...

pythontensorflow2.0huggingface-transformers

Read More
Removing strange/special characters from outputs llama 3.1 model...

pythonhuggingface-transformerstokenizelarge-language-modelllama

Read More
Seq2Seq trainer.train() keeps giving indexing error...

pythonnlphuggingface-transformershuggingface-trainer

Read More
Alternative to device_map = "auto" in Huggingface Pretrained...

machine-learningdeep-learningnlphuggingface-transformers

Read More
Error during the compilation of the tokenizers package when trying to install transformers 4.27...

artificial-intelligencehuggingface-transformerslarge-language-model

Read More
BertTokenizer.from_pretrained raises UnicodeDecodeError...

pythonhuggingface-transformers

Read More
Pytorch Lightning places model inputs and model to different devices...

pythonartificial-intelligencehuggingface-transformerspytorch-lightning

Read More
How are the weights of the Mistral models reinitialized in Huggingface?...

nlphuggingface-transformerslarge-language-modelmistral-7bxaiver-initialization

Read More
Loss becomes Nan after attention_mask is added to the model while fine-tuning gemma2...

pythonnanhuggingface-transformerspytorch-lightningloss

Read More
HuggingFace | ValueError: Connection error, and we cannot find the requested files in the cached pat...

python-3.xtensorflowhuggingface-transformersvalueerrorgpt-2

Read More
What is "language modeling head" in BertForMaskedLM...

nlpbert-language-modelhuggingface-transformerslanguage-model

Read More
HuggingFace - 'optimum' ModuleNotFoundError...

pythonhuggingface-transformersquantizationmodulenotfounderrorpruning

Read More
What is the exact vocab size of the Mistral-Nemo-Instruct-2407 tokenizer model?...

huggingface-transformerstokenizelarge-language-modelmistral-ai

Read More
How to Visualize Cross-Attention Matrices in MarianMTModel During Output Generation...

pythonpytorchnlphuggingface-transformers

Read More
Transformers.js in React.js...

javascriptreactjshuggingface-transformershuggingface

Read More
Why doesn't permuting positional encodings in BERT affect the output as expected?...

pythonpytorchnlphuggingface-transformers

Read More
load_state_dict getting random results...

pythonpytorchhuggingface-transformerspre-trained-modelgpt-2

Read More
Why doesn't permuting positional encodings in GPT-2 affect the output as expected?...

pytorchhuggingface-transformerstransformer-modelgpt-2

Read More
SpaCy and Gensim on Jupyter Notebooks...

pythonjupyter-notebookspacyhuggingface-transformersgensim

Read More
Does Padding in a Batch of Sequences Affect Performance? How Effective is the Attention Mask?...

pytorchnlphuggingface-transformerstransformer-model

Read More
HuggingFace: ValueError: expected sequence of length 165 at dim 1 (got 128)...

pythondeep-learningpytorchhuggingface-transformersbert-language-model

Read More
Top-p sampling not working. CUDA error: device-side assert triggered...

pytorchartificial-intelligencehuggingface-transformerssamplinglogits

Read More
BackNext