Search code examples
I have a tensor of shape [5, 2, 18, 4096]. I want to stack the 0th dimension along the 2nd dimension...

python machine-learning deep-learning pytorch transformer-model

Get probability of multi-token word in MASK position...

python pytorch transformer-model bert-language-model huggingface-transformers

Why doesn't the transformer use positional encoding in every layer?...

machine-learning artificial-intelligence transformer-model

Do you need to put EOS and BOS tokens in autoencoder transformers?...

python pytorch transformer-model

Understanding dimensions in MultiHeadAttention layer of Tensorflow...

tensorflow nlp transformer-model attention-model

BOS token for encoder decoder models...

deep-learning huggingface-transformers transformer-model huggingface-tokenizers

How to preserve column order after applying sklearn.compose.ColumnTransformer on numpy array...

python scikit-learn numpy-ndarray scaling transformer-model

The decoder part in a transformer model...

nlp transformer-model decoder

How to get YoloS predicted bounding boxes ordered: from top part of image to lower...

computer-vision object-detection yolo transformer-model yolov5

gcc ON arm/android...

android gcc arm android-3.0-honeycomb transformer-model

tf.keras.layers.MultiHeadAttention's argument key_dim sometimes not matches to paper's examp...

tensorflow tf.keras transformer-model attention-model

Masking layer vs attention_mask parameter in MultiHeadAttention...

python tensorflow keras transformer-model

HuggingFace Summarization: effect of specifying both `do_sample` and `num_beams`...

nlp huggingface-transformers transformer-model summarization beam-search

How to predownload a transformers model...

machine-learning flask amazon-elastic-beanstalk transformer-model huggingface-transformers

How to resume training in spacy transformers for NER...

deep-learning spacy named-entity-recognition transformer-model

Why heads share same KQV weights(matrix) in transformer?...

pytorch transformer-model

Transformer summariser pipeline giving different results on same model with fixed seed...

deep-learning nlp huggingface-transformers transformer-model summarization

Extracting specific blocks from a module list...

python pytorch transformer-model

RuntimeError: module compiled against API version 0xe but this version of numpy is 0xd when importin...

python-3.x importerror transformer-model sentence-transformers

Clustering based on semantic similarity returning no values...

python-3.x pandas numpy word-embedding transformer-model

How does NLP model know the output length during translation tasks?...

nlp huggingface-transformers bert-language-model transformer-model sentence-transformers

Do weights of the [PAD] token have a function?...

huggingface-transformers word-embedding transformer-model huggingface-tokenizers huggingface

SkLearn DecisionTree Doesn't Include Numerical Features After Fitting...

python machine-learning scikit-learn transformer-model

Having 6 labels instead of 2 in Hugging Face BertForSequenceClassification...

python transformer-model huggingface-transformers bert-language-model

How to handle target decoder inputs for self attention transformer model during predict()...

tensorflow keras transformer-model attention-model

Does HuggingFace's Trainer automatically ignore features not required by the model?...

deep-learning huggingface-transformers bert-language-model transformer-model

Difference between from_config and from_pretrained in HuggingFace...

huggingface-transformers transformer-model distilbert

How to create an iterable DataPipe with PyTorch using txt files...

python nlp pytorch transformer-model torchtext

PyTorch TransformerEncoderLayer different input order gets different results...

python pytorch transformer-model

Issue in tensor value assignment. What can be wrong in script?...

python pipeline tensor transformer-model
