Tags: pytorch, torchtext

torchtext data build_vocab / data_field


I have a question about torchtext.

I am working on abstractive text summarization, and I have built a seq2seq model with PyTorch.

I am wondering about the data field whose vocabulary is built by the build_vocab function in torchtext.

In machine translation, I understand that two data fields (input, output) are needed.

But in summarization, the input data and the output data are in the same language.

In this case, should I create two data fields (full_sentence, abstract_sentence)?

Or is it okay to use only one data field?

I'm afraid that the wrong choice will hurt the model's performance.

Please give me a hint.


Solution

  • You are right: in summarization and other tasks where the input and output share a language, it makes sense to build and use the same vocabulary (and the same field) for both. A sketch of this setup follows below.
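
    A minimal sketch of a shared field, assuming the classic torchtext.data API (moved to torchtext.legacy.data in torchtext >= 0.9), the spaCy tokenizer, and a hypothetical CSV layout with the article in the first column and the summary in the second; file names, paths, and column mapping are illustrative only:

        # Shared Field for source articles and target summaries.
        from torchtext.data import Field, TabularDataset, BucketIterator

        # One Field used by both columns, since input and output
        # are in the same language.
        TEXT = Field(tokenize="spacy",
                     init_token="<sos>",
                     eos_token="<eos>",
                     lower=True)

        # Positional mapping: column 0 -> src (article), column 1 -> trg (summary).
        fields = [("src", TEXT), ("trg", TEXT)]

        train_data, valid_data = TabularDataset.splits(
            path="data",            # assumed data directory
            train="train.csv",
            validation="valid.csv",
            format="csv",
            fields=fields,
            skip_header=True,
        )

        # build_vocab collects tokens from every column that uses this Field,
        # so the vocabulary covers both the articles and the summaries.
        TEXT.build_vocab(train_data, min_freq=2)

        train_iter, valid_iter = BucketIterator.splits(
            (train_data, valid_data),
            batch_size=32,
            sort_key=lambda ex: len(ex.src),
        )

    A side benefit of the shared vocabulary is that, if you want, you can tie the encoder and decoder embedding weights in your seq2seq model, since both sides index into the same vocab.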