Search code examples
shelljoinawkpastecut

paste two files according to concordance between columns


I would need some help about this:

I have file1:

ID
100
102
103
104
108
109
112
.
.
.

And file2:

ID    []    p1    p2
100    2.5    3.0    2.0
101    2.0    4.0    3.0
102    2.6    4.0    2.5
103    2.3    2.0    NA
104    2.3    2.0    2.0
105    3.5    2.8    2.0
106    1.7    NA    3.2
107    5.0    4.0    4.0
108    3.2    2.0    4.0
109    2.9    1.0    1.5
110    5.0    NA    NA
111    2.9    4.0    4.0
112    3.1    2.5    2.0
.
.
.

I would like to paste both files in file3 looking like:

ID    []    p1    p2
100    2.5    3.0    2.0
102    2.6    4.0    2.5
103    2.3    2.0    NA
104    2.3    2.0    2.0
108    3.2    2.0    4.0
109    2.9    1.0    1.5
112    3.1    2.5    2.0
.
.
.

Basically, pasting data in fields 2,3,4 from file2 to file1, considering concordancies in field1 in both file1 and file2.

I've tried some awk commands with NR == NFR but I just get the output with the content of file1 followed by content of file2...

Any help? Unix commands with cut and paste are also welcome


Solution

  • You can indeed use awk for that:

    awk 'NR==FNR{a[$1]=1} NR>FNR && a[$1]' file1 file2
    

    NR==FNR{a[$1]=1} The a array filled with the content of file1.

    NR>FNR && a[$1] is printing the line of file2 if the array contains the ID (aka $1).