Equal genomic intervals between samples

I would like to found the exactly same genomic intervals shared between samples (NE_id).

My Input:

chr  start_call   end_call  NE_id 
chr1    150         200      NE01
chr1    150         200      NE02
chr2    100         150      NE01
chr2    100         160      NE02
chr3    200         300      NE01   
chr3    200         300      NE02

My expected output:

chr  start_call   end_call  NE_id 
chr1    150         200      NE01, NE02   
chr3    200         300      NE01, NE02

In this example the chr2 genomic interval have some overlap, however it don´t correspond to the exact same genomic interval (size difference == 10).

Thank you very much.

Solution

If dat is the data, you could try:

res <-aggregate(NE_id~., data=dat, FUN=I)
res[sapply(res$NE_id,length)>1,]
#    chr  start_call end_call     NE_id
# 3 chr1        150      200 NE01, NE02
# 4 chr3        200      300 NE01, NE02

Estimating non-monotonic bi-exponential curve fit
column type issue when converting csv to parquet using duckdb in R
"Target position can only be set for new windows" in chromote in R
Determine level of nesting in R?
Week start on Mondays
Center output from dm_draw
plot a network based on given values
Adding a X axis title to faceted ggballoonplot
Calculate mean of matrices having different dimensions
check if two columns have a one-to-one relationship in R
How to extract Std.Dev from VarCorr glmmTMB
How do you print to stderr in R?
How to plot China map with South China Sea in base R
Get column and row position of nth element in a matrix
Is there any authoritative documentation on R release nicknames?
R Glassdoor Web Scraping
Issue with graticule across 180° for several country/territory EEZs
Separating grouped layers in a raster stack in terra
How can I use group_by and mutate to perform a subtraction calculation with specific groupings? Time 0 minus Time X for all groups
How to directly open .R data containing data frame code in R?
Way to web-scrape a popular eSport website using R?
Variance calculation warning: longer object length is not a multiple
gratia::draw(): "'length.out' must be a non-negative number"
Using Swift as custom engine in knitr and including all previous content
convert source target value dataframe into a correlation matrix
ggplot2 plotting a 100% stacked area chart
Use string as formula for ipwtm function?
interpolarization within groups with NA
Multi-row x-axis labels in ggplot line chart
How to do a SOAP request for EUR-Lex API with R?