r, web-scraping, rvest, httr

Scrape "aspx" page with R


Can someone help me, or suggest how to scrape the table from this URL: https://www.promet.si/portal/sl/stevci-prometa.aspx?

I tried the rvest, httr and html packages, following their instructions, but had no success with this particular site. Thank you.


Solution

  • This ought to help get you started:

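    library(wdman)          # starts and manages the Selenium server
    library(seleniumPipes)  # pipe-friendly WebDriver client
    library(rvest)
    library(tidyverse)
    
    # Plain rvest/httr requests did not work for this page, so drive a real browser.
    selServ <- selenium(verbose = FALSE)
    selServ$log() # find the port
    remDr <- remoteDr(browserName = "chrome", port = 4567L)
    
    remDr %>% 
      go("https://www.promet.si/portal/sl/stevci-prometa.aspx")
    
    Sys.sleep(5) # give the page time to load the table
    
    pg <- getPageSource(remDr) # parsed HTML of the page as the browser sees it
    
    # Pick the table inside the counters container and convert it to a tibble.
    html_node(pg, xpath=".//div[@id='ctl00_mainContent_ctl00_StvContainer']/table") %>% 
      html_table() %>% 
      as_tibble()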
    ## # A tibble: 1,239 x 10
    ##    X1    X2            X3     X4                       X5     X6      X7     X8    X9     X10  
    ##    <lgl> <chr>         <chr>  <chr>                    <chr>  <chr>   <chr>  <chr> <chr>  <lgl>
    ##  1 NA    Lokacija      Cesta  Smer                     Pas    Števil… Hitro… Razm… Stanje NA   
    ##  2 NA    Ajdovščina    R2-444 vzhod - zahod            ""     60      64     81,7  Norma… NA   
    ##  3 NA    Ajdovščina    R2-444 zahod - vzhod            ""     12      62     371,6 Norma… NA   
    ##  4 NA    Ajdovščina 2  R2-444 Ajdovščina - Selo        ""     36      67     117,8 Norma… NA   
    ##  5 NA    Ajdovščina 2  R2-444 Ajdovščina - Selo        ""     12      60     787,1 Norma… NA   
    ##  6 NA    Ajdovščina AC HC-H4  Nova Gorica - Vipava     vozni  96      100    31,5  Norma… NA   
    ##  7 NA    Ajdovščina AC HC-H4  Nova Gorica - Vipava     prehi… 36      124    120,7 Norma… NA   
    ##  8 NA    Ankaran       R2-406 Križ. Moretini - Ankaran ""     96      59     29    Norma… NA   
    ##  9 NA    Ankaran       R2-406 Ankaran - Križ. Moretini ""     12      57     292,1 Norma… NA   
    ## 10 NA    Apače         R2-438 Trate - Gornja Radgona   ""     24      58     110,6 Norma… NA   
    ## # ... with 1,229 more rows
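
The result above still has the header in row 1, two all-NA columns (X1 and X10), and decimal commas in X8. A minimal base-R sketch of one way to tidy that up follows. The `raw` data frame is a hypothetical stand-in built from a few of the printed rows, and the English column names are assumptions for illustration. The headers of X6 to X8 are truncated in the printed output, so they get neutral placeholder names here rather than guessed ones.

```r
# Hypothetical stand-in for the scraped table: same 10-column shape as the
# printed output, with the header row plus two data rows copied from it.
# Values ending in "…" are truncated in the printout and are kept as shown.
raw <- data.frame(
  X1  = c(NA, NA, NA),
  X2  = c("Lokacija", "Ajdovščina", "Ankaran"),
  X3  = c("Cesta", "R2-444", "R2-406"),
  X4  = c("Smer", "vzhod - zahod", "Križ. Moretini - Ankaran"),
  X5  = c("Pas", "", ""),
  X6  = c("Števil…", "60", "96"),
  X7  = c("Hitro…", "64", "59"),
  X8  = c("Razm…", "81,7", "29"),
  X9  = c("Stanje", "Norma…", "Norma…"),
  X10 = c(NA, NA, NA),
  stringsAsFactors = FALSE
)

# Keep only columns that are not entirely NA (drops X1 and X10 here).
keep <- !vapply(raw, function(col) all(is.na(col)), logical(1))

# Drop the first row, which holds the header text rather than data.
clean <- raw[-1, keep, drop = FALSE]
rownames(clean) <- NULL

# Assumed English names; value6..value8 are placeholders for truncated headers.
names(clean) <- c("location", "road", "direction", "lane",
                  "value6", "value7", "value8", "status")

# Convert the numeric columns; X8 uses a decimal comma, so swap it for a dot.
clean$value6 <- as.integer(clean$value6)
clean$value7 <- as.integer(clean$value7)
clean$value8 <- as.numeric(sub(",", ".", clean$value8, fixed = TRUE))

str(clean)
```

On the real scrape, the same steps would be applied to the tibble returned by `html_table()` instead of `raw`. It is worth checking the full header text first, so the placeholder names can be replaced with meaningful ones.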