Search code examples
rggplot2facet

Can ggplot's faceting be used here?


Welcome to Tidyville.

Below is a small df showing the populations of cities in Tidyville. Some cities belong to the A state and some the B state.

I wish to highlight the cities that decreased in population in red. Mission accomplished so far.

But there are many states in Tidyville. Is there a way to use ggplot's faceting faceting to show a plot for each state. I'm uncertain because I'm new and I do a little calculation outside the ggplot call to identify the cities that decreased in population.

library(ggplot2)
library(tibble)

t1 <- tibble (
  y2001 = c(3, 4, 5, 6, 7, 8, 9, 10),
  y2016 = c(6, 3, 9, 2, 8, 2, 11, 15),
  type = c("A", "A", "B", "B", "A", "A", "B", "B")
)

years <-  15
y2001 <- t1$y2001
y2016 <- t1$y2016

# Places where 2016 pop'n < 2001 pop'n
yd <- y2016 < y2001

decrease <- tibble (
  y2001 = t1$y2001[yd],
  y2016 = t1$y2016[yd]
)

# Places where 2016 pop'n >= 2001 pop'n
yi <- !yd

increase <- tibble (
  y2001 = t1$y2001[yi],
  y2016 = t1$y2016[yi]
)

ggplot() + 
  # Decreasing
  geom_segment(data = decrease, aes(x = 0, xend = years, y = y2001, yend = y2016), 
             color = "red") +

  # Increasing or equal
  geom_segment(data = increase, aes(x = 0, xend = years, y = y2001, yend = y2016), 
             color = "black") 

Solution

  • Your intermediary steps are unnecessary and lose some of your data. We'll keep what you created first:

    t1 <- tibble (
      y2001 = c(3, 4, 5, 6, 7, 8, 9, 10),
      y2016 = c(6, 3, 9, 2, 8, 2, 11, 15),
      type = c("A", "A", "B", "B", "A", "A", "B", "B")
    )
    years <- 15
    

    But instead of doing all the separating and subsetting, we'll just create a dummy variable for whether or not y2016 > y2001.

    t1$incr <- as.factor(ifelse(t1$y2016 >= t1$y2001, 1, 0))
    

    Then we can extract the data argument to the ggplot() call to make it more efficient. We'll only use one geom_segment() argument and set the color() argument to be that dummy variable we created before. We then need to pass a vector of colors to scale_fill_manual()'s value argument. Finally, add the facet_grid() argument. If you're only faceting on one variable, you put a period on the opposite side of the tilde. Period first mean's they'll be paneled side-by-side, period last means they'll be stacked on top of each toher

    ggplot(data = t1) +
      geom_segment(aes(x = 0, xend = years, y = y2001, yend = y2016, color=incr)) +
      scale_fill_manual(values=c("black", "red")) +
      facet_grid(type~.)