Pages Menu
Categories Menu

Posted by on Feb 2, 2012 in R Spatial, Resources | 26 comments

Great Maps with ggplot2

The above map (and this one) was produced using R and ggplot2 and serve to demonstrate just how sophisticated R visualisations can be. We are used to seeing similar maps produced with conventional GIS platforms or software such as Processing but I hadn’t yet seen one from the R community (feel free to suggest some in the comments). The map contains three layers: buildings, water and the journey segments. The most challenging aspect was to change the standard line ends in geom_segment from “butt” to “round” in order that the lines appeared continuous and not with “cracks” in, see below.

I am grateful to Hadley and the rest of the ggplot2 Google Group for the solution. You can see it here. From this point I layered the plots using the geom_polygon() command for the buildings and water bodies and my new function geom_segment2() for the journey segments- these were simply the start and end latitudes and longitudes for each node in the road network and the number of times a cyclist passed between them. I have included the code below

#Code supplied by james cheshire Feb 2012
#load packages and enter development mode

#if your map data is a shapefile use maptools

#create GeomSegment2 function
GeomSegment2 <- proto(ggplot2:::GeomSegment, { objname <- "geom_segment2" draw <- function(., data, scales, coordinates, arrow=NULL, ...) { if (is.linear(coordinates)) { return(with(coord_transform(coordinates, data, scales), segmentsGrob(x, y, xend, yend, default.units="native", gp = gpar(col=alpha(colour, alpha), lwd=size * .pt, lty=linetype, lineend = "round"), arrow = arrow) )) } }}) geom_segment2 <- function(mapping = NULL, data = NULL, stat = "identity", position = "identity", arrow = NULL, ...) { GeomSegment2$new(mapping = mapping, data = data, stat = stat, position = position, arrow = arrow, ...) } #load data stlat/stlong are the start points elat/elong are the end points of the lines lon<- read.csv("bikes_london.csv", header=F, sep=";") names(lon)<-c("stlat", "stlon", "elat", "elong", "count") #load spatial data. You need to fortify if loaded as a shapefile water<- fortify(readShapePoly("waterfeatures.shp")) built<- fortify(readShapePoly("buildings.shp")) #This step removes the axes labels etc when called in the plot. xquiet<- scale_x_continuous("", breaks=NA) yquiet<-scale_y_continuous("", breaks=NA) quiet<-list(xquiet, yquiet) #create base plot plon1<- ggplot(lon, aes(x=stlon, y=stlat)) #ready the plot layers pbuilt<-c(geom_polygon(data=built, aes(x=long, y=lat, group=group), colour= "#4B4B4B", fill="#4F4F4F", lwd=0.2)) pwater<-c(geom_polygon(data=water, aes(x=long, y=lat, group=group), colour= "#708090", fill="#708090")) #create plot plon2<- plon1 +pbuilt+ pwater+ geom_segment2(aes(xend=elong, yend=elat, size= count, colour=count))+scale_size(range=c(0.06, 1.8))+scale_colour_gradient(low="#FFFFFF", high="#FFFF33", space="rgb")+coord_equal(ratio=1/cos(lon$elat[1]*pi/180))+quiet+ opts(panel.background=theme_rect(fill="#404040")) #done plon2


  1. WOW, that is an amazing map from R.

    Would it be too rude to ask you to post the code, or at least a tutorial / example version of it? Especially how you go about getting data for each of the layers and using it within the plot

    Do you query openstreetmap dynamically, or manually? etc.

    I can see doing some US postal analysis using a mix of open street map and our own internal data.

  2. I am quite curious as well!

    • I am in the process of cleaning things up and will post some code soon

  3. Especially since it is your package being used!

    • Code added, see above. Glad you like it!

      • Thanks!

        Bear with me, I am a very junior R person (but excessively senior with BI tools and SQL)

        Your inputs are….
        bikes_london, which literally has each straight line from point A to point B with the count of cycles hired (which controls line weight). My equivalent to this would be a file of routes, likely replacing count with a route ID and having different colors per route.

        I would also need a similar file if I wanted to plot roads

        Shapefiles for water and buildings – silly question, where do you get such files? I would be looking for US data.

        Does the order of the layers matter? It looks like they are specified from the top down.

        Once again, this is a fantastic map.

      • Great work! Is the “bikes_london.csv” dataset available to check how data is stored? How is stlat and stlon different from elat and elon? I want to see how the data is to be stored.

  4. In one word: AMAZING

  5. Very inspiring! Thanks!

  6. Amazing work. Thank you James

  7. Very nice! Is the use of gpclibPermit() strictly necessary, and if so for what? It has a very restrictive license, which you see if you look at the source.

    • Thanks Roger, I’m glad you like it. gpclibPermit() is required for fortify()- I find it a convenient way to extract the polygon boundary coordinates into a format that works with ggplot(). There are obviously ways around this but I didn’t want to complicate the code too much here.

      • In fact, gpclibPermit() may only be needed on systems without rgeos installed, as unionSpatialPolygons() inside fortify.sp() will use rgeos rather than gpclib if available, even if gpclibPermit() has been issued. And ggplot2 could drop suggesting gpclib, because maptools already suggests it; it is only used in that one place.

        So typically it is only needed on OSX systems where the user doesn’t setRepositories(ind=1:2) before installing spatial things. My guess is that fortify.sp will use rgeos if available. Could you contact me off-page if you could let me check the call logic avoiding gpclib in your case?

        The concern about avoiding gpclib is motivated by movement towards licence-based filtering on CRAN, because institutions will want positive confirmation that their software train is not encumbered – rgeos is not encumbered, gpclib is, unfortunately.

  8. Thanks for sharing. This is terrific data visualization. It really speaks for itself. You’ve really captured the data side of things while appealing to the aesthetics. It’s inspired me to delve deeper into ggplot2 in combination with various mapping resources. Thanks again.

  9. Beautiful and inspirational! Way to elevate the bar!
    Thanks for sharing your code too.

  10. Thanks for sharing.
    I’m trying to execute your code but I’m always getting this error when I try to print out the result:

    Error : Discrete value supplied to continuous scale

    Do you have any clue?


    • Found!
      I had a header at the first line of my CSV file and according to this line of code:
      lon<- read.csv("bikes_london.csv", header=F, sep=";")
      header was set to (F)alse…


  11. Hi,
    can you tell me where I can find the bikes_london.csv dataset?

  12. Looks great!

    I tried to run your code, but apparently R does not know how to interpret the space between ‘GeomSegment2’ and ‘objname'(and also between draw and if, I would guess) in the lines

    #create GeomSegment2 function
    GeomSegment2 objname <- “geom_segment2″
    draw if (is.linear(coordinates)) {

    Do I have to change anything about the code to make it run?

    I would appreciate your help.

  13. Brilliant! Kudos

  14. This is amazing, this has opened new doors as to how R can be used even in spatial visualisations. Appreciated the sharing of knowledge, thank you!

  15. Really great use of ggplot! Is this data open source and accessible on the web?

  16. Amazing stuff. Where can I get the bikes_london.csv data file?


  1. London Cycle Hire and Pollution | Spatial Analysis - [...] The routes have been guessed using routing algorithms and OpenStreetMap data and optimised for cyclists (ie we assumed that…
  2. eoda, R und Datenanalyse » Geodatenanalyse und -visualisierung mit R - [...] faszinierendes Beispiel für die Möglichkeiten der Datenvisualisierung mit R hat der Brite James Cheshire geliefert. Er nutzte die Daten…
  3. London Cycle Hire and Pollution - [...] The routes have been guessed using routing algorithms and OpenStreetMap data and optimised for cyclists (ie we assumed that…
  4. The changing face of “Analysis” « Drunks&Lampposts - [...] Cheshire over at SpatialAnalysis has shown what is possible with visualising data using ggplot2. Even five years ago this…
  5. When should we use a black background for a map? « Andrew Wheeler - [...] many of James Chesire’s maps of London flows are quite nice. For two examples see one with a darker…
  6. Data Visualization: Graphics with GGPlot2 - [...] London Bike Routes [...]
  7. Serpents de charges « @comeetie :: blog - [...] nombre de vélib / mois parcourant chaque artère. Une telle stratégie à déjà été employée sur londre par des…
  8. Data Visualization: Better Events & Education - Data Community DC - [...] presents those objects, clearing the way for discussion and discovery. Great examples include The London Cycle Hire Journeys, National…
  9. London Cycle Hire and Pollution | The Mapping London Blog - [...] The routes have been guessed using routing algorithms and OpenStreetMap data and optimised for cyclists (ie we assumed that…
  10. R: Street of France › this and that about applied statistics - [...] geom_segment function for plotting the lines was originares form the blog-post Great Maps with ggplot2. The function conv_sp_lines_to_seg converts…
  11. R – update to OpenStreetMaps package | BInalytics - [...] 6 months or so (something tells me, I might be a bit rusty). Whilst I like GIS visualization from…
  12. Daily Data: 5/22 | Austin+Wehrwein - […] graphic made with ggplot2 shows that data does not always have to be dull. Visit this link here to…
  13. Improving R Data Visualisations Through Design | - […] wall”. This map, showing cyclists in London by time of day, was created from code similar to this…
  14. BibSonomy :: url :: Great Maps with ggplot2 | - […] […]

Post a Reply

Your email address will not be published. Required fields are marked *