[R] How to access https page

Prof Brian Ripley ripley at stats.ox.ac.uk
Tue Mar 10 07:28:38 CET 2015

On 09/03/2015 22:39, Hui Du wrote:
> Hi All,
> I am trying to parse some information from website, say, a linkedin page.
> The linkedin url was
> url = "http://www.linkedin.com/in/huidu"
> I had no problem to use readLines and XML package to collect the
> information I need. However, that url became "
> https://www.linkedin.com/in/huidu" now.
> url = "https://www.linkedin.com/in/huidu"
> It failed readLines function.
>> readLines(url)
> Error in file(con, "r") : cannot open the connection
> In addition: Warning message:
> In file(con, "r") : unsupported URL scheme
> Do you know any way to read-in web information if the url is https? Thanks
> a lot.

Try R-devel, soon to become R 3.2.0.  That has support for this on 
platforms where libcurl is installed (which should be possible almost 

You did not give the 'at a minimum' information required by the posting 
guide.  This has long been possible on Windows with --internet2.

> Hui
> 	[[alternative HTML version deleted]]
> ______________________________________________
> R-help at r-project.org mailing list -- To UNSUBSCRIBE and more, see
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.

Brian D. Ripley,                  ripley at stats.ox.ac.uk
Emeritus Professor of Applied Statistics, University of Oxford
1 South Parks Road, Oxford OX1 3TG, UK

More information about the R-help mailing list