[R] RSXML - Parsing XML Documents on Internet

Prof Brian Ripley ripley at stats.ox.ac.uk
Thu Sep 30 08:57:00 CEST 2004

Sorry, my flaky connection (I am on a slow dialup) broke this up: here's 
the rest.

On Thu, 30 Sep 2004, Prof Brian Ripley wrote:

> On Wed, 29 Sep 2004 larsenmtl at comcast.net wrote:


> > Now I know that xmlTreeParse uses the libxml facilities for downloading and 
> > parsing off the web.   Along with one of our network people, I did some packet 
> > sniffing and it looks like libxml doesn't go through our proxy server (it tries 
> > to directly connect to the above URL), which is the reason for the error.  Is 
> > there anyway to force it through the proxy?  Am I missing some setting or 
> > option?  If I download the xml file and parse it locally it works without error.
> > 
> > Please Note that the URL is valid and I can open it in my browser.  Also note 
> > that I must start R with the --internet2 option so it'll use our proxy server.  
> That's your problem.  You *can* use proxies without --internet2 (see 
> ?download.file) and the XML code uses the standard version of the code.

The first issue is that you have not configured R to use your proxy, so 
please get that working.

Issue two is that you may need to get an older version of XML compiled 
against an older libxml, as the current one has not been tested.

Brian D. Ripley,                  ripley at stats.ox.ac.uk
Professor of Applied Statistics,  http://www.stats.ox.ac.uk/~ripley/
University of Oxford,             Tel:  +44 1865 272861 (self)
1 South Parks Road,                     +44 1865 272866 (PA)
Oxford OX1 3TG, UK                Fax:  +44 1865 272595

More information about the R-help mailing list