[R] Function gutenberg_download in the gutenbergr package
Patrick Connolly
p_connolly at slingshot.co.nz
Wed Jan 24 08:23:06 CET 2018
I've been working through https://www.tidytextmining.com/tidytext.html
wherein everything worked until I got to this part in section 1.5
> hgwells <- gutenberg_download(c(35, 36, 5230, 159))
Determining mirror for Project Gutenberg from http://www.gutenberg.org/robot/harvest
Error in open.connection(con, "rb") :
Failed to connect to www.gutenberg.org port 80: Connection timed out
Which indicates the problem is at the very start:
if (is.null(mirror)) {
mirror <- gutenberg_get_mirror(verbose = verbose)
}
The documentation for gutenberg_get_mirror indicates there's nothing
different I could set.
So I tried specifying my usual mirror:
> hgwells <- gutenberg_download(c(1260, 768, 969, 9182, 767), mirror = "http://cran.stat.auckland.ac.nz")
Error in read_zip_url(full_url) : could not find function "read_zip_url"
>
Which is, indeed, strange since according to
> help.search("read_zip_url")
Help files with alias or concept or title matching ‘read_zip_url’ using
regular expression matching:
gutenbergr::read_zip_url
Read a file from a .zip URL
Aliases: read_zip_url
[...]
And according to
library(help = "gutenbergr")
[...]
Index:
gutenberg_authors Metadata about Project Gutenberg authors
gutenberg_download Download one or more works using a Project
Gutenberg ID
gutenberg_get_mirror Get the recommended mirror for Gutenberg files
gutenberg_metadata Gutenberg metadata about each work
gutenberg_strip Strip header and footer content from a Project
Gutenberg book
gutenberg_subjects Gutenberg metadata about the subject of each
work
gutenberg_works Get a filtered table of Gutenberg work metadata
read_zip_url Read a file from a .zip URL
[...]
However, when I look at the list for that part of the search(), there
is no read_zip_url but all the rest of that list are present. So it's
not surprising that it isn't found. But it puzzles me that it is not
there.
Ideas as to where I should proceed gratefully appreciated.
> sessionInfo()
R version 3.4.2 (2017-09-28)
Platform: x86_64-pc-linux-gnu (64-bit)
Running under: Ubuntu 14.04.5 LTS
Matrix products: default
BLAS: /home/hrapgc/local/R-3.4.2/lib/libRblas.so
LAPACK: /home/hrapgc/local/R-3.4.2/lib/libRlapack.so
locale:
[1] LC_CTYPE=en_NZ.UTF-8 LC_NUMERIC=C
[3] LC_TIME=en_NZ.UTF-8 LC_COLLATE=en_NZ.UTF-8
[5] LC_MONETARY=en_NZ.UTF-8 LC_MESSAGES=en_NZ.UTF-8
[7] LC_PAPER=en_NZ.UTF-8 LC_NAME=C
[9] LC_ADDRESS=C LC_TELEPHONE=C
[11] LC_MEASUREMENT=en_NZ.UTF-8 LC_IDENTIFICATION=C
attached base packages:
[1] grDevices utils stats graphics methods base
other attached packages:
[1] sos_2.0-0 brew_1.0-6 gutenbergr_0.1.3 ggplot2_2.2.1
[5] stringr_1.2.0 bindrcpp_0.2 dplyr_0.7.4 janeaustenr_0.1.5
[9] tidytext_0.1.6 FactoMineR_1.38 readxl_1.0.0 tm_0.7-3
[13] NLP_0.1-11 wordcloud_2.5 RColorBrewer_1.1-2 lattice_0.20-35
loaded via a namespace (and not attached):
[1] Rcpp_0.12.13 cellranger_1.1.0 compiler_3.4.2
[4] plyr_1.8.4 bindr_0.1 tokenizers_0.1.4
[7] tools_3.4.2 gtable_0.2.0 tibble_1.3.4
[10] nlme_3.1-131 pkgconfig_2.0.1 rlang_0.1.2
[13] Matrix_1.2-11 psych_1.7.8 curl_3.0
[16] parallel_3.4.2 xml2_1.1.1 cluster_2.0.6
[19] hms_0.3 flashClust_1.01-2 grid_3.4.2
[22] scatterplot3d_0.3-40 glue_1.1.1 ellipse_0.3-8
[25] R6_2.2.2 foreign_0.8-69 readr_1.1.1
[28] purrr_0.2.4 tidyr_0.7.2 reshape2_1.4.2
[31] magrittr_1.5 scales_0.5.0 SnowballC_0.5.1
[34] MASS_7.3-47 leaps_3.0 assertthat_0.2.0
[37] mnormt_1.5-5 colorspace_1.3-2 labeling_0.3
[40] stringi_1.1.5 lazyeval_0.2.1 munsell_0.4.3
[43] slam_0.1-42 broom_0.4.2
>
--
~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.
___ Patrick Connolly
{~._.~} Great minds discuss ideas
_( Y )_ Average minds discuss events
(:_~*~_:) Small minds discuss people
(_)-(_) ..... Eleanor Roosevelt
~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.~.
More information about the R-help
mailing list