[R] Odp: convert Factor as numeric
Petr PIKAL
petr.pikal at precheza.cz
Thu Apr 29 13:27:40 CEST 2010
Hi
You have to get rid of thousands separator firsr
as.numeric(gsub(",", "", S))
Regards
Petr
r-help-bounces at r-project.org napsal dne 29.04.2010 13:12:44:
> Dear group,
>
> I know this issue has been already covered, and before you reply I must
say
> I have read the R-FAQ and search the mailing list archive.
> I still can't manage to change my factor to numeric as I couldn't find
any
> clear answer.
>
> Here is my df :
>
> Pose1 <-
> structure(list(DESCRIPTION = structure(c(1L, 2L, 3L, 4L, 5L,
> 8L), .Label = c(" SUGAR NO.11 May/10 ", "COTTON NO.2 May/10 ",
> "PLATINUM Jul/10 ", "ROBUSTA COFFEE (10) May/10 ", "WHEAT May/10 ",
> "PRIMARY NICKEL USD", "PRM HGH GD ALUMINIUM USD", "SPCL HIGH GRADE ZINC
> USD",
> "STANDARD LEAD USD"), class = "factor"), POSITION = c(5, 3, -1,
> 15, 4, 2), SETTLEMENT = structure(c(3L, 5L, 2L, 1L, 4L, 8L), .Label =
> c("1,353.0000",
> "1,739.4000", "16.5400", "467.7500", "78.1300", "25,760.8600",
> "2,415.9000", "2,421.0500", "2,357.1200"), class = "factor")), .Names =
> c("DESCRIPTION",
> "POSITION", "SETTLEMENT"), row.names = c("1", "2", "3", "4",
> "5", "51"), class = "data.frame")
>
> >S<-Pose1$SETTLEMENT #select the last column
> > S
> [1] 16.5400 78.1300 1,739.4000 1,353.0000 467.7500 2,421.0500
> Levels: 1,353.0000 1,739.4000 16.5400 467.7500 78.1300 25,760.8600
> 2,415.9000 2,421.0500 2,357.1200
> > str(S)
> Factor w/ 9 levels "1,353.0000","1,739.4000",..: 3 5 2 1 4 8
>
> Now I need to change S to numeric class
>
> > S1<-as.numeric(levels(S))[as.integer(S)] #doesn't work, numbers are
> rounded or NA
> Warning message:
> NAs introduced by coercion
>
> > S1<-as.numeric(levels(S))[S] #doesn't work, numbers are rounded or NA
> Warning message:
> NAs introduced by coercion
>
> > S1<-as.numeric(as.character(S)) #doesn't work, numbers are rounded or
NA
> Warning message:
> NAs introduced by coercion
>
> If it can help, my column S is part of a DF that has been obtained via
this
> line :
>
>
>pose=read.csv2("LSCPos1.csv",sep=",",dec=".",as.is=T,h=T,skip=1)[,c(4,8,14,
> 15)]
>
> pose <-
> structure(list(DESCRIPTION = c("WHEAT May/10 ", "WHEAT May/10 ",
> "WHEAT May/10 ", "WHEAT May/10 ", "COTTON NO.2 May/10 ", "COTTON NO.2
May/10
> ",
> "COTTON NO.2 May/10 ", "PLATINUM Jul/10 ", " SUGAR NO.11 May/10 ",
> " SUGAR NO.11 May/10 ", " SUGAR NO.11 May/10 ", " SUGAR NO.11 May/10 ",
> " SUGAR NO.11 May/10 ", "ROBUSTA COFFEE (10) May/10 ", "ROBUSTA COFFEE
(10)
> May/10 ",
> "ROBUSTA COFFEE (10) May/10 ", "ROBUSTA COFFEE (10) May/10 ",
> "ROBUSTA COFFEE (10) May/10 ", "ROBUSTA COFFEE (10) May/10 ",
> "ROBUSTA COFFEE (10) May/10 ", "ROBUSTA COFFEE (10) May/10 ",
> "ROBUSTA COFFEE (10) May/10 ", "ROBUSTA COFFEE (10) May/10 ",
> "ROBUSTA COFFEE (10) May/10 ", "ROBUSTA COFFEE (10) May/10 ",
> "PRM HGH GD ALUMINIUM USD 09/07/10 ", "PRM HGH GD ALUMINIUM USD 09/07/10
",
> "PRIMARY NICKEL USD 04/06/10 ", "PRIMARY NICKEL USD 04/06/10 ",
> "PRIMARY NICKEL USD 10/06/10 ", "PRIMARY NICKEL USD 10/06/10 ",
> "STANDARD LEAD USD 01/07/10 ", "STANDARD LEAD USD 01/07/10 ",
> "STANDARD LEAD USD 01/07/10 ", "STANDARD LEAD USD 01/07/10 ",
> "STANDARD LEAD USD 01/07/10 ", "STANDARD LEAD USD 01/07/10 ",
> "STANDARD LEAD USD 01/07/10 ", "STANDARD LEAD USD 06/07/10 ",
> "SPCL HIGH GRADE ZINC USD 08/07/10 ", "SPCL HIGH GRADE ZINC USD 08/07/10
",
> "SPCL HIGH GRADE ZINC USD 08/07/10 ", "SPCL HIGH GRADE ZINC USD 09/07/10
",
> "SPCL HIGH GRADE ZINC USD 09/07/10 ", "SPCL HIGH GRADE ZINC USD 09/07/10
",
> "SPCL HIGH GRADE ZINC USD 09/07/10 ", "SPCL HIGH GRADE ZINC USD 09/07/10
",
> "SPCL HIGH GRADE ZINC USD 13/04/10 ", "SPCL HIGH GRADE ZINC USD 13/04/10
"
> ), CREATED.DATE = structure(c(14705, 14707, 14707, 14711, 14700,
> 14700, 14711, 14711, 14708, 14708, 14708, 14711, 14711, 14707,
> 14707, 14707, 14707, 14707, 14708, 14708, 14708, 14708, 14708,
> 14708, 14708, 14708, 14708, 14672, 14673, 14678, 14678, 14700,
> 14700, 14700, 14700, 14700, 14700, 14700, 14705, 14707, 14707,
> 14707, 14708, 14708, 14708, 14708, 14708, 14622, 14634), class =
"Date"),
> QUANITY = c(1, 1, 1, 1, 1, 1, 1, -1, 1, 1, 1, 1, 1, 2, 1,
> 1, 1, 2, 1, 1, 1, 1, 2, 1, 1, -1, 1, 1, -1, -1, 1, 1, -1,
> 1, -1, -1, 1, -1, 1, 1, 1, -1, -1, 1, -1, 1, 1, 1, -1),
CLOSING.PRICE =
> c("467.7500",
> "467.7500", "467.7500", "467.7500", "78.1300", "78.1300",
> "78.1300", "1,739.4000", "16.5400", "16.5400", "16.5400",
> "16.5400", "16.5400", "1,353.0000", "1,353.0000", "1,353.0000",
> "1,353.0000", "1,353.0000", "1,353.0000", "1,353.0000",
"1,353.0000",
> "1,353.0000", "1,353.0000", "1,353.0000", "1,353.0000",
"2,415.9000",
> "2,415.9000", "25,755.7100", "25,755.7100", "25,760.8600",
> "25,760.8600", "2,355.9600", "2,355.9600", "2,355.9600",
> "2,355.9600", "2,355.9600", "2,355.9600", "2,355.9600",
"2,357.1200",
> "2,420.7300", "2,420.7300", "2,420.7300", "2,421.0500",
"2,421.0500",
> "2,421.0500", "2,421.0500", "2,421.0500", "2,388.4300", "2,388.4300"
> )), .Names = c("DESCRIPTION", "CREATED.DATE", "QUANITY",
> "SETTLEMENT"), row.names = c(NA, -49L), class = "data.frame")
>
> > str(pose)
> 'data.frame': 49 obs. of 4 variables:
> $ DESCRIPTION : chr "WHEAT May/10 " "WHEAT May/10 " "WHEAT May/10 "
"WHEAT
> May/10 " ...
> $ CREATED.DATE:Class 'Date' num [1:49] 14705 14707 14707 14711 14700
...
> $ QUANITY : num 1 1 1 1 1 1 1 -1 1 1 ...
> $ SETTLEMENT : chr "467.7500" "467.7500" "467.7500" "467.7500" ...
>
>
> "Pose$SETTLEMENT" has a "character" class, when it should have been
> "numeric". So maybe a solution would be to give a numeric class when I
read
> my .csv file?
> I tried to change class of this column right after the read.csv()(using
> type.convert() let me with a factor), but again got some rounded number
or
> NA.
>
> So, what am I supposed to do??
>
> TY for the help.
>
> ______________________________________________
> R-help at r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide
http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
More information about the R-help
mailing list