Trouble with reading CSV file in R -
i'm newbie in r. have 24mb csv file. reads rstudio on macbook air os yoswmite, 4gb ram. r version 3.1.1 (2014-07-10). viewing contents of view (df) ok. attempting apply filter. not hits. attempting cast character number. r replacing charaters na in column casting done! happens here? seems r can not read contents of cells. there encoding? have done fare: first summary:
r code:
eiendommer <- read.csv("eiendommer.csv", sep = ";", quote = "", encoding="utf-8", stringsasfactors = false) view(eiendommer)# can view content of csv file filtereiendommer <- filter(eiendommer, kommune == "0101")# no match filtereiendom <- eiendommer [eiendommer$kommune == "0101",]#no match utvalg <- eiendommer[160567:161934,]#manual selection of rows work utvalgsortert <- arrange(utvalg, desc(jordbruksareal), desc(skogareal))# works view(utvalgsortert) ##try transform columns character number. transformedeiendom <- transform(sorterteiendom, jordbruksareal = as.numeric(jordbruksareal), skogareal = as.numeric(skogareal) ) #this result in na earlier characters lengt 1-3:"646", "18", "2"
summary:
kommune x.gardsnr. x.bruksnr. x.festenr. bruksnavn jordbruksareal length:207554 length:207554 length:207554 length:207554 length:207554 length:207554 class :character class :character class :character class :character class :character class :character mode :character mode :character mode :character mode :character mode :character mode :character x.annetareal. skogareal x.fulldyrket. x.overflatedyrket. x.innmarksbeite. length:207554 length:207554 length:207554 length:207554 length:207554 class :character class :character class :character class :character class :character mode :character mode :character mode :character mode :character mode :character
head:
head(eiendommer) kommune x.gardsnr. x.bruksnr. x.festenr. bruksnavn jordbruksareal x.annetareal. skogareal x.fulldyrket. 1 "0101" "1" "1" "0" "prestegÅrd" "0" "5" "0" "0" 2 "0101" "1" "6" "0" "mo" "8" "4" "7" "8" 3 "0101" "1" "9" "0" "berg gÅrd" "415" "16" "39" "415" 4 "0101" "2" "1" "0" "boberg" "467" "22" "276" "463" 5 "0101" "4" "1" "0" "lundestad" "877" "62" "793" "837" 6 "0101" "4" "5" "0" "leiren" "74" "14" "165" "74"
it appear have specified quote = ""
, in fact should have, , perhaps quote='"'
or default value work.
see example below
d <- data.frame(x='a',y='"a"',stringsasfactors=false) d # x y # 1 "a"
for regular character vector, print.data.frame
not enclose in "
Comments
Post a Comment