r Archives - Page 5 of 30

[Solved] Add a column for counting unique tuples in the data frame [duplicate]

January 4, 2023 by Kirat

1) aggregate ag <- aggregate(count ~ ., cbind(count = 1, df), length) ag[do.call(“order”, ag), ] # sort the rows giving: userID A B count 3 1 2 2 1 4 1 3 3 1 2 3 2 1 2 1 5 1 0 2 The last line of code which sorts the rows could be … Read more

[Solved] scatter plot of different color in R

January 4, 2023 by Kirat

Your question is not very clear, but assuming your data is in df, it sounds like you want something like this to get started: plot(1:5, df$var1, pch=19, col=”blue”, ylim=c(0,80)) points(1:5, df$var2, pch=19, col=”red”) As for the trend of the data, what do you mean? A trend for each line? Or do you actually want to … Read more

[Solved] Subset Columns in R using which

January 1, 2023 by Kirat

What is wrong is that you are using the condition which(ff[,1:ncol(ff)]>2.5) to choose rows (we know that there is only a single row) rather than columns. Hence, ff[, which(ff[,1:ncol(ff)]>2.5)] would work. Or simply ff[, ff > 2.5] 3 solved Subset Columns in R using which

[Solved] Several questions on ggplot2 [closed]

January 1, 2023 by Kirat

Have a look at scale_color_manual, the examples should be sufficient. The general structure of tweaking any scale in ggplot2 is to use the appropriate scale function: scale_{aes_name}_{scale_type}, where aes_name can be color, x, or any other aeshetic, and where scale_type can be continuous, discrete, manual, etc. Googling for ggplot2 legend position led me to this … Read more

[Solved] R: How to make aggregate pivot table [closed]

January 1, 2023 by Kirat

data: df<- data.table::fread(“id product qua color month 1 Box 3 red jan 2 Box 14 blue jan 3 Box 22 green jan 4 Box 10 red feb 5 Box 12 blue feb 6 Box 36 green feb 7 Box 31 red mar 8 Box 1 blue mar 9 Box 7 green mar”)[,-1] %>% setDF code: … Read more

[Solved] R subset based on range of dates [closed]

December 31, 2022 by Kirat

Are you asking something like the following? Let’s say your initial dataframe is df, which is the following: df A B C 1 2016-02-16 2016-03-21 2016-01-01 2 2016-07-07 2016-06-17 2016-01-31 3 2016-05-19 2016-09-10 2016-03-01 4 2016-01-14 2016-08-21 2016-04-01 5 2016-09-02 2016-06-15 2016-05-01 6 2016-05-09 2016-07-17 2016-05-31 7 2016-06-13 2016-06-23 2016-07-01 8 2016-09-17 2016-03-11 2016-07-31 9 … Read more

[Solved] Read csv file from directory

December 30, 2022 by Kirat

In windows, ‘\’ is translation operator, you need to use “C://Users//Riya Sajid//Downloads//New folder” solved Read csv file from directory

[Solved] Removing different words form a document using R console

December 30, 2022 by Kirat

A simple way would be to use gsub(paste0(‘\\b’, YOURVECTOROFWORDSTOREMOVE, ‘\\b’, collapse=”|”),”,YOURSTRING) which replaces every occurence of the words in the vector surrounded by either end/beginning characters or whitespace with a single space. but you might want to look at the tm package and work with a corpus object if you have many files like this. … Read more

[Solved] How to create a map with zipcode dataset? ( US Zipcode ) [closed]

December 28, 2022 by Kirat

Try the following code which I modified from January at how do I map (on a geographical map) data in R just given the US Zipcodes: pdf(“myzipcodes.pdf”) library(maps) map(database=”usa”)# national boundaries library(zipcode) data(“zipcode”) myzips <- c(“22313″,”83701″,”32301”) selected <- zipcode[ zipcode$zip %in% myzips, ] points( selected$longitude, selected$latitude, pch= 19, cex= 2 ) text( selected$longitude, selected$latitude, selected$zip, … Read more

[Solved] Selecting rows in R with “Yes” in one column of a set of columns, and NOT “Yes” in all columns of another set of columns

December 28, 2022 by Kirat

as long as you have the dataframe organized the way you do now i.e., comp1type1,comp1type2,comp1type3,comp2type1,…,comp[I]type[J]. I am sure you can use the following method. ncomp <- 20 ntype <- 3 vecone <- df[,seq(1,ncomp*ntype,ntype)] vectwo <- df[,seq(2,ncomp*ntype,ntype)] vecthree <- df[,seq(3,ncomp*ntype,ntype)] # now that we have the vectors of types seperated into data.frame’s # it’ll be easier … Read more

[Solved] Find new and active user each week from user_id and date

December 28, 2022 by Kirat

so I managed to calculate the count_new by checking the first appearance of a user_id and then merging with initial data adding a column that tell if a user is new by date and id then I counted the new by date: library(dplyr) firstshow<-Orders %>% group_by(user_id) %>% arrange(date) %>% slice(1L) %>% mutate(new = “new”) newdata<-merge.data.frame(Orders,firstshow,by=c(“date”,”user_id”),all … Read more

[Solved] In R; I would like to do something in R rather than excel because excel can’t handle the calculation. In excel the calculation is: =A2+SUM($B$2:B2)

December 27, 2022 by Kirat

Look into dplyr https://cran.rstudio.com/web/packages/dplyr/vignettes/introduction.html install.packages(“dplyr”) library(dplyr) df <- df %>% mutate(phys_pos=cumsum(length)+position) I am assuming your data.frame is named df Or with base R df$phys_pos <- cumsum(df$length) + df$position 1 solved In R; I would like to do something in R rather than excel because excel can’t handle the calculation. In excel the calculation is: =A2+SUM($B$2:B2)

[Solved] Combine data table row elements into a new column as a vector [closed]

December 25, 2022 by Kirat

I am not sure if you want something like below DT[,result := asplit(DT,1)] such that > DT col1 col2 col3 result 1: 1 a x 1,a,x 2: 1 b y 1,b,y 3: 1 c z 1,c,z 1 solved Combine data table row elements into a new column as a vector [closed]

[Solved] Make columns in data frame equal to median, mean, etc.? (R) [closed]

December 25, 2022 by Kirat

Your dataframe: df<-data.frame(x=c(1, 3 , 5), y=c(1, 1, 2), z=c(256, 5, 4)) Using dplyr/tidyr: df1<-df%>%gather(var,val,x:z)%>%group_by(var)%>%summarise(max=max(val),min=min(val),avg=mean(val),median=median(val)) you can extend this to any number of summary stats. # A tibble: 3 × 5 var max min avg median <chr> <dbl> <dbl> <dbl> <dbl> 1 x 5 1 3.000000 3 2 y 2 1 1.333333 1 3 z … Read more

[Solved] How to replace a value by another within a variable in R?

December 25, 2022 by Kirat

You can use this approach to get what you need: df = within(df, { variable[variable == “string”] = “string2” variable[is.na(variable)] = mean(variable) }) The trick here is that you can create a subset using [] and assign values again to that subset using [] =. 2 solved How to replace a value by another within … Read more