- #1

Trollfaz

- 137

- 14

- TL;DR Summary
- R

For R's dplyr package this is my query.

Suppose I have a data frame/tibble of n observations or n rows. Let's call it df1. Is the time complexity for dplyr's basic manipulation functions O(N)

filter()

select()

mutate() assuming mutate is O(1)

rename()

summarize()

count()

separate()

unite()

spread()

gather()

If I have another data frame/tibble df2 of m rows, then are the following functions of time complexity O(N+M)

inner_join(df1,df2)

right/left_join(df1,df2)

outer_join(df1,df2)

Suppose I have a data frame/tibble of n observations or n rows. Let's call it df1. Is the time complexity for dplyr's basic manipulation functions O(N)

filter()

select()

mutate() assuming mutate is O(1)

rename()

summarize()

count()

separate()

unite()

spread()

gather()

If I have another data frame/tibble df2 of m rows, then are the following functions of time complexity O(N+M)

inner_join(df1,df2)

right/left_join(df1,df2)

outer_join(df1,df2)