select(): Selecting columns with select

Although our data table only has six columns, some data tables include hundreds. If we want to view just a few, we can use the dplyr select function. In the code below we select three columns, assign this to a new object and then filter the new object:

new_table <- select(murders, state, region, rate)
filter(new_table, rate <= 0.71)
#>           state        region  rate
#> 1        Hawaii          West 0.515
#> 2          Iowa North Central 0.689
#> 3 New Hampshire     Northeast 0.380
#> 4  North Dakota North Central 0.595
#> 5       Vermont     Northeast 0.320

In the call to select, the first argument murders is an object, but state, region, and rate are variable names.

Instruction

Run the sample code to see how select() function works.

library(dplyr) library(dslabs) data("murders") # Add the rate column murders <- mutate(murders, rate = total / population * 100000) # Check murders to find that the new column is added. head(murders) # select new table with 4 columns new_table <- select(murders, state, region, rate) # filter new_table filter(new_table, rate <= 0.71)

Previous: 3-4 | filter(): Subsetting with filter

Next: 3-6 | %>% : Pipe

Back to Main