## R: Dataset "AirPassengers" - Test your skills (Part 2)

Did you try the first "Part" of this dataset included in R?

If no, then please have a look here: https://data-science-getting-smarter.rocks/r/dataset/airpassengers-1/

Boxplot is a method for graphically depicting groups of numerical data through their quartiles. Box plots may also have lines extending vertically from the boxes (whiskers) indicating variability outside the upper and lower quartiles, hence the terms box-and-whisker plot and box-and-whisker diagram.

Get the rest...

## R: Dataset included to test your skills (Part 1)

When you install R on you local machine or on your server it comes with datasets to which you can play around with and learn different techniques.

Example you have AirPassengers {datasets} which is "Monthly Airline Passenger Numbers 1949-1960"

Let's try it out together by starting up your Rstudio.

First get the dataset:

Then load the data to AP:

When you have loaded your data it's always good to have a look at it to make sure that the data is loaded correctly and check is the values are as expected.

Get the rest...

## R: dplyr

dplyr is a powerful R-package to transform and summarize tabular data with rows and columns.

dplyr verbs and description:

select() - select columns

filter() - filter rows

arrange() - re-order or arrange rows

mutate() - create new columns

summarise() - summarise values

group_by() - allows for group operations in the “split-apply-combine” concept

filter() example:

select(FL_DATE, CARRIER, ORIGIN, ORIGIN_CITY_NAME, ORIGIN_STATE_ABR, DEP_DELAY, DEP_TIME, ARR_DELAY, ARR_TIME) %>%

filter(CARRIER == "UA")

Here you see the operater %>% please read more about this here

Try out dplyr with the help from these great examples from and see this video from Rstudio

## R: Statistics Features of R

a. Basic Statistics: Mean, variance, median.

b. Static graphics: Basic plots, graphic maps.

c. Probability distributions: Beta, Binomial.

Programming Features of R

a. Distributed Programming

b. R Packages

Mean example:

b <- mean(data)

b