Data Structures in R Programming Assignment

Use R Programming to perform the following analysis

  • Print the structure of your dataset
  • List the variables in your dataset
  • Print the top 15 rows of your dataset
  • Write a user-defined function using any of the variables from the data set.
  • Use data manipulation techniques and filter rows based on any logical criteria that exist in your dataset.
  • Identify the dependent & independent variables and use reshaping techniques and create a new data frame by joining those variables from your dataset.
  • Remove missing values in your dataset.
  • Identify and remove duplicated data in your dataset
  • Reorder multiple rows in descending order
  • Rename some of the column names in your dataset
  • Add new variables in your data frame by using a mathematical function (for e.g. multiply an existing column by 2 and add it as a new variable to your data frame)
  • Create a training set using random number generator engine.
  • Print the summary statistics of your dataset
  • Use any of the numerical variables from the dataset and perform the following statistical functions
  • Mean
  • Median
  • Mode
  • Range
  • Plot a scatter plot for any 2 variables in your dataset
  • Plot a bar plot for any 2 variables in your dataset
  • Find the correlation between any 2 variables by applying least square linear regression model.