codebook.Rmd

---
title: "Codebook"
output:
  html_document:
    toc: true
    toc_depth: 4
    toc_float: true
    code_folding: 'hide'
    self_contained: true
  pdf_document:
    toc: yes
    toc_depth: 4
    latex_engine: xelatex
---

This script (run_analysis.R) performs the data preparation in the first step. After the preparation of data, it performs the 5 steps as required by the project.

1. Data download and extraction
    Before starting the steps, the data were downloaded.

2. Assigning the data to different variables
  features <- features.txt : 561 rows, 2 columns
  activities <- activity_labels.txt : 6 rows, 2 columns
  subject_test <- test/subject_test.txt : 2947 rows, 1 column
  x_test <- test/X_test.txt : 2947 rows, 561 columns
  y_test <- test/y_test.txt : 2947 rows, 1 columns
  subject_train <- test/subject_train.txt : 7352 rows, 1 column
  x_train <- test/X_train.txt : 7352 rows, 561 columns
  y_train <- test/y_train.txt : 7352 rows, 1 columns

3. Merges the training and the test sets to create one data set
  X (10299 rows, 561 columns) is created by merging x_train and x_test using     rbind() function
  Y (10299 rows, 1 column) is created by merging y_train and y_test using        rbind() function
  Subject (10299 rows, 1 column) is created by merging subject_train and         subject_test using rbind() function
  Merged_Data (10299 rows, 563 column) is created by merging Subject, Y and X    using cbind() function

4. Extracts only the measurements on the mean and standard deviation for each    measurement
  TidyData (10299 rows, 88 columns) is created by subsetting Merged_Data,        selecting only columns: subject, code and the measurements on the mean and     standard deviation (std) for each measurement

5.  Uses descriptive activity names to name the activities in the data set
  Entire numbers in code column of the TidyData replaced with corresponding     activity taken from second column of the activities variable

6. Appropriately labels the data set with descriptive variable names
  code column in TidyData renamed into activities
  All Acc in column’s name replaced by Accelerometer
  All Gyro in column’s name replaced by Gyroscope
  All BodyBody in column’s name replaced by Body
  All Mag in column’s name replaced by Magnitude
  All start with character f in column’s name replaced by Frequency
  All start with character t in column’s name replaced by Time

7. From the data set in step 4, creates a second, independent tidy data set     with the average of each variable for each activity and each subject
  FinalData (180 rows, 88 columns) is created by sumarizing TidyData taking the     means of each variable for each activity and each subject, after groupped    by subject and activity.
  Export FinalData into FinalData.txt file.