GCD-Course-Project

Getting and Cleaning Data Course Project

Data Set Source

The data was collected from sensors of smartphone carried by experiment subject when performing certain activity. There have been 30 subjects, and each subject performed 6 activities (WALKING, WALKING_UPSTAIRS, WALKING_DOWNSTAIRS, SITTING, STANDING, LAYING) wearing a smartphone (Samsung Galaxy S II) on the waist.

A full description is available at the site where the data was obtained.

Processes to Get Tidy Data

The reshape2 package was required to melt() and dcast() the data frame.

Download and unzip the data set file.
Read the relevant files to variables:
- features.txt -> features
- activity_labels.txt -> act
- y_test.txt -> ytest
- y_train.txt -> ytrain
- subject_test.txt -> subj_test
- subject_train.txt -> subj_train
- X_test.txt -> xtest_df
- X_train.txt -> xtrain_df
Merge the test and train data(xtest_df & xtrain_df), cbind() the subject number (subj_test & subj_train) and activity labels (act) to the left of the merged data, and name the cols with features.
Extract the cols with "mean()" or "std()". Cols with "meanFreq()" are not included because it calculates the mean frequency not the mean value.
Use the melt() and dcast() functions to get the narrow tidy data.
Give the tidy data proper variable names and use write.table() function to create a .txt file.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
Codebook.md		Codebook.md
README.md		README.md
run_analysis.R		run_analysis.R
step_5_tidy_data.txt		step_5_tidy_data.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GCD-Course-Project

Data Set Source

Processes to Get Tidy Data

About

Releases

Packages

Languages

zian999/Course-Project

Folders and files

Latest commit

History

Repository files navigation

GCD-Course-Project

Data Set Source

Processes to Get Tidy Data

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages