Data-Science: What is, Basics & Process
• Python_Programming
• SQL programming
• Excel
• Comfortable with using the Terminal, version control in Git, and using GitHub
• Calculus
• Linear Algebra
• Probability and Statistics:
- Accessing database, CSV, and JSON data
- Data cleaning and transformations using pandas
- visualization
- dashboards
-
Supervised Learning
-
Unsupervised Learning
- Content-Based and Collaborative Filtering
- Evaluation of Recommendation Systems. DCG, nDCG
- Time Series Analysis
- Text Analytics
- Lexical processing
- Syntactic Processing
- Semantic Processing
- Hadoop and MapReduce Programming
- NoSQL Databases and Apache HBase
- Hive Tutorial
- Analytics using PySpark