DATA SCIENCE Online Training
The Data Science Online Training course from US GlobalSoft that provides you with detailed learning in Data Science, Data Analytics, project life cycle, data acquisition, analysis, statistical methods and Machine Learning. You will gain expertise to deploy Recommenders using R programming, and you will also learn data analysis, data transformation, experimentation and evaluation.
Description
The Data Science Online Training course from vconnectit that provides you with detailed learning in Data Science, Data Analytics, project life cycle, data acquisition, analysis, statistical methods and Machine Learning. You will gain expertise to deploy Recommenders using R programming, and you will also learn data analysis, data transformation, experimentation and evaluation.
Pre-requisites
There are no particular prerequisites for this training course. If you love mathematics, it is helpful to learn Data Science. You will also get MS Excel self-paced course free with this course.
How will I perform the practical sessions in Online training?
For online training, US GlobalSoft provides a virtual environment that helps in accessing each other’s system. The complete course material in pdf format, reference materials, course code is provided to trainees. US GlobalSoft conductes online sessions through any of the available requirements like Skype, WebEx, GOTOMeeting, Webinar, etc.
DATA SCIENCE Course Syllabus
Module 1 : Python Programming Language
Part A : Python Basic Concepts
- Introduction to Python and its involvement with Data Science
- Understanding Object Orientation Programming
- Installation: Python 3.6 or later version, pip, iPython, Sublime Text Editor, Anaconda(Jupyter and Spyder)
- Python Identifiers, Naming Conventions, Variables and Types
- Defining Functions, Classes and Methods
- Understanding Indentation
- Executing sample programs in all Editors
- Difference Between Functions and Methods
- How to use Python Functions and Methods
- Decision making through conditions and Loops
- Declaring instances and Workout its accessibility
- Understanding global and local variables in python
- Instantiating Classes and flow of execution
- Accessing Methods, Variables, Global variables and Functions
- Working with self and super keywords
- Object String representation through __str__ and __repr__
- Constructors; Initialization; object: a base class
- Inheritance Concept; Overriding and Overloading concept
- Constructors with respect to inheritance
- Understanding __name__ == ‘__main__’
- Exceptions:
- Overview of exception
- Raising common causing exceptions
- Exception Hierarchy
- Raising exception at calling method
- Handling exceptions through try, except, else and finally
- Exception propagation
- Customized Exceptions
Part B: Data Structures:
- List: Creating, Accessing, Slicing, Manipulating lists, Built-in Functions & Methods in list, Iterating & Enumerating list data and Working with Nested lists.
- Tuple, Set and Dictionaries (same above all operations)
- Handling conversions of sample data with Data Structures
Part C: Regular Expressions in Python
- Patterns, searching, Modifiers, flags
- Working with examples to find specific strings, phone numbers, email addresses and filtering html data with regular expressions
- File I/O
- Working with text files and .csv
- Reading and Writing data to the files
- Importing required packages to work with .csv
Module2 : Statistics - Probabilities and Linear Algebra
- Statistical thinking in Python and approach of Data Analysis
- Fundamental statistics terms and its definitions
- Applying basic statistics in Python with NumPy
- Cumulative Distribution functions
- Modelling Distributions
- Graphical exploratory data analysis with Python
- Probability theories:
- Ranges, Mean, Variance, Standard Deviation and various distributions
- Mass and Density functions
- Kernel density estimation
- Understanding Bayes theorem and predictions*
- Estimation
- Sampling distributions, bias and Exponential distributions
- Hypothesis testing
- Hypothesis Test
- Testing Correlation and Proportions
- Chi-Squared Tests
- Errors, Power and Replication
- NumPy: N-dimensional array operations
- Array creations, conversions, dimensional understandings, shaping, reshaping, generating sample large datasets, Linear algebra functionalities and numerical operations etc…
- SciPy: High-level Scientific Computing
- Linear Algebra operations
- Interpolation
- Optimization and fit
- Statistics and random numbers
- Numerical Integration
- Fast Fourier transforms
- Signal processing and image manipulation
Module3 : Data Mining & Data Analytics (Data Harvesting, Cleansing, Analyzing and Visualizing)
Part A :Pandas and NumPy Functionalities:
- Introduction
- Pandas DataFrame basics
- Understanding data, looking at columns, rows and cells
- Subsetting Columns, Rows with methods
- Grouped and Aggregated Calculations
i. Frequency Means and Counts
- Basic plot
- Pandas Data Structures
- Creating your own data (Series and DataFrame)
- Series (also called as Vector) Object operations
- Broadcasting and Scalar operations
- DataFrame Broadcasting (Vectorized)
- Making changes to Series and DataFrame
i. Adding additional Columns
ii. Dropping values
- Exporting and Importing Data
Part B : Introduction to Plotting:
- Introduction
- Matplotlib
- Statistical Graphics using matplotlib
- Univariate
- Bivariate
- Multivariate Data
- Seaborn Library Plotting methodology
- Univariate, Bivariate and Multivariate
- Pandas Objects Plotting
- Histogram, Density Plot, Scatterplot, Hexbin Plot and Boxplot
- Seaborn Themes and Styles
Part C : Data Manipulation:
- Data Assembly
- Concatenations and Merging Multiple datasets
- Missing Data:
- Introduction
- What is a NaN Value
- Working with merged data, user input values and Re-indexing
- Working with missing data
- Finding and Counting missing data
- Cleansing missing data
- Calculations with missing data
- Conclusion Understanding Multiple Observations (Normalization)
Part D : Data Munging:
- Understanding Data Types
- Converting types
- Categorical Data
- Convert to Category
- Manipulating Categorical Data
- Strings and Text Data
- String Subsettings
- String Methods
- String Formatting
- Apply and Groupby Operations:
- Introduction
- Functions
- Apply over a Series and DataFrame
- Apply- Column-wise and Row-wise operations
- Groupby Operation:
- Aggregate Methods and Functions
- The datetime Data Type:
- Python’s datetime Object
- Loading, Converting, Extracting Date components
- Date Calculations
- Datetime Methods
- Subsetting datetime, Date Ranges, Shifting Values, TimeZones
Module 4 : Machine Learning (Data Modelling)
- Linear Models
- Linear and Multiple Regressions using statsmodels and sklearn
- Generalized Linear Models
- Logistic and Poisson Regressions using statsmodels and sklearn
- Survival Analysis
- Model diagnostics
- Residuals
- Comparing Multiple Models
- k-Fold Cross-Validation
- Regularization
- Clustering
- k-Means, Dimension Reduction with PCA (Principal Component Analysis)
- Hierarchical Clusterings
- Conclusions
Practical Data Analysis and Understandings
Data Science Interview Questions Discussions (2 sessions)
Note: Keeping main objective as “Understanding” All the above topics are covered with logical and programmatic approach in Python. Also please note that Content order is NOT compulsorily followed at the time of delivering subject and knowledge.
Certification
Certification assistance provided with proper guidance and certification notes.