Data Science and Machine Learning with Python
(Data Analytics: an overlapping and interrelated field)
[Artificial Intelligence]
(Comprehensively, AI is a multidisciplinary field that combines data science, machine learning, and computer science with supporting subjects such as mathematics and statistics.)
DISCLAIMER: The images, code snippets, etc. presented in this presentation were collected from various internet sources; thanks and credit to their creators/owners.
Agenda
- What is Data Analytics and Data Science?
- What Can They Do?
- Prerequisites & Skillset
- Why Python
- Statistical Techniques (case study)
- Data Analytics & Visualizations (case study)
- Machine Learning (case study)
- DS & ML Project Workflow (End-to-End Project)
- Roles & Responsibilities
- Course Curriculum
- Q & A
What is Data Analytics?
Data Analytics is the process of inspecting, cleaning, transforming, and modeling data to discover useful information, draw conclusions, and support decision-making.
It transforms raw data into actionable insights, empowering businesses and professionals to make informed choices and drive strategic outcomes.
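As a minimal sketch of that loop (using a small, made-up sales table; the column names are illustrative, not from any real dataset), inspecting, cleaning, transforming, and summarizing data looks like this in pandas:

# A minimal, hypothetical sketch of the analytics loop with pandas:
# inspect, clean, transform, and summarize a small sales table
import pandas as pd

sales = pd.DataFrame({
    'region': ['North', 'South', 'North', 'South', 'North'],
    'revenue': [120.0, None, 95.0, 180.0, 140.0],
})

sales.info()                              # inspect: column types and missing values
sales = sales.dropna(subset=['revenue'])  # clean: drop rows with missing revenue
summary = sales.groupby('region')['revenue'].agg(['mean', 'sum'])  # transform/summarize
print(summary)                            # actionable insight: revenue by region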
What is Data Science?
Data science is an interdisciplinary field: it combines foundational ingredients from multiple disciplines with relevant domain knowledge to extract hidden patterns, trends, and insights from data.
"The ability to take data—to be able to understand it, to process it, to extract value from it, to visualize it, to communicate it—that's going to be a hugely important skill in the next decades." — Hal Varian
It encompasses various techniques and tools to analyze and interpret complex data sets.
The main goal of data science is to get valuable information and insights from the data that can be used to inform decision-making.
Data Science can be applied in many industries/sectors/fields such as healthcare, finance, marketing and retail, manufacturing, transportation, and many more.
What Can Data Analytics & Data Science Do?
- Healthcare: Data science is used to analyze patient data and monitor vital signs in real time to detect signs of illness or deterioration, improving patient outcomes and reducing healthcare costs.
- Finance: Data science is used in real time to detect fraudulent transactions and monitor financial markets for signs of instability.
- Retail and e-commerce: Data science is used to analyze customer data and track real-time sales trends to optimize inventory and pricing.
- Transportation: Data science is used to analyze traffic and transportation data in real time to optimize routes, reduce congestion, and improve traffic flow.
- Manufacturing: Data science is used to monitor and analyze sensor data from manufacturing equipment to detect signs of wear and tear, optimize production processes, and improve efficiency.
- Telecommunications: Data science is used to analyze network data in real time to optimize performance and to detect and prevent service outages and fraud.
- Agriculture: Data science is used to analyze sensor data from fields and weather patterns in real time to optimize crop yields and reduce waste.
This is just the tip of the iceberg.

Data plays a huge part in modern life. And while data is revolutionising everything – from our shopping to our social lives – it is also transforming healthcare.
Why Python

image source: https://bit.ly/3UVcgS6
Packages
- NumPy
- Pandas
- Matplotlib, Seaborn
- scikit-learn (sklearn)
- ...and more (conventional import aliases are sketched below)
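A quick sketch of the community-standard import aliases for this stack:

# Conventional imports for the core Python data science packages
import numpy as np               # numerical arrays and linear algebra
import pandas as pd              # tabular data (DataFrames)
import matplotlib.pyplot as plt  # plotting
import seaborn as sns            # statistical visualization built on matplotlib
import sklearn                   # machine learning (scikit-learn)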
Statistics


image source: https://bit.ly/3O5S26k
Elementary statistics

Statistics
Mean, Median, Mode, Standard Deviation, Range, Quartiles, Skewness, Kurtosis, and more
# Applying basic statistics in Python
import pandas as pd

data = {'Student': ['Alice', 'Bob', 'Charlie', 'David', 'Eve', 'Frank', 'Grace'],
        'Hours_Studied': [20, 5, 10, 15, 2, 16, 22],
        'Pre_Grade': [54, 78, 68, 67, 45, 57, 85],
        'Post_Grade': [90, 70, 96, 82, 62, 87, 98]}
df = pd.DataFrame(data, columns=['Student', 'Hours_Studied', 'Pre_Grade', 'Post_Grade'])
print(df)

# Minimum and maximum values of Pre_Grade
print(df['Pre_Grade'].min())
print(df['Pre_Grade'].max())

# The sum of all the Hours_Studied
print(df['Hours_Studied'].sum())

# Mean Pre_Grade
print(df['Pre_Grade'].mean())

# Median value of Post_Grade
print(df['Post_Grade'].median())

# Sample variance and sample standard deviation of Post_Grade values
print(df['Post_Grade'].var())
print(df['Post_Grade'].std())

# Cumulative sum of Pre_Grade, moving down the rows from the top
print(df['Pre_Grade'].cumsum())

# Summary statistics on Post_Grade
print(df['Post_Grade'].describe())
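The remaining measures listed above (mode, range, quartiles, skewness, kurtosis) are pandas one-liners as well; a short sketch on the same df:

# Additional statistics on the same DataFrame (df) as above
print(df['Post_Grade'].mode())                          # most frequent value(s)
print(df['Post_Grade'].max() - df['Post_Grade'].min())  # range
print(df['Post_Grade'].quantile([0.25, 0.5, 0.75]))     # quartiles
print(df['Post_Grade'].skew())                          # sample skewness
print(df['Post_Grade'].kurt())                          # sample excess kurtosis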
Data Analytics & Visualizations
Summarize, Scatter plot, Histogram, Box plot, Pie chart, Bar plot, and more

# Basic plotting in Python
import pandas as pd
import matplotlib.pyplot as plt

iris = pd.read_csv("https://archive.ics.uci.edu/ml/machine-learning-databases/iris/iris.data", header=None)
iris.columns = ['Sepal.Length', 'Sepal.Width', 'Petal.Length', 'Petal.Width', 'Species']
print(iris.head())

# Histogram of sepal length
iris['Sepal.Length'].plot(kind='hist')
plt.show()

# Box plot of all numeric columns
iris.plot(kind='box')
plt.show()

# Scatter plot of sepal length vs. sepal width
iris.plot(kind='scatter', x='Sepal.Length', y='Sepal.Width')
plt.show()
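The bar and pie charts listed above follow the same pattern; a short sketch using the species counts from the same iris DataFrame:

# Bar and pie charts of the class distribution in the same iris DataFrame
iris['Species'].value_counts().plot(kind='bar')
plt.show()
iris['Species'].value_counts().plot(kind='pie', autopct='%1.1f%%')
plt.show()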




Roles & Responsibilities

Machine Learning


1. Start with a Question
2. Get & Clean the Data
3. Perform EDA
4. Apply Techniques
5. Share Insights

A Simple Example (DS & ML)
Step 1. Start with a question: if I study more, will I get a higher grade?
Step 2. Get & clean the data: collect each student's study hours and grades.
Step 3. Perform EDA:
Finding #1: The more you study, the higher the grade you will get.
Finding #2: Also, Charlie is a smarty pants.
Step 4. Apply techniques: linear regression gives Grade = 1.5*Hours + 65.
Step 5. Share insights:
Yes, there is a positive correlation between the number of hours you study and the grade you get.
Specifically, the relationship is Grade = 1.5*Hours + 65, so if you study 10 hours, you can expect to get an 80.
However, Charlie is a smarty pants and is inflating the grade estimate; you will probably get slightly less than 80.
Machine Learning Algorithm 1 (Local File)

import pandas as pd
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import train_test_split
from sklearn.metrics import mean_squared_error, r2_score

# Load the dataset and drop rows with a missing grade
df = pd.read_csv('student_data.csv')
df.dropna(subset=['grade'], inplace=True)

# Split the dataset into training and testing sets
X = df[['hours_studied']]
y = df['grade']
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# Create a Linear Regression model and fit the training data
model = LinearRegression()
model.fit(X_train, y_train)

# Make predictions on the testing set and calculate metrics
y_pred = model.predict(X_test)
mse = mean_squared_error(y_test, y_pred)
r2 = r2_score(y_test, y_pred)
print('Mean squared error:', mse)
print('R2 score:', r2)

# Make a prediction for a new student
new_hours_studied = [[6.5]]  # hours studied for the new student
new_grade = model.predict(new_hours_studied)
print('Predicted grade:', new_grade)
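As a sanity check, here is a sketch that reuses the toy Hours_Studied / Post_Grade table from the statistics slide (rather than the student_data.csv file assumed above): an ordinary least-squares fit lands close to the Grade = 1.5*Hours + 65 relationship quoted in the workflow example.

# Fit a straight line to the toy study-hours/grades data from earlier
import numpy as np

hours = np.array([20, 5, 10, 15, 2, 16, 22])
grades = np.array([90, 70, 96, 82, 62, 87, 98])

# polyfit with degree 1 returns (slope, intercept) of the least-squares line
slope, intercept = np.polyfit(hours, grades, 1)
print(f"Grade = {slope:.2f}*Hours + {intercept:.2f}")  # ~ Grade = 1.46*Hours + 64.85

# Expected grade after 10 hours of study: just under 80, as the insights noted
print(slope * 10 + intercept)  # ~79.4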
Machine Learning Algorithm 2 (Online Data)

import pandas as pd
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score

# Load the Iris dataset into a Pandas DataFrame
url = "https://archive.ics.uci.edu/ml/machine-learning-databases/iris/iris.data"
iris_data = pd.read_csv(url, names=["sepal_length", "sepal_width", "petal_length", "petal_width", "species"])

# Split the data into features (X) and labels (y)
X = iris_data[["sepal_length", "sepal_width", "petal_length", "petal_width"]]
y = iris_data["species"]

# Split the data into training and test sets
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# Create a logistic regression model and train it on the training data
log_reg = LogisticRegression()
log_reg.fit(X_train, y_train)

# Use the model to make predictions on the test data
y_pred = log_reg.predict(X_test)

# Calculate the accuracy of the model
accuracy = accuracy_score(y_test, y_pred)
print("Accuracy:", accuracy)
Sample Machine Learning Model (with custom data)

# Generate a training set: the target is the known function a + 2*b + 3*c
from random import randint

TRAIN_SET_LIMIT = 1000
TRAIN_SET_COUNT = 100

TRAIN_INPUT = list()
TRAIN_OUTPUT = list()
for i in range(TRAIN_SET_COUNT):
    a = randint(0, TRAIN_SET_LIMIT)
    b = randint(0, TRAIN_SET_LIMIT)
    c = randint(0, TRAIN_SET_LIMIT)
    op = a + (2*b) + (3*c)
    TRAIN_INPUT.append([a, b, c])
    TRAIN_OUTPUT.append(op)

# Train the model
from sklearn.linear_model import LinearRegression
predictor = LinearRegression(n_jobs=-1)
predictor.fit(X=TRAIN_INPUT, y=TRAIN_OUTPUT)

# Test data: the model should recover coefficients ~[1, 2, 3]
# and predict ~140 for [10, 20, 30] (10 + 2*20 + 3*30)
X_TEST = [[10, 20, 30]]
outcome = predictor.predict(X=X_TEST)
coefficients = predictor.coef_
print('Outcome : {}\nCoefficients : {}'.format(outcome, coefficients))
Machine Learning Algorithm Complications and Handling Techniques
In real-world situations, it is unlikely that a model will achieve 100% accuracy. There are a few ways to make the code above report a more realistic (less than 100%, but better than average) accuracy:
- Use cross-validation: a technique that estimates the performance of a model on unseen data by dividing the data into multiple subsets, training the model on some of the subsets, and evaluating it on the remaining ones. By using cross-validation, you can get a better estimate of the model's true performance.
- Other techniques, such as regularization, feature scaling, or decreasing the sample size; or try different algorithms, compare the performance of each one, and choose the one that performs better.
Here's an example of how you can modify the logistic regression code above to use cross-validation to get an estimate of the model's true performance:

from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

# Create a logistic regression model
log_reg = LogisticRegression()

# Use 5-fold cross-validation to estimate the model's performance
# (X and y are the iris features and labels from the earlier example)
scores = cross_val_score(log_reg, X, y, cv=5)
print("Cross-validation scores:", scores)
print("Mean accuracy:", scores.mean())

A separate end-to-end example: selecting and training a linear model on OECD life-satisfaction and GDP-per-capita data (prepare_country_stats() is just Pandas code that joins the life satisfaction data from the OECD with the GDP per capita data from the IMF):

import matplotlib.pyplot as plt
import numpy as np
import pandas as pd
import sklearn.linear_model

def prepare_country_stats(oecd_bli, gdp_per_capita):
    # Join the OECD life satisfaction data with the IMF GDP per capita data
    oecd_bli = oecd_bli[oecd_bli["INEQUALITY"] == "TOT"]
    oecd_bli = oecd_bli.pivot(index="Country", columns="Indicator", values="Value")
    gdp_per_capita.rename(columns={"2015": "GDP per capita"}, inplace=True)
    gdp_per_capita.set_index("Country", inplace=True)
    full_country_stats = pd.merge(left=oecd_bli, right=gdp_per_capita,
                                  left_index=True, right_index=True)
    full_country_stats.sort_values(by="GDP per capita", inplace=True)
    remove_indices = [0, 1, 6, 8, 33, 34, 35]
    keep_indices = list(set(range(36)) - set(remove_indices))
    return full_country_stats[["GDP per capita", 'Life satisfaction']].iloc[keep_indices]

# Load the data
oecd_bli = pd.read_csv("oecd_bli_2015.csv", thousands=',')
gdp_per_capita = pd.read_csv("gdp_per_capita.csv", thousands=',', delimiter='\t',
                             encoding='latin1', na_values="n/a")

# Prepare the data
country_stats = prepare_country_stats(oecd_bli, gdp_per_capita)
X = np.c_[country_stats["GDP per capita"]]
y = np.c_[country_stats["Life satisfaction"]]

# Visualize the data
country_stats.plot(kind='scatter', x="GDP per capita", y='Life satisfaction')
plt.show()

# Select a linear model
model = sklearn.linear_model.LinearRegression()

# Train the model
model.fit(X, y)

# Make a prediction for Cyprus
X_new = [[22587]]  # Cyprus' GDP per capita
print(model.predict(X_new))  # outputs [[ 5.96242338]]
Decision tree

#Import Library
#Import other necessary libraries like pandas, numpy...
from sklearn import tree
# Assumed you have X (predictor) and y (target) for the training data set
# and x_test (predictor) for the test data set

# Create tree object; the criterion can be 'gini' or 'entropy' (information gain),
# by default it is 'gini'
model = tree.DecisionTreeClassifier(criterion='gini')
# model = tree.DecisionTreeRegressor()  # for regression

# Train the model using the training sets and check the score
model.fit(X, y)
model.score(X, y)

# Predict output
predicted = model.predict(x_test)

Random forest

#Import Library
from sklearn.ensemble import RandomForestClassifier
# Assumed you have X (predictor) and y (target) for the training data set
# and x_test (predictor) for the test data set

# Create Random Forest object
model = RandomForestClassifier()

# Train the model using the training sets
model.fit(X, y)

# Predict output
predicted = model.predict(x_test)

K-Means

#Import Library
from sklearn.cluster import KMeans
# Assumed you have X (attributes) for the training data set
# and x_test (attributes) for the test data set

# Create a KMeans clustering object
model = KMeans(n_clusters=3, random_state=0)

# Train the model using the training set
model.fit(X)

# Predict output (cluster assignments)
predicted = model.predict(x_test)

Naive Bayes

#Import Library
from sklearn.naive_bayes import GaussianNB
# Assumed you have X (predictor) and y (target) for the training data set
# and x_test (predictor) for the test data set

# Create a Gaussian Naive Bayes object (there are other variants for other
# distributions, e.g. Multinomial and Bernoulli Naive Bayes)
model = GaussianNB()

# Train the model using the training sets
model.fit(X, y)

# Predict output
predicted = model.predict(x_test)
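For a runnable version of these templates, here is one of them (the decision tree) made self-contained with scikit-learn's built-in iris data; the other three follow the same fill-in-X-and-y pattern:

# Self-contained decision tree example using the built-in iris dataset
from sklearn import tree
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split

# Load data and split into training and test sets
iris = load_iris()
X_train, x_test, y_train, y_test = train_test_split(
    iris.data, iris.target, test_size=0.2, random_state=42)

# Create and train the tree, then score it on held-out data
model = tree.DecisionTreeClassifier(criterion='gini')
model.fit(X_train, y_train)
print("Test accuracy:", model.score(x_test, y_test))

# Predict output
predicted = model.predict(x_test)
print(predicted)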
Some important algorithms: https://goo.gl/zAyFea , https://bit.ly/2r59AWu
Logistic Regression

import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import confusion_matrix, accuracy_score

# Load the dataset: columns 2 and 3 as features, column 4 as the label
dataset = pd.read_csv('Social_Network_Ads.csv')
X = dataset.iloc[:, [2, 3]].values
y = dataset.iloc[:, 4].values
print(X)
print(y)

# Split into training and test sets
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.25, random_state=0)

# Feature scaling
sc = StandardScaler()
X_train = sc.fit_transform(X_train)
X_test = sc.transform(X_test)

# Train a logistic regression classifier
classifier = LogisticRegression(random_state=0)
classifier.fit(X_train, y_train)
y_pred = classifier.predict(X_test)

# Confusion matrix
cm = confusion_matrix(y_test, y_pred)
print(cm)

# Print the accuracy as a percentage
print(accuracy_score(y_test, y_pred) * 100)

KNN classification

# Importing necessary libraries
from sklearn import datasets
from sklearn.metrics import confusion_matrix
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier

# Loading the iris dataset
iris = datasets.load_iris()

# X -> features, y -> label
X = iris.data
y = iris.target

# Dividing X, y into train and test data
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# Training a KNN classifier
knn = KNeighborsClassifier(n_neighbors=7).fit(X_train, y_train)

# Accuracy on X_test
accuracy = knn.score(X_test, y_test)
print(accuracy)

# Creating a confusion matrix
knn_predictions = knn.predict(X_test)
cm = confusion_matrix(y_test, knn_predictions)
print(cm)
Data Science Skills are in High Demand Across Industries
Popular Programming Languages
TIOBE Index: https://www.tiobe.com/tiobe-index/
PYPL Ranking: https://pypl.github.io/PYPL.html
Top Programming Languages (IEEE Spectrum): https://bit.ly/3OMkbh6
(LinkedIn, Glassdoor, Bureau of Labor Statistics & PayScale reports)
Course Curriculum
Final Note


Adjacent Tracks of DS, AI & ML Technology
- Business Intelligence
- Business Analytics
- Big Data Analytics
- Natural Language Processing (NLP)
- ETL, ELT, and Data Engineering
- Deep Learning
- Computer Vision
AI (Specialized Domains or Sub-Tracks)

General Growth 1
Career Growth 2

The AI Hierarchy
Top 10 Real-World Artificial Intelligence Applications: https://dzone.com/articles/ai-applications-top-10-real-world-artificial-intel
Skillsets

Natural Language Processing
Deep Learning
Demo & Practical (https://lobe.ai/)

Machine Learning vs Deep Learning

Visual Understanding of Neural Network (DL) (https://jalammar.github.io/)
https://jalammar.github.io/visual-interactive-guide-basics-neural-networks/
https://jalammar.github.io/feedforward-neural-networks-visual-interactive/
https://github.com/jalammar/simpleTensorFlowClassificationExample/blob/master/Basic%20Classification%20Example%20with%20TensorFlow.ipynb
https://www.katacoda.com/basiafusinska/courses/tensorflow-getting-started/tensorflow-mnist-beginner
https://chromium.googlesource.com/external/github.com/tensorflow/tensorflow/+/r0.7/tensorflow/g3doc/tutorials/mnist/beginners/index.md
https://www.tensorflow.org/tutorials/quickstart/beginner
https://www.tensorflow.org/tutorials/keras/classification
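As a taste of the TensorFlow beginner quickstart linked above, here is a minimal sketch of the MNIST classifier it builds (paraphrased from that tutorial; exact layer sizes and epochs may differ from the current page):

# Minimal feed-forward MNIST classifier, in the spirit of the TF beginner quickstart
import tensorflow as tf

# Load the MNIST digits and scale pixel values to [0, 1]
(x_train, y_train), (x_test, y_test) = tf.keras.datasets.mnist.load_data()
x_train, x_test = x_train / 255.0, x_test / 255.0

# Flatten -> dense -> dropout -> 10 output logits (one per digit)
model = tf.keras.models.Sequential([
    tf.keras.layers.Flatten(input_shape=(28, 28)),
    tf.keras.layers.Dense(128, activation='relu'),
    tf.keras.layers.Dropout(0.2),
    tf.keras.layers.Dense(10)
])

model.compile(optimizer='adam',
              loss=tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True),
              metrics=['accuracy'])

# Train for a few epochs and evaluate on the held-out test set
model.fit(x_train, y_train, epochs=5)
model.evaluate(x_test, y_test, verbose=2)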
Deep Learning Demo & Practical (Teachable Machine)

MediaPipe Web-Enabled Machine Learning Framework: https://viz.mediapipe.dev/
The 12 Most Popular Computer Vision Tools in 2021:
https://viso.ai/computer-vision/the-most-popular-computer-vision-tools/
83 Most Popular Computer Vision Applications in 2022:
27+ Most Popular Computer Vision Applications and Use Cases in 2021:
Computer Vision
Difference between Deep Learning and Computer Vision
Computer vision is a subset of machine learning that deals with making computers or machines understand human actions, behaviors, and languages similarly to humans. The idea is to get machines to understand and interpret the visual world so that they make sense out of it and derive some meaningful insights. Deep learning is a subset of AI that seeks to mimic the functioning of the human brain based on artificial neural networks.
Read more: Difference Between Computer Vision and Deep Learning | Difference Between http://www.differencebetween.net/technology/difference-between-computer-vision-and-deep-learning/#ixzz7HVNNNIJM
https://labs.openai.com/
https://platform.openai.com/playground
What is AI & 3 Types of AI (ANI,AGI,ASI):
https://codebots.com/artificial-intelligence/the-3-types-of-ai-is-the-third-even-possible
https://medium.com/mapping-out-2050/distinguishing-between-narrow-ai-general-ai-and-super-ai-a4bc44172e22
https://www.spiceworks.com/tech/artificial-intelligence/articles/narrow-general-super-ai-difference/