Rahul Bajaj
Data Scientist by Profession - Enjoy number crunching using Open Source Technologies. Feel free to reach out to me at rahulbajaj@hotmail.co.in
Data collected at source specifically for analytical purpose/research at hand.
Example : Surveys, interviews, focus groups etc..
Data collected for some other purposes not specifically for analytical purpose/research at hand.
Example : Industry reports, transaction records etc..
Nominal
Ordinal
Interval
Ratio
Merely labels. No further information can be gleaned.
Merely labels. No further information can be gleaned.
Merely labels. No further information can be gleaned.
Merely labels. No further information can be gleaned.
Which one of the following is not an example of nominal scale?
Classification of individuals using nationality.
Classification of individuals using blood group.
Classification of students of same 5th standard in various divisions.
Classification of students according to grades.
Which one of the following is not an example of ratio scale?
Farenheight scale of temperature measurement.
Height (cm) of an individual.
Time (min) to type 5000 words.
Weight (kg) of an individual.
Tables >> Rows >> Columns
Pre-designed fields
Video, Audio, Text files....
Everything In between ...
DISCRETE VS CONTINUOUS
Finite , Countable set of values
Examples : Shoe Sizes, ZIP Codes..
Real Numbers as attribute values
Examples : Temperature, Stock Prices, Weight..
Discrete attribute is
A variable taking all the values between 0 and infinity.
A variable taking all possible values in a certain range.
A variable taking particular values.
None of above.
Numeric
Logical
Character
Integers / Real Numbers
Boolean : TRUE / FALSE
Real + Imaginary Numbers
Vectors
Arrays
Lists
Matrices
Data Frames
Complex
Text, Strings
The population consists of the set of all measurements in which the investigator is interested. The population is also called the universe.
A sample is a subset of measurements selected from the population.
Sampling from the population is often done randomly, such that every possible sample of n elements will have an equal chance of being selected. A sample selected in this way is called a simple random sample, or just a random sample. A random sample allows chance to determine its elements.
EXAMPLE GR
A set of measurements obtained on some variable is called a data set. For example, heart rate measurements for 10 patients may constitute a data set.
Sometimes our data set consists of the entire population we’re interested in. If we have the actual point spread for five football games, and if we are interested only in these five games, then our data set of five
measurements is the entire population of interest.
In other situations data may constitute a sample from some population. If the data are to be used to draw some conclusions about the larger population they were drawn from, then we must collect the data with great care.
EXAMPLE GR
By Rahul Bajaj
Understanding Data
Data Scientist by Profession - Enjoy number crunching using Open Source Technologies. Feel free to reach out to me at rahulbajaj@hotmail.co.in