Alireza Afzal Aghaei
Graduate student at SBU
M.Sc at SBU
Welcome to the real world!
Some factors of Data Quality
Data quality depends on the intended use of the data.
Major techniques in Data Preprocessing
The data reduction is lossless if the original data can be reconstructed from the compressed data without any loss of information; otherwise, it is lossy.
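The distinction can be illustrated with a minimal sketch. It assumes hypothetical sensor readings (the `readings` list is made up for illustration), uses Python's standard `zlib` module as one example of a lossless scheme, and uses rounding to whole numbers as one example of a lossy reduction.

```python
import zlib

# Hypothetical readings, used only for illustration.
readings = [20.125, 20.25, 20.375, 21.5, 21.625]
raw = ",".join(str(x) for x in readings).encode("ascii")

# Lossless: decompressing returns exactly the original bytes.
packed = zlib.compress(raw)
restored = zlib.decompress(packed)
is_lossless = restored == raw

# Lossy: rounding maps several distinct readings to the same value,
# so the original readings cannot be reconstructed from the result.
rounded = [round(x) for x in readings]
distinct_before = len(set(readings))
distinct_after = len(set(rounded))

print("lossless round trip:", is_lossless)
print("distinct values before/after rounding:", distinct_before, distinct_after)
```

Rounding stands in here for any reduction that discards detail; which loss is acceptable depends on the intended use of the data.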
Principal Components Analysis
PCA searches for k n-dimensional orthogonal vectors that can best be used to represent the data, where k ≤ n.
Attribute subset selection reduces the data set size by removing irrelevant or redundant attributes.
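As a rough sketch of one simple heuristic (not the only way to select attributes), the code below drops constant attributes as irrelevant and drops any attribute that is almost perfectly correlated with one already kept as redundant. The helper names, the 0.95 threshold, and the sample columns are assumptions made for illustration.

```python
from math import sqrt

def pearson(xs, ys):
    """Pearson correlation of two equal-length, non-constant numeric lists."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    return cov / sqrt(vx * vy)

def select_attributes(columns, threshold=0.95):
    """Keep attributes that are neither constant nor near-duplicates of a kept one."""
    kept = {}
    for name, values in columns.items():
        if len(set(values)) <= 1:
            continue  # constant attribute: carries no information
        if any(abs(pearson(values, other)) >= threshold for other in kept.values()):
            continue  # redundant: almost a linear function of a kept attribute
        kept[name] = values
    return kept

# Hypothetical data set with one redundant and one constant attribute.
data = {
    "height_cm": [150.0, 160.0, 170.0, 180.0],
    "height_in": [59.1, 63.0, 66.9, 70.9],
    "constant_flag": [1, 1, 1, 1],
    "age": [35, 22, 41, 28],
}

selected = select_attributes(data)
print(list(selected))
```

The result depends on the order in which attributes are visited: the first attribute of a correlated pair is the one that survives.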
Concept Hierarchy Generation
Any questions?
Thank you