Tags: nat mos each data make mode volume mon attribute
Why: real-world data are typically noisy, enormous in volume, and may originate from a hodgepodge of heterogeneous sources.
Basic statistics to know for each attribute: mean; median; mode (the most common value); distribution.
Knowing such basic statistics regarding each attribute makes it easier to fill in missing values, smooth noisy values, and spot outliers during data preprocessing.
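As a rough illustration of that point, here is a minimal Python sketch using only the standard library. The `ages` sample, the helper names (`summarize`, `fill_missing`, `flag_outliers`), and the "more than k standard deviations from the median" rule are all assumptions made for this example; they are not taken from the book.

```python
import statistics


def summarize(values):
    """Basic statistics for one numeric attribute, ignoring missing (None) entries."""
    present = [v for v in values if v is not None]
    return {
        "mean": statistics.mean(present),
        "median": statistics.median(present),
        "mode": statistics.mode(present),    # most common value
        "stdev": statistics.stdev(present),  # sample standard deviation
    }


def fill_missing(values, fill_value):
    """Replace missing (None) entries with a chosen statistic."""
    return [fill_value if v is None else v for v in values]


def flag_outliers(values, center, spread, k=2.0):
    """Hypothetical rule: flag values more than k * spread away from center."""
    return [v for v in values if v is not None and abs(v - center) > k * spread]


# Hypothetical attribute with one missing value and one implausible entry.
ages = [23, 25, 25, 27, None, 29, 31, 250]

stats = summarize(ages)
filled = fill_missing(ages, stats["median"])
outliers = flag_outliers(ages, stats["median"], stats["stdev"])

print(stats)
print(filled)
print(outliers)
```

In this sample the median is used as the fill value and as the center because the single extreme entry pulls the mean far away from the typical values; with cleaner data the mean might be an equally reasonable choice.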
Book: Data Mining, Chapter 2 - Getting to Know Your Data
Original source: https://www.cnblogs.com/dulun/p/12293674.html