Day 1 - Normal Distribution, Z-score, Standard Normal distribution - #66DaysofData Challenge
By Jerin Lalichan 1. Normal Distribution (Gaussian Distribution) How the data is distributed can tell us a lot about the data. There are over 20 different types of data distributions commonly used in data science to develop a model. The normal distribution is one of the most common data distributions seen in nature, for example, the distribution of the heights of human beings . It is also called as Bell Curve. Image credits: Statistics by Jim Properties: Parameters of a bell curve are : Mean - which determines the maximum height of the curve Standard deviation - which determines how wide should be the curve The mean, median, and mode are the same. The distribution is symmetrical, in other words, the data points are symmetrically distributed around the mean value. 2. Z score It is a number that represents how far is, ie how many standard deviations away is a point located, measured from the mean value. So farther the ...




Comments
Post a Comment