Saw this video comparing the Garmin vs the Labradar:
Garmin Xero C1 Pro vs LabRadar (youtube.com)
Interestingly, it seems the Garmin uses the population SD formula while the Labradar uses the sample SD formula. In my view a string of shots is a sample where the population is all shots fired by the rifle. Any statisticians care to comment?
Yes I saw that video, and I subscribe to his channel. He's very knowledgeable and experienced, has a great channel, and I have learned alot from his videos.
He (Bryan) is correct for the SD statistics rule, i.e. the SD from a single range shooting session is the entire population method. It is NOT a sub-sample of a larger population (e.g. all the velocity data records from that rifle over hundreds or thousands of shots on different days, with different enviro variables, possibly different lots of brass, powder, primers, bullets, different winds, barometric pressure, throat erosion on the barrel, shooter error, etc). Therefore one has to use the population SD calculation method for that
one session.
This is because in shooting, we know that there are many different variables (some of which we cannot control), affecting our velocities every single range session.
The stats-101 textbooks often use the student heights example for explaining population vs subsample. E.g. All the data for student heights in grade 12 in Canada are in this spreadsheet. That national data set uses the entire population SD method. Now I want to compare my school's subsample heights mean and SD with the National population height mean and SD, so therefore I would use the sub-sample SD method, and compare it to the national mean and SD. In this example there are no uncontrolled variables. Its a simple height measurement that cannot be affected by other variables.
HOWEVER: In this video he made a critical mistake and
violated the fundamental rules of statistics,
which makes all his SD numbers invalid in this video: He calculated SD's from each individual 3-shot velocity group. In Statistics, there is no such thing as an SD from a sample size of 3. An SD is only valid if the sample size is large enough,
AND has a normal distribution (i.e. a relatively symmetrical bell curve). Generally the rule of thumb is that a sample has to be at least 30, or approaching 30, for any hope in generating a normal distribution. You can sometimes cheat a little with smaller sample sizes than 30, and graph these results in histograms, choosing the bin size, to show that its approximately approaching a normal distribution, close enough-ish so to speak.
The velocity number is a one-dimensional data type. It has no pre-determined mean or distribution until you shoot the rounds for that session. All else being equal, any one-dimensional outcome data type will generate a normal distribution of variance around the mean (in the center hump of the bell curve). Therefore one must generate a sufficient sample size first, to generate a normal distribution, before doing the SD calculation. However, we can skip (at our peril) the technical step of proving the normal distribution first, and just generate the SD from the raw data. If we do this, we are making the assumption that our sample size was large enough and we shot every round under the same conditions, etc.
Ironically, he has all the raw data to do this correctly. He shot fourteen 3-shot groups with each chronograph, and showed these 3-shot means on screen. Therefore he has a true sample size of 42 velocities for each chronograph. This should easily produce a normal distribution for each chronograph, and then he could do
one Mean, ES and SD statistical calculation to compare his two samples (both of which would be the population method).
I wanted to add a comment to his video with an encouragement to him do this with is raw data (which is really good data), and do one SD comparison. But he has comments turned off.
On screen he shows he had already sub-pooled his raw data into 3-shot group means, (masking the true variation across the entire sample), so I am unable to run the correct SD calculation on his 42 data points per chronograph. SD absolutely relies on true distribution along the X axis in order to generate a bell curve and reliable SD.
For a good backgrounder on Standard Deviation, Wikipedia does a good job of explaining it in general language. Link:
https://en.wikipedia.org/wiki/Standard_deviation
(Aside: There are other variants of SD calculations for skewed distributions that are generated from 2-dimensional (coordinate X and Y) variables, such as bullet hit distance from bullseye center, which generate a density cloud around the bull center. These distributions show more (denser) hits closer to the center of the bull and fewer (less dense) hits further away (i.e. bell curve heavily skewed to the left with a long shallow tail to the right). These are types of distributions that require a special SD depending on their amount of skewness. These 2 or 3 dimensional types of data distributions get complicated very fast because the type of the distribution has to be determined first, and then the proper SD method chosen).