Pages

2024/01/22

Decoding the Data Whisperer:

 A Beginner's Guide to Z-Scores

πŸŸ₯πŸŸ₯πŸŸ₯πŸŸ₯πŸŸ₯πŸŸ₯πŸŸ₯πŸŸ₯πŸŸ₯πŸŸ₯πŸŸ₯πŸŸ₯πŸŸ₯πŸŸ₯






Imagine you're in a classroom full of students who all took the same exam. You scored 75, but how does that compare to everyone else? Did you ace it or just barely scrape by? Enter the z-score, a powerful tool that helps you understand your position within a dataset.


Think of z-scores as a translator. It takes your raw score and converts it into a universal language, telling you how many standard deviations away you are from the mean or average of the group. S standard deviation is basically a measure of how spread out the data is.


The lowdown on z-scores:


The Formula:



Interpretation:

Positive z-score means you scored above the mean, The higher the z-score, the further above the mean you are. For example, a z score of 2 means that you are 2 standard deviations above the average.



A negative z score means that you scored below the mean. The more negative the z-score, the further below the mean you are. 


A z-score of 0 then you are right on the mean.



Benefits of Z-Scores:
  1. you can compare apples to oranges. You can compare data from different sets with different units. Imagine comparing your exam score to your friend's height. Z-scores make it possible by putting both scores on the same scale.
  2. Spot outliers: Extreme values that deviate significantly from the rest of the data can be easily identified with z-scores. A z-score far above or below the others might indicate an error or a unique case that deserves further investigation.
  3. Predict probabilities: Knowing the z-score and the properties of the normal distribution-  the bell curve-, you can estimate the percentage of the population that scored lower or higher than you.




For Whom the Bell Curves

From Gambling Odds to Universal Truth




πŸ§ͺπŸ§ͺπŸ§ͺπŸ§ͺπŸ§ͺπŸ§ͺπŸ§ͺπŸ§ͺπŸ§ͺπŸ§ͺπŸ§ͺπŸ§ͺπŸ§ͺπŸ§ͺπŸ§ͺπŸ§ͺπŸ§ͺπŸ§ͺπŸ§ͺπŸ§ͺ

Have you ever wondered why test scores, heights, and even plant sizes seem

to follow a predictable pattern? The answer lies in a mysterious bell-shaped curve known as the normal distribution. But this ubiquitous curve was not just handed down on a silver platter - it emerged from the mind of a brilliant mathematician named Abraham De Moivre in the 18th century (Nesselroade & Grimm, 2020).


  • De Moivre, a friend of legends like Halley and Newton, was not interested in boring old graphs. He was fascinated by the odds of chance, specifically the probability of flipping a coin a thousand times and getting between 500 and 600 heads. As he crunched the numbers, something magical happened, the results began to form a bell-shaped curve.
  • 🟰🟰🟰🟰🟰🟰🟰🟰🟰🟰🟰🟰🟰🟰🟰🟰🟰🟰🟰🟰🟰🟰🟰🟰

  • But how did he get that iconic formula with its mysterious constants like pi and e? Well, that's a secret lost to the writing style of the time, where results were proudly displayed but methods remained tightly under wraps (Nesselroade & Grimm, 2020). It's a tantalizing glimpse into De Moivre's mind, bending seemingly unrelated fields like gambling and geometry to birth a universal truth. 

🌑️🌑️🌑️🌑️🌑️🌑️🌑️🌑️🌑️🌑️🌑️🌑️🌑️🌑️🌑️🌑️🌑️🌑️

While De Moivre laid the groundwork, others helped the bell curve ring far and wide. Thomas Simpson extended it to continuous measurements like star positions, proving that averaging multiple observations in the sky is the key to minimizing error. Then came Pierre Leplace with his Central Limit Theorem, the grandaddy of statistics, showing that the average of many samples from a population tends to follow a normal distribution  (Nesselroade & Grimm, 2020).

  This opened the door to using the curve for all sorts of hypothesis testing and probability calculations

And who can forget Carl Gauss, the mathematical prodigy who discovered an error in his father's payroll at the age of 3 (Nesselroade & Grimm, 2020)? He not only popularized the normal distribution but also used to predict the reappearance of a lost asteroid with just a handful of observations. Talk about putting your theories to the test!!

Today, the normal curve is the bedrock of statistics, guiding everything from test score analysis to market research. It is a testament to the power of human curiosity and the unexpected connections that can lead to groundbreaking discoveries. So, the next time you see a bell curve, remember De Moivre, Simpson, Laplace, and Gauss - the pioneers who unlocked the secrets hidden within the randomness of numbers.

πŸ§ͺπŸ§ͺπŸ§ͺπŸ§ͺπŸ§ͺπŸ§ͺπŸ§ͺπŸ§ͺπŸ§ͺπŸ§ͺπŸ§ͺπŸ§ͺπŸ§ͺπŸ§ͺπŸ§ͺπŸ§ͺπŸ§ͺπŸ§ͺ 

 References

 Nesselroade, P. K. & Grimm, L. G. (2020). Statistical applications for the behavioral and social sciences (2nd ed.). Soomo Learning. https://www.webtexts.com

The Unsung Hero of Statistics

 William Gosset and the Revolution of Small Samples.

🟦🟦🟦🟦🟦🟦🟦🟦🟦🟦🟦🟦🟦🟦🟦🟦🟦🟦🟦🟦🟦🟦🟦🟦🟦🟦🟦🟦


  • In the early 20th century, the world of statistics was dominated by one assumption: large numbers mattered(Nesselroade & Grimm, 2020). But for William Gosset, a chemist brewing ale at Guinness, small samples held untold insights. His revolutionary work on t-distribution and t-tests not only transformed statistics but opened the door to scientific advancements in countless fields.
πŸ“…πŸ“…πŸ“…πŸ“…πŸ“…πŸ“…πŸ“…πŸ“…πŸ“…πŸ“…πŸ“…πŸ“…πŸ“…πŸ“…
Imagine trying to determine the best barley for brewing with only a handful of samples. Traditional methods, relying on the z distribution and massive datasets were useless. Gosset realized the need for a new approach, one that could unveil the secrets hidden within small collections of data.

πŸ‘€πŸ‘€πŸ‘€πŸ‘€πŸ‘€πŸ‘€πŸ‘€πŸ‘€πŸ‘€πŸ‘€πŸ‘€πŸ‘€πŸ‘€

His 1908 paper, "The Probable Error of a Mean," was a beacon in the statistical darkness. He recognized the limitations of the z curve and birthed the t distribution, a bell curve uniquely tuned to the whispers of small data. Armed with this new tool, Gosset crafted the t-test, a powerful technique for comparing means from two small samples. (Nesselroade & Grimm, 2020)

πŸ§ͺπŸ§ͺπŸ§ͺπŸ§ͺπŸ§ͺπŸ§ͺπŸ§ͺπŸ§ͺπŸ§ͺπŸ§ͺ

For the first time, scientists could draw meaningful conclusions from limited data. Imagine comparing the effectiveness of two fertilizers on corn yields or testing the shelf life of different brewing temperatures. Gosset's innovations made such discoveries possible, unlocking a new era of scientific inquiry.

πŸͺŸπŸͺŸπŸͺŸπŸͺŸπŸͺŸπŸͺŸπŸͺŸπŸͺŸπŸͺŸπŸͺŸπŸͺŸπŸͺŸ

Yet, Gosset's work wasn't met with immediate fanfare. He published under the pseudonym "Student" due to Guinness's restrictive publication policies. The irony was not lost on him; a man revolutionizing statistics had to hide his name. While some colleagues met his ideas with weighty apathy others recognized their brilliance. Ronald Fisher, a statistical giant himself, acknowledged Gosset's work as one of the most important publications in the history of inferential statistics.(Nesselroade & Grimm, 2020) 

πŸ³οΈβ€πŸŒˆπŸ³οΈβ€πŸŒˆπŸ³οΈβ€πŸŒˆπŸ³οΈβ€πŸŒˆπŸ³οΈβ€πŸŒˆπŸ³οΈβ€πŸŒˆπŸ³οΈβ€πŸŒˆπŸ³οΈβ€πŸŒˆπŸ³οΈβ€πŸŒˆπŸ³οΈβ€πŸŒˆπŸ³οΈβ€πŸŒˆπŸ³οΈβ€πŸŒˆπŸ³οΈβ€πŸŒˆ

Today, the t-test reigns supreme in countless scientific disciplines. From psychology and medicine to agriculture and business, it forms the backbone of countless research endeavors. Every time a scientist makes a claim based on small sample data, they pay homage to Gosset's legacy.


Gosset's story is more than just statistics; it's a testament to the power of curiosity and perseverance. He dared to challenge the status quo, venturing into the realm of the unknown and returning with tools that reshaped the scientific landscape. In an era obsessed with big data, his work reminds us that sometimes, the smallest whispers can hold the loudest truths.

πŸŸ₯πŸŸ₯πŸŸ₯πŸŸ₯πŸŸ₯πŸŸ₯πŸŸ₯πŸŸ₯

References


 Nesselroade, P. K. & Grimm, L. G. (2020). Statistical applications for the behavioral and social sciences (2nd ed.). Soomo Learning. https://www.webtexts.com

The Three Musketeers of Math: Mean, Median, and Mode

 Mean, Median, and Mode




🟰🟰🟰🟰🟰🟰🟰🟰🟰🟰🟰🟰🟰🟰🟰🟰🟰🟰🟰🟰🟰🟰🟰🟰🟰🟰🟰
Statistics is very intimidating to me and is kicking my ass this semester, so writing these blogs and relating them to something fun really helps me commit it to memory. So today I am introducing the Three Musketeers of Math: Mean, Median, and Mode. These swashbuckling statistics will help you understand any dataset like Zorro deciphers a secret message.

🌑️🌑️🌑️🌑️🌑️🌑️🌑️🌑️🌑️🌑️🌑️🌑️🌑️

Meet the Crew

  • The Average Avenger: Mean is the sum of all the values of your data divided by the number of values. Think of it as sharing a pizza equally among your friends. Everyone gets a slice. 
  • The Middle Mastermind: Median is the value that splits your data in half when ordered from least to greatest. Imagine lining up your friends by height. The median friend is smack dab in the middle, not the shortest or the tallest.
  • The Most Popular Posse: Mode is the value that appears most often in your data. It's like the friend who always shows up to parties, the life of the statistical soiree.

When to Call on Each Musketeer

Each Musketeer has their strengths and weaknesses. Mean is great for normally distributed data - think bell curve, but gets thrown off by outliers- think your friend who brought three extra pizzas - skewing the average. The median shines when you have skewed data or outliers, but it doesn't consider all the values like the mean does. Mode is all about popularity, but it can be unreliable is there's no clear favorite value- think of friends who are all equally awesome in their own way.

 πŸŸ₯πŸŸ₯πŸŸ₯πŸŸ₯πŸŸ₯πŸŸ₯πŸŸ₯πŸŸ₯πŸŸ₯πŸŸ₯


The Musketeers in action

Let's say you're tracking your video game scores: 10, 20, 30,30,40,50


  • The mean:( 10 +20+30+30+40+50) /6 = 30
  • The Median: Order the scores (10,20,30,30,40,50) the middle value is 30.
  • Mode: 30 appears twice, making it the most popular score.
 

The Mean, Median, and Mode are not rivals, they're complementary! Use them together to paint a richer picture of your data,

Featured Blog Post

Amphetamines: A History of Abuse and Addiction

 Amphetamines have a long and complex history, dating back thousands of years (Rosenthal, 2022). Originally they were used for medicinal pur...

Some Popular Posts from my blog