For Whom the Bell Curves

From Gambling Odds to Universal Truth




๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช

Have you ever wondered why test scores, heights, and even plant sizes seem

to follow a predictable pattern? The answer lies in a mysterious bell-shaped curve known as the normal distribution. But this ubiquitous curve was not just handed down on a silver platter - it emerged from the mind of a brilliant mathematician named Abraham De Moivre in the 18th century (Nesselroade & Grimm, 2020).


  • De Moivre, a friend of legends like Halley and Newton, was not interested in boring old graphs. He was fascinated by the odds of chance, specifically the probability of flipping a coin a thousand times and getting between 500 and 600 heads. As he crunched the numbers, something magical happened, the results began to form a bell-shaped curve.
  • ๐ŸŸฐ๐ŸŸฐ๐ŸŸฐ๐ŸŸฐ๐ŸŸฐ๐ŸŸฐ๐ŸŸฐ๐ŸŸฐ๐ŸŸฐ๐ŸŸฐ๐ŸŸฐ๐ŸŸฐ๐ŸŸฐ๐ŸŸฐ๐ŸŸฐ๐ŸŸฐ๐ŸŸฐ๐ŸŸฐ๐ŸŸฐ๐ŸŸฐ๐ŸŸฐ๐ŸŸฐ๐ŸŸฐ๐ŸŸฐ

  • But how did he get that iconic formula with its mysterious constants like pi and e? Well, that's a secret lost to the writing style of the time, where results were proudly displayed but methods remained tightly under wraps (Nesselroade & Grimm, 2020). It's a tantalizing glimpse into De Moivre's mind, bending seemingly unrelated fields like gambling and geometry to birth a universal truth. 

๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️

While De Moivre laid the groundwork, others helped the bell curve ring far and wide. Thomas Simpson extended it to continuous measurements like star positions, proving that averaging multiple observations in the sky is the key to minimizing error. Then came Pierre Leplace with his Central Limit Theorem, the grandaddy of statistics, showing that the average of many samples from a population tends to follow a normal distribution  (Nesselroade & Grimm, 2020).

  This opened the door to using the curve for all sorts of hypothesis testing and probability calculations

And who can forget Carl Gauss, the mathematical prodigy who discovered an error in his father's payroll at the age of 3 (Nesselroade & Grimm, 2020)? He not only popularized the normal distribution but also used to predict the reappearance of a lost asteroid with just a handful of observations. Talk about putting your theories to the test!!

Today, the normal curve is the bedrock of statistics, guiding everything from test score analysis to market research. It is a testament to the power of human curiosity and the unexpected connections that can lead to groundbreaking discoveries. So, the next time you see a bell curve, remember De Moivre, Simpson, Laplace, and Gauss - the pioneers who unlocked the secrets hidden within the randomness of numbers.

๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช 

 References

 Nesselroade, P. K. & Grimm, L. G. (2020). Statistical applications for the behavioral and social sciences (2nd ed.). Soomo Learning. https://www.webtexts.com

The Unsung Hero of Statistics

 William Gosset and the Revolution of Small Samples.

๐ŸŸฆ๐ŸŸฆ๐ŸŸฆ๐ŸŸฆ๐ŸŸฆ๐ŸŸฆ๐ŸŸฆ๐ŸŸฆ๐ŸŸฆ๐ŸŸฆ๐ŸŸฆ๐ŸŸฆ๐ŸŸฆ๐ŸŸฆ๐ŸŸฆ๐ŸŸฆ๐ŸŸฆ๐ŸŸฆ๐ŸŸฆ๐ŸŸฆ๐ŸŸฆ๐ŸŸฆ๐ŸŸฆ๐ŸŸฆ๐ŸŸฆ๐ŸŸฆ๐ŸŸฆ๐ŸŸฆ


  • In the early 20th century, the world of statistics was dominated by one assumption: large numbers mattered(Nesselroade & Grimm, 2020). But for William Gosset, a chemist brewing ale at Guinness, small samples held untold insights. His revolutionary work on t-distribution and t-tests not only transformed statistics but opened the door to scientific advancements in countless fields.
๐Ÿ“…๐Ÿ“…๐Ÿ“…๐Ÿ“…๐Ÿ“…๐Ÿ“…๐Ÿ“…๐Ÿ“…๐Ÿ“…๐Ÿ“…๐Ÿ“…๐Ÿ“…๐Ÿ“…๐Ÿ“…
Imagine trying to determine the best barley for brewing with only a handful of samples. Traditional methods, relying on the z distribution and massive datasets were useless. Gosset realized the need for a new approach, one that could unveil the secrets hidden within small collections of data.

๐Ÿ‘€๐Ÿ‘€๐Ÿ‘€๐Ÿ‘€๐Ÿ‘€๐Ÿ‘€๐Ÿ‘€๐Ÿ‘€๐Ÿ‘€๐Ÿ‘€๐Ÿ‘€๐Ÿ‘€๐Ÿ‘€

His 1908 paper, "The Probable Error of a Mean," was a beacon in the statistical darkness. He recognized the limitations of the z curve and birthed the t distribution, a bell curve uniquely tuned to the whispers of small data. Armed with this new tool, Gosset crafted the t-test, a powerful technique for comparing means from two small samples. (Nesselroade & Grimm, 2020)

๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช

For the first time, scientists could draw meaningful conclusions from limited data. Imagine comparing the effectiveness of two fertilizers on corn yields or testing the shelf life of different brewing temperatures. Gosset's innovations made such discoveries possible, unlocking a new era of scientific inquiry.

๐ŸชŸ๐ŸชŸ๐ŸชŸ๐ŸชŸ๐ŸชŸ๐ŸชŸ๐ŸชŸ๐ŸชŸ๐ŸชŸ๐ŸชŸ๐ŸชŸ๐ŸชŸ

Yet, Gosset's work wasn't met with immediate fanfare. He published under the pseudonym "Student" due to Guinness's restrictive publication policies. The irony was not lost on him; a man revolutionizing statistics had to hide his name. While some colleagues met his ideas with weighty apathy others recognized their brilliance. Ronald Fisher, a statistical giant himself, acknowledged Gosset's work as one of the most important publications in the history of inferential statistics.(Nesselroade & Grimm, 2020) 

๐Ÿณ️‍๐ŸŒˆ๐Ÿณ️‍๐ŸŒˆ๐Ÿณ️‍๐ŸŒˆ๐Ÿณ️‍๐ŸŒˆ๐Ÿณ️‍๐ŸŒˆ๐Ÿณ️‍๐ŸŒˆ๐Ÿณ️‍๐ŸŒˆ๐Ÿณ️‍๐ŸŒˆ๐Ÿณ️‍๐ŸŒˆ๐Ÿณ️‍๐ŸŒˆ๐Ÿณ️‍๐ŸŒˆ๐Ÿณ️‍๐ŸŒˆ๐Ÿณ️‍๐ŸŒˆ

Today, the t-test reigns supreme in countless scientific disciplines. From psychology and medicine to agriculture and business, it forms the backbone of countless research endeavors. Every time a scientist makes a claim based on small sample data, they pay homage to Gosset's legacy.


Gosset's story is more than just statistics; it's a testament to the power of curiosity and perseverance. He dared to challenge the status quo, venturing into the realm of the unknown and returning with tools that reshaped the scientific landscape. In an era obsessed with big data, his work reminds us that sometimes, the smallest whispers can hold the loudest truths.

๐ŸŸฅ๐ŸŸฅ๐ŸŸฅ๐ŸŸฅ๐ŸŸฅ๐ŸŸฅ๐ŸŸฅ๐ŸŸฅ

References


 Nesselroade, P. K. & Grimm, L. G. (2020). Statistical applications for the behavioral and social sciences (2nd ed.). Soomo Learning. https://www.webtexts.com

The Three Musketeers of Math: Mean, Median, and Mode

 Mean, Median, and Mode




๐ŸŸฐ๐ŸŸฐ๐ŸŸฐ๐ŸŸฐ๐ŸŸฐ๐ŸŸฐ๐ŸŸฐ๐ŸŸฐ๐ŸŸฐ๐ŸŸฐ๐ŸŸฐ๐ŸŸฐ๐ŸŸฐ๐ŸŸฐ๐ŸŸฐ๐ŸŸฐ๐ŸŸฐ๐ŸŸฐ๐ŸŸฐ๐ŸŸฐ๐ŸŸฐ๐ŸŸฐ๐ŸŸฐ๐ŸŸฐ๐ŸŸฐ๐ŸŸฐ๐ŸŸฐ
Statistics is very intimidating to me and is kicking my ass this semester, so writing these blogs and relating them to something fun really helps me commit it to memory. So today I am introducing the Three Musketeers of Math: Mean, Median, and Mode. These swashbuckling statistics will help you understand any dataset like Zorro deciphers a secret message.

๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️

Meet the Crew

  • The Average Avenger: Mean is the sum of all the values of your data divided by the number of values. Think of it as sharing a pizza equally among your friends. Everyone gets a slice. 
  • The Middle Mastermind: Median is the value that splits your data in half when ordered from least to greatest. Imagine lining up your friends by height. The median friend is smack dab in the middle, not the shortest or the tallest.
  • The Most Popular Posse: Mode is the value that appears most often in your data. It's like the friend who always shows up to parties, the life of the statistical soiree.

When to Call on Each Musketeer

Each Musketeer has their strengths and weaknesses. Mean is great for normally distributed data - think bell curve, but gets thrown off by outliers- think your friend who brought three extra pizzas - skewing the average. The median shines when you have skewed data or outliers, but it doesn't consider all the values like the mean does. Mode is all about popularity, but it can be unreliable is there's no clear favorite value- think of friends who are all equally awesome in their own way.

 ๐ŸŸฅ๐ŸŸฅ๐ŸŸฅ๐ŸŸฅ๐ŸŸฅ๐ŸŸฅ๐ŸŸฅ๐ŸŸฅ๐ŸŸฅ๐ŸŸฅ


The Musketeers in action

Let's say you're tracking your video game scores: 10, 20, 30,30,40,50


  • The mean:( 10 +20+30+30+40+50) /6 = 30
  • The Median: Order the scores (10,20,30,30,40,50) the middle value is 30.
  • Mode: 30 appears twice, making it the most popular score.
 

The Mean, Median, and Mode are not rivals, they're complementary! Use them together to paint a richer picture of your data,

The Incredible Experiment:

 A Superhero Guide to Independent and Dependent Variables


๐ŸงŠ๐ŸงŠ๐ŸงŠ๐ŸงŠ๐ŸงŠ๐ŸงŠ๐ŸงŠ๐ŸงŠ๐ŸงŠ๐ŸงŠ๐ŸงŠ๐ŸงŠ๐ŸงŠ๐ŸงŠ๐ŸงŠ๐ŸงŠ๐ŸงŠ๐ŸงŠ๐ŸงŠ๐ŸงŠ๐ŸงŠ๐ŸงŠ๐ŸงŠ๐ŸงŠ๐ŸงŠ๐ŸงŠ

Ever dreamed of being a scientist, wielding potions and peering through microscopes? Let us embark on a scientific adventure, unraveling the secrets of independent and dependent variables, your super tools for understanding the world around you!


Imagine a superhero lab



  • Professor Potential who is our wise mentor, mixing bubbling concoctions and spouting scientific wisdom.
  • Experiment X who is our trusty robot sidekick, ready for any test.
  • The question is will Professor Potential's new super-strength serum actually work on experiment X


The Key Players

  • Independent Variable - aka The Twister: This is the variable we change or control in our experiment. Just like Professor Potential changing the formula of the serum, the independent variable gets twisted and turned to see its effect.\
  • Dependent Variable - aka the Detector: This is the variable we measure or observe to see how it reacts to the changes in the independent variable. Experiment X will lift weights to see if his strength increases - that is the dependent variable, the detector of the serum's power.

The Big Showdown:


Professor Potential whips up different serums, changing the amount to a special ingredient which is the independent variable. Experiment X gulps them down and lifts weights with all his might. We measure how much he lifts which is the dependent variable - does it skyrocket with each new formula? 

The Reveal:


If Experiment X is suddenly bench pressing cars after the super-strength serum, it means the independent variable which is the serum formula has a clear effect on the dependent variable which is the weight lifted. If he is still struggling with tiny dumbbells, well, back to the lab!


  • The Independent variable controls the show, the variable we twist and turn.
  • The dependent variable watches the results, the variable that changes or doesn't in response.






Are Werewolves Real??

 Investigating the Moon and Relapse with Statistics

๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช

Ever heard that full moons bring out the crazies? Or maybe just a few extra patients in the emergency room? While the image of howling werewolves might be exaggerated, the question of a lunar influence on human behavior persists. Today, we'll put on our lab coats and use statistics to investigate the fascinating and often debated connection between moon phases and relapse rates.

๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️

The Suspect: Luna, the Earth's Satellite



Lunar lore spans centuries, with beliefs linking the moon to everything from tides to fertility. But for our investigation, the focus is on potential changes in human physiology or psychology based on the moon's phases. Some theories suggest gravitational or electromagnetic forces might play a role, while others point to altered sleep patterns or increased suggestibility under a full moon.

Gathering Evidence: The Power of Data

To test these theories, we need data. Lots of it. This means tracking relapse rates for specific conditions like addiction, and mental health episodes over time, alongside the corresponding lunar phases.

The Statistical Sleuthing:


Here's where the real fun begins! We can use various statistical tools to examine the data and see if there is any connection between phases and relapse. Let's explore some possibilities:
  • Chi-square test: This tests whether the observed distribution of relapses across lunar phases is different from what we would expect by chance.
  • Correlation coefficient: This measures the strength and direction of any relationship between lunar phases and relapse rates.
  • Regression analysis: This allows us to control for other factors that might influence relapses
     such as seasonality or weather, and see if the moon effect remains significant.
Even if some studies show a faint lunar link, it's crucial to remember correlation does not equal causation because there could be other, unknown factors at play.

So, are werewolves real? Based on current research, probably not. But the moon's influence on human behavior remains a captivating mystery. Statistical analysis helps us piece together the clues, but until the evidence speaks louder, we should maintain a healthy dose of skepticism and keep exploring.

Remember, science is a journey, not a destination. And in the realm of lunar mysteries, every full moon might just bring us a new chapter
in the story.



Data Detective:

 Cracking the Case of Interval and Ratio Data!!!

๐ŸงŠ๐ŸงŠ๐ŸงŠ๐ŸงŠ๐ŸงŠ๐ŸงŠ๐ŸงŠ๐ŸงŠ๐ŸงŠ๐ŸงŠ๐ŸงŠ๐ŸงŠ๐ŸงŠ๐ŸงŠ๐ŸงŠ๐ŸงŠ๐ŸงŠ๐ŸงŠ๐ŸงŠ๐ŸงŠ๐ŸงŠ๐ŸงŠ๐ŸงŠ๐ŸงŠ๐ŸงŠ๐ŸงŠ๐ŸงŠ
Remember the thrilling mystery novels where clues whisper secrets and numbers hold hidden truths? Well, buckle up, fellow data detective, because this time we are cracking the case of interval and ratio data!



Gone are the days of simple yes or no answers - nominal data- or rankings without measurements - ordinal data. Now we are dealing with numbers that sing, dance, and reveal fascinating secrets about the world around us.



Interval data 


Imagine a thermometer. It displays degrees, from freezing cold to scorching hot, but there's no true zero. Zero on a thermometer does not mean the absence of heat, just some arbitrary starting point. That is interval data: numbers with equal differences, but no absolute reference point.


Think of it like a ruler. Each centimeter is the same, but you would not say a book measured at zero centimeters is nonexistent. It just starts at a different point than your ruler's zero.

๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️


Ratio data


Now, picture a fancy scale, measuring your weight with a precise zero. This my friends is ratio data. It has all the benefits of interval data - equal differences - but with an added superpower: a true absolute zero. ero weight means no weight at all, not just some starting point in a system.

๐Ÿ‘€๐Ÿ‘€๐Ÿ‘€๐Ÿ‘€๐Ÿ‘€๐Ÿ‘€

Think of time: zero seconds is truly the absence of time, not just a starting point for your stopwatch. 

So what is the difference????

Imagine a race: With interval data, you know who came first, second, and third, but not their exact times. Ratio data reveals everyone's exact finishing times, allowing you to calculate speeds, and gaps, and even predict future winners!

Why does it matter???๐ŸŸฅ๐ŸŸฅ๐ŸŸฅ๐ŸŸฅ๐ŸŸฅ๐ŸŸฅ

Choosing the right data type is like picking the perfect tool for the job. Using interval data for calculations that require a true zero can lead to skewed results, like trying to hammer a nail with a spoon.

So the next time you encounter a set of numbers, don't just stare blankly. Put on your detective hat and ask Interval or ratio? Numbers hold the key to understanding temperature changes, predicting economic trends, and even measuring the speed of that falling toast-ratio
data, by the way.







 


Let's Sort

 Sorting it out: A Guide to Ordinal and

Nominal Data

๐ŸชŸ๐ŸชŸ๐ŸชŸ๐ŸชŸ๐ŸชŸ๐ŸชŸ๐ŸชŸ๐ŸชŸ๐ŸชŸ๐ŸชŸ๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐ŸชŸ๐ŸชŸ๐ŸชŸ๐ŸชŸ๐ŸชŸ๐ŸชŸ๐ŸชŸ๐ŸชŸ๐ŸชŸ๐ŸชŸ๐ŸชŸ

Data, data, everywhere! Numbers dance across spreadsheets, charts bursting with colorful bars...but not all data is created equal. Ordinal and nominal, ever heard of them?



Imagine a movie theatre:
  • The seat numbers tell you where to sit, but there is no inherent order or comparison between them. You would not claim that one seat is better than the other. This is basically nominal data.
  • Picture the rows: front row, middle row! The front row sits closer to the screen, the back row farther, and the middle row falls somewhere in between. Each level has a definite rank compared to the others. This is ordinal data.
๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช๐Ÿงช

OK what is the difference?

  • Nominal data:
    • Think categories, not ranks. Imagine hair color, political party, or music genre. These are nominal-just labels that group things together without implying any order or inherent relationship.
    • Surveys and questionnaires love them. Ask "What is your favorite color? and you will get nominal data like blue, green, and purple without inherent order just individual categories.
    • Counting and percentages are their forte. We can count how many people like each color, but we cannot say blue is greater than green.
  • Ordinal data
    • Ranks matter! Think movie rows, exam grades, or clothing sizes. These levels have a clear order, each higher than the one below.
    • They tell you more than or less than. A student with an A outperformed someone with a C. A large shirt is bigger than a medium.
    • But beware of stretching the order!! Ordinal data does not always allow for equal intervals between levels. A B student is not necessarily twice as good as a D student.
๐ŸงŠ๐ŸงŠ๐ŸงŠ๐ŸงŠ๐ŸงŠ๐ŸงŠ๐ŸงŠ๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️๐ŸŒก️๐Ÿ‘€๐Ÿ‘€๐Ÿ‘€๐Ÿ‘€๐Ÿ‘€๐Ÿ‘€๐Ÿ‘€๐Ÿ‘€๐Ÿ‘€๐Ÿ‘€๐Ÿ‘€๐Ÿ‘€๐Ÿ‘€
Why does it matter??

Choosing the right data type is crucial for accurate analysis and meaningful conclusions. Using nominal data for calculations that assume order can lead to misleading results. Conversely, forcing ordinal data into strict mathematical operations might not make sense.


Data is like legos: different pieces fit together in different ways. Knowing which type you're holding is key to building something insightful and robust.

So, next time you see data dancing around, do not be afraid to ask: ordinal or nominal to unlock a hidden story within the numbers. 

๐Ÿณ️‍๐ŸŒˆ๐Ÿณ️‍๐ŸŒˆ๐Ÿณ️‍๐ŸŒˆ๐Ÿณ️‍๐ŸŒˆ๐Ÿณ️‍๐ŸŒˆ๐Ÿณ️‍๐ŸŒˆ๐Ÿณ️‍๐ŸŒˆ๐Ÿณ️‍๐ŸŒˆ๐Ÿณ️‍๐ŸŒˆ๐Ÿณ️‍๐ŸŒˆ๐Ÿณ️‍๐ŸŒˆ๐Ÿณ️‍๐ŸŒˆ

There are other data types out there, like interval and ratio data, each with their own quirks and strengths.

Featured Blog Post

Amphetamines: A History of Abuse and Addiction

 Amphetamines have a long and complex history, dating back thousands of years (Rosenthal, 2022). Originally they were used for medicinal pur...

Popular Posts