2024/01/28

Working with Ratio Scales

⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️

Ratio Scales, Definition, Examples, and Data Analysis

A ratio scale is quantitative with true zero and equal intervals between neighboring points.
A ratio scale of zero means a total absence of the variable you are measuring.
An interval scale does not have any of the above mentions.
Length, area, and population are examples of ratio scales.
The ratio level contains all of the features of the other 3 levels.
At the ratio level, values can be categorized, and ordered, have equal intervals, and take on a true zero.
Nominal and ordinal variables are categorical variables
Interval and Ratio variables are quantitative variables
Many more statistical tests can be performed on quantitative than categorical data

So What is a True Zero?? 🟥🟥🟥🟥🟥🟥🟥

On a ratio scale, a zero means there's a total absence of the variable of interest.

For example, the number of children in a household or years of work experience are ratio variables.
A respondent can have no children in their household or zero years of work experience.

With a true zero in your scale, you can calculate ratios of values.

For example, you can say that 4 children are twice as many as 2 children in a household and eight years is double 4 years of experience

Some variables, such as temperature, can be measured on different scales

Celcius and Fahrenheit are interval scales
Kelvin is a ratio scale
In all three scales, there are equal intervals between neighboring points
The Kelvin scale has a true zero, where nothing can be colder.
That means that you can only calculate ratios of temperatures in the Kelvin scale
A true zero makes it possible to multiply, divide, or square root values.
Collecting data on a ratio level is always preferable to the other levels because it is the most precise.

Examples of ratio scales ⏹️⏹️⏹️⏹️⏹️⏹️⏹️

Interval variables and ratio variables can be discrete or continuous.
A discrete variable is expressed only in countable numbers
A continuous variable can potentially take on an infinite number of values.
Number of vehicles owned in the last 10 years discrete
The number of people in a household discrete
The number of students who identify as religious discrete
reaction time in a computer task continuous
Years of work experience continuous
Speed in miles per hour continuous

🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥

Ratio Data Analysis

After you have collected ratio data, then you can gather descriptive and inferential statistics
Almost all statistical tests can be performed on ratio data because all mathematical operations are permissible

Ratio data example - you collect data on the commute duration of employees in a large city

the data is continuous and in minutes

To summarize your data, you can collect the following descriptive statistics :

the frequency distribution in numbers or percentages
the mode, median, or mean to find the central tendency
the range, standard deviation, and variance to indicate the variability

You can get an overview of the frequency of different values in a table and visualize their distribution in a graph
Enter your data into a grouped frequency distribution table.
Create groups with equal intervals on the left-hand column and enter the number of scores that fall within each interval into the right-hand column.

To visualize the data, plot it on a frequency distribution polygon.
Plot the groupings on the x-axis and the frequencies on the y-axis
Join the midpoint of each grouping using lines

Variability

The range, standard deviation and variance describe how spread your data is.
The range is the easiest to compute
The standard deviation and the variance describe how spread your data is and they are also more informative.
The coefficient of variation is a measure of spread that only applies to ratio variables

Range

To find the range subtract the lowest value from the highest value in your data set.

the range equals72.5 - 7 = 65.5

Statistical Tests

With a normal distribution of ratio data then parametric tests are best for testing hypotheses
Parametric tests are more powerful than non-parametric tests and you can make stronger conclusions with your data
The data must meet several requirements for parametric tests to apply
The following chart lists parametric tests that are some of the most common ones applied to test hypotheses about ratio data

🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥

References

Bhandari, P. (2020, August 28). Ratio Scales | Definition, Examples, & Data Analysis. Scribbr. https://www.scribbr.com/statistics/ratio-data/

‌

2024/01/27

Need help with ordinal data??

🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥

Ordinal data

classified into categories within a variable that has a natural rank order.

However, the distances between the categories are uneven or unknown.
For example, the variable frequency of physical exercise can be categorized into the following:

There is a clear order to these categories, but we cannot say that the difference between never and rarely is exactly the same as that between sometimes and often - therefore the scale is ordinal.

🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥

Ordinal is the second of four hierarchical levels of measurement

nominal, ordinal, interval, and ratio

Nominal data differs from ordinal data because it cannot be ranked in an order.
Interval data differs from ordinal data because the differences between adjacent scores are equal.

🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥

Examples of Ordinal Scales:

In social scientific research, ordinal variables often include ratings about opinions or perceptions, or demographic factors that are categorized into levels or brackets such as social status or income.

Language ability

beginner
intermediate
fluent

level of agreement

strongly disagree
disagree
neither agree nor disagree
agree strongly agree

Income level

lower level income
middle-level income

🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥

How to Collect Ordinal Data

ordinal variables are usually assessed using closed-ended survey questions that give participants several possible answers to choose from. These are user-friendly and let you easily compare data between participants.

⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️

Examples of Ordinal scale survey questions

what is your age

0 to 18
19 to 34
35 to 49
50 plus

what is your education level

primary school
high school
bachelors degree
master's degree
PhD

In the past three months, how many times did you buy groceries online

none
1 to 4 times
5 to 9 times
10-14 times
15 or more times

⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️

Likert Scale Data

Ordinal data is often collected using Likert scales. Likert scales are made up of 4 or more Likert type questions with continuum of response items for participants to choose from

Example of Likert-type questions

How frequently do you buy energy-efficient products?

never
rarely
sometimes
often
always

How important do you think it is to reduce your carbon footprint?

not important
slightly important
important
moderately important
very important

But it's important to note that not all mathematical operations can be performed on these numbers

Although you can say that two values in your data set are equal or unequal
you can say that one value is greater or less than another.
You cannot meaningfully add or subtract the values from each other.

This becomes relevant when gathering descriptive statistics about your data.

How to Analyze Ordinal Data⏹️⏹️⏹️⏹️⏹️⏹️

Ordinal data can be analyzed with both descriptive and inferential statistics.

Descriptive Statistics⏹️⏹️⏹️⏹️⏹️

With Ordinal Data

the frequency distribution in numbers or percentages.
the mode or the median to find the central tendency
the range to indicate the variability

Example

You ask 30 survey participants to indicate their level of agreement with the statement below

Regular physical exercise is important for my mental health.

strongly disagree
disagree
neither disagree nor agree
agree
strongly agree

⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️

To get an overview of your data, you an create a frequency distribution table that tells you how many times each response was selected

To visualize your data, you can use a bar graph

Plot your categories on the x-axis and the frequencies on the y-axis
Unlike nominal data,

the order of categories matters when displaying ordinal data.

Central Tendency⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️

The central tendency of your data is where most of your values lie

The mode, mean, and median are the three most commonly used measures of central tendency.

the mode can almost always be found for ordinal data
the median can only be found in some cases.
The mean can not be computed with ordinal data

finding the mean requires you to perform arithmetic operations like addition and division on the values in the data set
the differences between adjacent scores are unknown with ordinal data, these operations can not be performed for meaningful results.
the mode of your data is the most frequently appearing value.

the mode of the data set is agree

⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️

Medians

for odd and even numbered data sets are found in different ways

in an odd numbered data set the median is the value at the middle of your data set when it is ranked.
In an even-numbered data set, the median is the mean of the two values in the middle of your data set.

Order all data values and locate the middle of your data set to find the median

Since there are 30 values, there are 2 values in the middle at the 15th and 16th positions since both values are the same the median is Agree
If the two values in the middle were Agree and Strongly agree instead, then you could not find the mean since the mean of the two values can't be found even if you coded them numerically - so in this case there is no median.

⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️⏹️

Variability

Find the minimum, maximum, and range of your data set

code your data by assigning a number to each of the responses in order from lowest to highest.

1 strongly disagree
2 disagree
3 neither disagree nor agree
4 agree
5 strongly agree

The minimum is 1 and the maximum is 5

The range gives you an idea of how widely your scores differ from each other.
From this information, you can conclude there was at least one answer on either end of the scale.

From this information, you can conclude there was at least one answer on either end of the scale.

Statistical Tests

Inferential statistics will help you test scientific hypotheses about your data
The most appropriate statistical tests for ordinal data focus on the rankings of your measurements and these just happen to be non-parametric tests
Parametric tests are used when your data fulfills certain criteria like a normal distribution
Parametric tests assess means
non-parametric tests often assess medians or ranks.
There are many possible statistical tests that you can use for ordinal data.
The one you choose depends on your aims and the number and type of samples

🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥

References

Bhandari, Pritha. “Ordinal Data: What Is It and What Can You Do with It?” Scribbr, 12 Aug. 2020, www.scribbr.com/statistics/ordinal-data/.

‌

2024/01/22

Requiem for a Teaspoon: A Caffeinated Elegy

The Caffeinator: Dawn of Toxicity

🌡️🌡️🌡️🌡️🌡️🧊🧊🧊🧊🧊🧊🧊🧊🧊🧊🧊🧊🧊🧊

Pure and highly concentrated caffeine products pose a significant public health risk due to their potency and potential for overdose. These products, often sold in bulk and packaged without precise measuring tools, contain dangerously high levels of caffeine - a single teaspoon of powder can be equivalent to 28 cups of coffee. (Nutrition, 2023)

According to the FDA's 2023 article (Nutrition, 2023), common side effects like nervousness are magnified, and potentially fatal consequences like seizures and rapid heartbeat can occur. Consumers unaware of this potency compared to regular coffee are particularly at risk. The FDA actively monitors and takes action against these products, including seizure and injunctions according to their recently published article (Nutrition, 2023).

According to Nutrition (2023), the following is a timeline on the FDA's action on pure and highly concentrated caffeine:

On September 1, 2015 - the FDA issued warning letters to five distributors of pure powdered caffeine products.
March 2016 - The FDA issued two additional warning letters
On April 13, 2018, the FDA released guidance for the industry on highly concentrated caffeine in dietary supplements

This document provides guidance for companies who manufacture, market, or distribute dietary supplements containing pure or highly concentrated caffeine or are considering doing so, to help them determine when a product is considered adulterated and illegal by the FDA

🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥

References

Nutrition, C. for F. S. and A. (2023). Pure and Highly Concentrated Caffeine. FDA. https://www.fda.gov/food/dietary-supplement-ingredient-directory/pure-and-highly-concentrated-caffeine

‌

Decoding the Data Whisperer:

A Beginner's Guide to Z-Scores

🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥🟥

Imagine you're in a classroom full of students who all took the same exam. You scored 75, but how does that compare to everyone else? Did you ace it or just barely scrape by? Enter the z-score, a powerful tool that helps you understand your position within a dataset.

Think of z-scores as a translator. It takes your raw score and converts it into a universal language, telling you how many standard deviations away you are from the mean or average of the group. S standard deviation is basically a measure of how spread out the data is.

The lowdown on z-scores:

The Formula:

Interpretation:

Positive z-score means you scored above the mean, The higher the z-score, the further above the mean you are. For example, a z score of 2 means that you are 2 standard deviations above the average.

A negative z score means that you scored below the mean. The more negative the z-score, the further below the mean you are.

A z-score of 0 then you are right on the mean.

Benefits of Z-Scores:

you can compare apples to oranges. You can compare data from different sets with different units. Imagine comparing your exam score to your friend's height. Z-scores make it possible by putting both scores on the same scale.
Spot outliers: Extreme values that deviate significantly from the rest of the data can be easily identified with z-scores. A z-score far above or below the others might indicate an error or a unique case that deserves further investigation.
Predict probabilities: Knowing the z-score and the properties of the normal distribution- the bell curve-, you can estimate the percentage of the population that scored lower or higher than you.

Pages

2024/01/28

Working with Ratio Scales

Ratio Scales, Definition, Examples, and Data Analysis

2024/01/27

Ordinal Data:

Need help with ordinal data??

2024/01/22

Requiem for a Teaspoon: A Caffeinated Elegy

The Caffeinator: Dawn of Toxicity

Decoding the Data Whisperer:

A Beginner's Guide to Z-Scores

Featured Blog Post

Amphetamines: A History of Abuse and Addiction

Some Popular Posts from my blog