๋ณธ๋ฌธ ๋ฐ”๋กœ๊ฐ€๊ธฐ

๐Ÿ’ซ ์ˆ˜ํ•™

Designing Studies

 Introduction to Probability and Data ์ˆ˜์—…์„ ๋“ค์œผ๋ฉฐ ๋‚จ๊ธด ๋…ธํŠธ์ž…๋‹ˆ๋‹ค.

Week1: Designing studies
1. Data Basics
2. Observational Studies & Experiments
3. Sampling and sources of bias
4. Experimental Design
5. Random Sample Assignment

1. Data Basics

* Observations / Variables / Data matrices

 

* types of variables

all variable

Numerical

(quantitative)

Categorical

(qualitative)

๋”ํ•˜๊ณ , ๋นผ๊ณ , ํ‰๊ท ์„ ๊ตฌํ•  ์ˆ˜ ์žˆ๋Š” ์ˆ˜์น˜์  ๊ฐ’์„ ๊ฐ€์ง„๋‹ค. 

 

take on numerical values sensible to add, subtract, take averages, etc. with these values.

์ œํ•œ๋œ ์ˆ˜์˜ ์„œ๋กœ ๋‹ค๋ฅธ ๋ฒ”์ฃผ๋ฅผ ๊ฐ’์„ ๊ฐ€์ง„๋‹ค.
์ˆซ์ž๋กœ ์‹๋ณ„ํ•  ์ˆ˜๋„ ์žˆ์ง€๋งŒ,
๊ทธ ์ˆซ์ž๋กœ ์‚ฐ์ˆ ์—ฐ์‚ฐ์„ ์ˆ˜ํ–‰ํ•˜๋Š” ๊ฒƒ์€ ์ ์ ˆ์น˜ ๋ชปํ•˜๋‹ค.

 

take on a limited number of distinct categories can be identified with numbers, 

but not sensible to do arithmetic operations.

Continuous

Discrete

Regular Categorical

Ordinal

์ฃผ์–ด์ง„ ๋ฒ”์œ„ ๋‚ด์—์„œ ๋ฌดํ•œํ•œ ์‹ค์ˆ˜์˜ ๊ฐ’์„ ์ทจํ•œ๋‹ค. 

take on any of an infinite number of values within a given range.

ํŠน์ • ์ˆซ์ž ๊ฐ’ ์ง‘ํ•ฉ ์ค‘ ํ•˜๋‚˜๋ฅผ ์ทจํ•ฉ๋‹ˆ๋‹ค.

take on one of a specific set of numeric values.

๊ณ ์œ ํ•œ ์ˆœ์„œ๊ฐ€ ์—†๋Š” ๊ฒฝ์šฐ ๊ทธ๋ƒฅ caregorical ๋ณ€์ˆ˜๋ผ ๋ถ€๋ฅธ๋‹ค.

(e.g ๋‹น์‹ ์€ ์•„์นจํ˜•์ธ๊ฐ„์ž…๋‹ˆ๊นŒ ์ €๋…˜ํ˜• ์ธ๊ฐ„์ž…๋‹ˆ๊นŒ?)

๊ณ ์œ ํ•œ ์ƒ์†์  ์ˆœ์„œ๊ฐ€ ์žˆ๋Š” ๋ณ€์ˆ˜.

levels have an inherent ordering.

  • Continuous Numerical ๋ณ€์ˆ˜๋Š” ๋ณดํ†ต ์ธก์ •๋œ ๊ฒƒ์ด๋‹ค. (e.g. ์‹ ์žฅ 163.8 cm .. 164.1 cm ...)

  • Count variable์€ Discrete ์ด์‚ฐํ˜• ๋ณ€์ˆ˜์ด๋‹ค.

  • Continuous ๋ณ€์ˆ˜๋ฅผ roundingํ•˜๋ฉด ์ด์‚ฐํ˜• ๋ณ€์ˆ˜์ฒ˜๋Ÿผ ๋‚˜ํƒ€๋‚ผ ์ˆ˜ ์žˆ๋‹ค.

  • Categorical๋ฒ”์ฃผํ˜• ๋ณ€์ˆ˜๋ฅผ ์ˆซ์ž๋กœ ์‹๋ณ„ํ•  ์ˆ˜ ๋„ ์žˆ๋‹ค. (์—ฌ์„ฑ์€ 0, ๋‚จ์„ฑ์€ 1 ๊ณผ ๊ฐ™์ด) 

  • ํ•˜์ง€๋งŒ Categorical ๋ณ€์ˆ˜ ๊ฐ’์œผ๋กœ ์‚ฐ์ˆ ์—ฐ์‚ฐ์„ ํ•˜๋Š” ๊ฒƒ์€ ์ ์ ˆ์น˜ ๋ชปํ•˜๋‹ค.

 

2. observational studies and experiments 

study

observational

experiment

  • collect data in a way that does not directly interfere with how the data arise.("observe")

  • only establish an association.

  • retrospective: uses past data.

  • prospective: data are collected throughout the study.

  • randomly assign subject to treatments

  • establish a causal connection

 

  • ๊ด€์ธก ์—ฐ๊ตฌ์—์„œ ์—ฐ๊ตฌ์ž๋“ค์€ ๊ด€์ธก ์—ฐ๊ตฌ์—์„œ ์—ฐ๊ตฌ์ž๋“ค์€ ๋ฐ์ดํ„ฐ๊ฐ€ ์–ด๋–ป๊ฒŒ ๋ฐœ์ƒํ•˜๋Š”์ง€ ์ง์ ‘ ๊ฐ„์„ญํ•˜์ง€ ์•Š๋Š” ๋ฐฉ์‹์œผ๋กœ ๋ฐ์ดํ„ฐ๋ฅผ ์ˆ˜์ง‘ํ•œ๋‹ค. 

  • ์ฆ‰, ๊ทธ๋“ค์€ ๋‹จ์ง€ ๊ด€์ฐฐํ•œ๋‹ค. ๊ทธ๋ฆฌ๊ณ  ๊ด€์ธก ์—ฐ๊ตฌ๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ association๋ฅผ ์„ค๋ฆฝ ํ•  ์ˆ˜ ์žˆ๋‹ค. ํ•˜์ง€๋งŒ ์ธ๊ณผ๊ด€๊ณ„๋ฅผ ์ฃผ์žฅํ•  ์ˆ˜๋Š” ์—†๋‹ค.

  • association์ด๋ž€ ์„ค๋ช…๋ณ€์ˆ˜์™€ ๋ฐ˜์‘๋ณ€์ˆ˜ ๊ฐ„์˜ ์ƒ๊ด€๊ด€๊ณ„๋ฅผ ๋งํ•œ๋‹ค. correlation between explanatory and the response variables.

  • ๊ด€์ธก ์—ฐ๊ตฌ๊ฐ€ ๊ณผ๊ฑฐ์˜ ์ž๋ฃŒ๋ฅผ ์‚ฌ์šฉํ•œ๋‹ค๋ฉด, ๊ทธ๊ฒƒ์€ retrospective study๋ผ ํ•œ๋‹ค.

  • ๋ฐ˜๋ฉด ์—ฐ๊ตฌ ๋„์ค‘์— ์ˆ˜์ง‘ ๋œ ๋ฐ์ดํ„ฐ๋ฅผ ์‚ฌ์šฉํ•œ๋‹ค๋ฉด prospective ์—ฐ๊ตฌ๋ผ๊ณ  ํ•œ๋‹ค.

  • observational studies์™€ experiments์˜ ๊ฐ€์žฅ ํฐ ์ฐจ์ด์ ์€ Random assignment ์ด๋‹ค.

  • ๋Œ€๋ถ€๋ถ„์˜ experiments์—์„œ๋Š” random assignment๋ฅผ ์‚ฌ์šฉํ•˜๊ณ , observational study์—์„œ๋Š” ์‚ฌ์šฉํ•˜์ง€ ์•Š๋Š”๋‹ค.

 

* random assignment? 

์ •๊ธฐ์ ์ธ ์šด๋™๊ณผ ์—๋„ˆ์ง€ ์ˆ˜์ค€ ์‚ฌ์ด์˜ ๊ด€๊ณ„๋ฅผ ํ‰๊ฐ€ํ•˜๊ณ  ์‹ถ์„๋•Œ, ์—ฐ๊ตฌ๋ฅผ ๊ด€์ธก ์—ฐ๊ตฌ๋กœ ๋˜๋Š” ์‹คํ—˜์œผ๋กœ ์„ค๊ณ„ ํ•  ์ˆ˜ ์žˆ๋‹ค.

 

1. ๊ด€์ธก ์—ฐ๊ตฌ์—์„œ๋Š” ๋ชจ์ง‘๋‹จ์—์„œ ์šด๋™์„ ์„ ํƒํ•˜๋Š” ์‚ฌ๋žŒ๋“ค๊ณผ ๊ทธ๋ ‡์ง€ ์•Š์€ ๋‘ ๊ฐ€์ง€ ์œ ํ˜•์˜ ์‚ฌ๋žŒ๋“ค๋กœ ์ƒ˜ํ”Œ์„ ์ถ”์ถœํ•œ๋‹ค. ๊ทธ๋ฆฌ๊ณ  ๋‘ ๊ทธ๋ฃน์˜ ํ‰๊ท  ์—๋„ˆ์ง€ ์ˆ˜์ค€์„ ๊ด€์ธกํ•ด ๋น„๊ตํ•œ๋‹ค.
2. ์‹คํ—˜์—๋Š” ์‚ฌ๋žŒ๋“ค์„ ๋ฌด์ž‘์œ„๋กœ ๋‘ ๊ทธ๋ฃน์œผ๋กœ ํ• ๋‹นํ•œ๋‹ค. (์—ฐ๊ตฌ ๊ธฐ๊ฐ„ ๋™์•ˆ ์ •๊ธฐ์ ์œผ๋กœ ์šด๋™์„ ํ•  ์‚ฌ๋žŒ๋“ค๊ณผ ๊ทธ๋ ‡์ง€ ์•Š์€ ์‚ฌ๋žŒ๋“ค๋กœ)

 

1๊ณผ 2์˜ ๋ฐฉ์‹์˜ ์ฐจ์ด์ ์€ ์šด๋™์„ ํ• ์ง€ ์•ˆํ• ์ง€์— ๋Œ€ํ•œ ๊ฒฐ์ •์„ ์‹คํ—˜ ๋Œ€์ƒ์ž๊ฐ€ ํ•˜๋Š๋ƒ, ์—ฐ๊ตฌ์ž๊ฐ€ ๋ถ€๊ณผํ•˜๋Š”๋ƒ์˜ ์ฐจ์ด์ด๋‹ค.

The difference is that the decision of whether to work out or not is not left up to the subjects as in the observational study, but is instead imposed by the researcher.

๊ฒฐ๊ณผ์ ์œผ๋กœ ๋‘ ๊ทธ๋ฃน ๊ฐ„์˜ ํ‰๊ท  ์—๋„ˆ์ง€ ์ˆ˜์ค€์˜ ์˜๋ฏธ์žˆ๋Š” ์ฐจ์ด๋ฅผ ๋ฐœ๊ฒฌํ•˜๋”๋ผ๋„ observational study์—์„œ๋Š” ์ •๊ธฐ์  ์šด๋™์ด๋ผ๋Š” attribute๊ฐ€ ์ „์ ์œผ๋กœ ์ด ์ฐจ์ด๋ฅผ ๋งŒ๋“ค์–ด๋ƒˆ๋‹ค๊ณ  ๋งํ•  ์ˆ˜ ์—†๋‹ค. ์ด study์—์„œ ํ†ต์ œํ•˜์ง€ ๋ชปํ•œ ๋‹ค๋ฅธ attribute๊ฐ€ ๊ด€์ฐฐ๋œ ์ฐจ์ด์— ๊ธฐ์—ฌํ–ˆ์„ ์ˆ˜ ์žˆ๊ธฐ ๋•Œ๋ฌธ์ด๋‹ค.

 

ํ•˜์ง€๋งŒ random assignment๋กœ ์ธํ•ด ์‹คํ—˜์—์„œ๋Š” ๊ฒฐ๊ณผ์— ์˜ํ–ฅ์„ ๋ผ์น  ์ˆ˜ ์žˆ๋Š” variable์ด ๋‘ ๊ทธ๋ฃน์—์„œ ๊ฑฐ์˜ ๋™์ผํ•˜๊ฒŒ ๋‚˜ํƒ€๋‚œ๋‹ค. ๊ทธ๋ ‡๊ธฐ๋•Œ๋ฌธ์— ๋‘ ๊ทธ๋ฃน์˜ ํ‰๊ท ์ ์ธ ์ฐจ์ด๋ฅผ ๋ฐœ๊ฒฌํ•˜๋ฉด, ์ด ์ฐจ์ด๋ฅผ ์„ค๋ช…ํ•˜๋Š” colossal statement๋ฅผ  ๋งŒ๋“ค ์ˆ˜ ์žˆ๋‹ค.

 

 Those who will regularly work out through the course of the stud and those who will not. The difference is that the decision of whether to work out or not is not left up to the subjects as in the observational study, but is instead imposed by the researcher. At the end, we compare the average energy levels of the two groups based on the observational study even if we find the difference between the average energy levels of these two groups of people, we can't attribute this difference solely to working out. Because there may be other variables that we didn't control for in this study, that contribute to the observed difference. For example, people who are in better shape might be more likely to regularly work out and also have higher energy levels. However, in the experiment, such variables that might also contribute to the outcome are likely equally represented in the two groups due to the random assignment. Therefore, if we find a difference between the two averages, we can indeed make a colossal statement attributing this difference to working out. 

 

* correlation vs causation.

์ƒ๊ด€๊ด€๊ณ„ vs ์ธ๊ณผ๊ด€๊ณ„

 

confounding variables: extraneous variables that affect both the explanatory and the response variable, and that make it seem like there is a relationship between them.

 

 

3. Sampling and sources of bias

* census vs a sample

Previously, we mentioned taking a sample from the population, but one might ask, 


Q: Wouldn't it be better to just include everyone and "sample" the entire population, in other words, conduct a census?

 

A:

As you can imagine, conducting a census takes lots of resources, but there are other reasons why this might not be a good idea.

 

  • First, some individuals may be hard to locate or hard to measure, and these people may be different from the rest of the population. For example, in the US Census, illegal immigrants are often not recorded properly, since they tend to be reluctant to fill out census forms, with the concern that this information could be shared with Immigration.
  • However, these individuals might possess characteristics different than the rest of the population, and hence not getting information from them might result in very unreliable data from geographical regions with high concentrations of illegal immigrants.
  • Another reason why censuses aren't always a good idea, is that populations rarely stand stillEven if you could take a census, the population changes constantly, so it's never really possible to get a perfect measure.

 

If you think about it, sampling is actually quite natural. Think about something you're cooking. We taste, in other words, we examine a small part of what we're cooking, to get an idea about the dish as a whole. 

 

  • We would never eat a whole pot of soup just to check it's taste after all. When you taste a spoonful of soup and 

    decide that spoonful you're tasted isn't salty enoughwhat you're doing is simply exploratory analysis for the sample at hand.

  • If you then generalize and conclude that your entire needs salt, that's making an inference.

  • For your inference to be valid, the spoonful you tasted, your sample, needs to be representative (sample) of your entire pot, your population. 

  • If your spoonful comes only from the surface, and the salt is collected at the bottom of the pot, what you tasted is probably not going to be representative of the whole pot. On the other hand, if you first stir the soup thoroughly before you taste, your spoonful will be more likely to be representative of the whole pot.

 

* sources of bias in studies,

Let's review a few sources of sampling bias.

Convenience sample:

Bias occurs when individuals who are easily accessible, are more likely to be included in the sample.

 

For example, say you want to find out how people in your city feel about a recent increase in public transportation costs. If you only poll people in your neighborhood, as opposed to a representative sample from the whole city,

tour study would suffer from convenience bias.

 

Non-response:

This happens if only a non-random fraction of the randomly sampled people respond

to a survey, such that the sample is no longer representative of the population.

 

For example, say you take a random sample of individuals from your city, and attempt to survey them, but certain segments of the population, say those from a lower socioeconomic status, are less likely to respond to the survey.

 

 

Similar sampling bias is called

voluntary response bias:

which occurs when the sample consists of only people who volunteer to respond because they have strong opinions on the issue.

 

For example, say you place polling machines at all bus stops and metro stations in your city, but only those who choose to do so actually take the time to vote and express their opinion on the recent increase on public transportation costs.

 

2013๋…„ 8์›” CNN์—์„œ ์„œ๋ฐฉ ๊ตญ๊ฐ€๋“ค์ด ์‹œ๋ฆฌ์•„์— ๊ฐœ์ž…ํ•ด์•ผํ•˜๋Š”์ง€ ์—ฌ๋ถ€๋ฅผ ๋ฌป๋Š” ์˜จ๋ผ์ธ ์—ฌ๋ก  ์กฐ์‚ฌ์—์„œ voluntary response bias๋ฅผ ํ™•์ธ ํ•  ์ˆ˜ ์žˆ๋‹ค. ์ด ์„ค๋ฌธ์— ๋‹ตํ•œ ์‚ฌ๋žŒ๋“ค์€ ํˆฌํ‘œ๋ฅผ ํ•ด์•ผํ•œ๋‹ค๊ณ  ๊ฐ•ํ•˜๊ฒŒ ๋Š๋‚€ ์‚ฌ๋žŒ๋“ค๋งŒ cnn ์‚ฌ์ดํŠธ์— ์ ‘์†ํ•ด์„œ ํˆฌํ‘œ๋ฅผ ํ–ˆ์„ ๊ฒƒ์ด๋ฏ€๋กœ, ์„ธ๊ณ„์‹œ๋ฏผ์„ ๋ชจ์ง‘๋‹จ์œผ๋กœ ํ•œ representative sample์ด๋ผ๊ณ  ๋ณผ ์ˆ˜ ์—†๋‹ค. ๊ทธ๋ž˜์„œ ์ด ํˆฌํ‘œ ์ž์ฒด๊ฐ€ ๊ณผํ•™์ ์ด๋ผ๊ณ  ๋ณผ ์ˆ˜ ์—†๋‹ค.

Voluntary response bias clearly exists in online polls like this one from CNN from August 2013, which asked whether the West should intervene in Syria. The people who responded to this poll definitely do not make up a representative sample of the world population, since these are people who happen to have visited cnn.com the day the poll was posted and felt strongly enough to vote. Indeed, the poll results say that this is not a scientific poll for this very reason.

 

 

The difference between voluntary response bias and non-response bias:

  • In non-response there is a random sample that is surveyed, but the people who choose to respond are not representative of the sample.

  • In voluntary response there is no initial random sample.

 

 


Q.

A retail store considering updates to their credit card policies randomly samples 1000 of their credit card holders to survey on the phone. The phone calls are made during business hours, therefore there is a lower rate of responses from members who work during these hours. What type of bias is this indicative of?

 

A.

There is an initial random sample, but not everyone in this random sample is reached. Therefore the issue is non-response of the sampled individuals.

 

 

* Sampling methods

...๋”๋ณด๊ธฐ

Let's examine a historical example of bias sample yielding misleading results. In 1936 Landon sought the Republican presidential nomination opposing the reelection of Franklin Delano Roosevelt, commonly referred to as FDR.

 

A popular magazine of the times, the Literary Digest, polled about 10 million Americans and got responses from about 2.4 million. To put things in perspective, nowadays reliable polls in the US routinely poll about 1,500 people, so this was a huge sample. The poll showed that Landon would likely be the overwhelming winner, and FDR would only get 43% of the votes. In reality, FDR won the election with 62% of the votes. The magazine was completely discredited because of the poll and was soon discontinued.

 

So, if you have never heard of this magazine, this might be the reason why. But what went wrong?

 

The magazine had surveyed its own readers, registered automobile owners and registered telephone users. These groups had incomes well above national average of the day. Remember, this was the great depression era, which resulted in lists of voters far more likely to support Republicans, than a truly typical voter of the time. In other words, the sample was not representative of the American population at the time. While The Literary Digest election poll was based on a sample size of 2.4 million, a huge sample, since the sample was biased, it did not yield an accurate prediction.

 

Going back to the soup analogy, if the soup is not well stirred, it doesn't matter how large a spoon you have, it will still not taste right. If the soup is well stirred, a small spoon will suffice to test the soup.

 

Now that we have a good idea of why we might want to sample, and why it's important for our sample to be representative of the population,

 

simple random sampling(SRS),

๋ชจ์ง‘๋‹จ ๋‚ด์˜ ๋ชจ๋“  case๋ฅผ ๋Œ€์ƒ๋“ค์ด ๊ฐ™์€ ํ™•๋ฅ ๋กœ ์„ ํƒ๋  ์ˆ˜ ์žˆ๋„๋ก ํ•œ๋‹ค.
we randomly select cases from the population, such that each case is equally likely to be selected. This is similar to randomly drawing names from a hat.

 

stratified sampling,

๋ชจ์ง‘๋‹จ์„ strata๋ผ๋Š” ๊ท ์งˆํ•œ ๊ทธ๋ฃน์œผ๋กœ ๋‚˜๋ˆˆ๋‹ค. ๊ฐ stratum ๋‚ด์—์„œ ๋ฌด์ž‘์œ„๋กœ ์ƒ˜ํ”Œ์„ ์ถ”์ถœํ•œ๋‹ค.

we first divide the population into homogenous groups called strata, and then randomly sample from within each stratum. For example, if we wanted to make sure both genders are equally represented in a study, we might divide the population first into males and females, and then randomly sample from within each group.

 

cluster sampling,

๋ชจ์ง‘๋‹จ์„ ํด๋Ÿฌ์Šคํ„ฐ๋กœ ๋‚˜๋ˆ  ๋ฌด์ž‘์œ„๋กœ ๋ช‡๊ฐœ์˜ ํด๋Ÿฌ์Šคํ„ฐ๋ฅผ ์ƒ˜ํ”Œ๋งํ•œ๋‹ค. ๊ทธ๋ฆฌ๊ณ  ์ƒ˜ํ”Œ๋ง๋œ ํด๋Ÿฌ์Šคํ„ฐ์—์„œ ๋ฐœ๊ฒฌ๋œ ๋ชจ๋“  observation์„ sample์— ํฌํ•จํ•œ๋‹ค.

we divide the population into clusters, randomly sample a few clusters, and then sample all observation within these clusters.

 

  • strata์™€ stratified sampling๊ณผ ๋‹ฌ๋ฆฌ clusters์€ ์ž๊ธฐ ์ž์‹  ๋‚ด์—์„œ ์ด์งˆ์ ์ธ ๊ฒƒ์„ ๋งํ•œ๋‹ค ๊ฐ๊ฐ์˜ cluster๋Š” ๋‹ค๋ฅธ cluster์™€ ์œ ์‚ฌํ•˜๋ฏ€๋กœ ๊ทธ๋ƒฅ ๋ช‡๊ฐœ์˜ ํด๋Ÿฌ์Šคํ„ฐ์—์„œ ์ƒ˜ํ”Œ๋ง์„ ์ˆ˜ํ–‰ํ•˜๋ฉด ๋œ๋‹ค.

    The clusters, unlike strata and stratified sampling, are heterogeneous within themselves, and each cluster is similar to another, such that we can get away with just sampling from a few of the clusters.

multistage sampling,

๋ชจ์ง‘๋‹จ์„ ํด๋Ÿฌ์Šคํ„ฐ๋กœ ๋‚˜๋ˆ  ๋ฌด์ž‘์œ„๋กœ ๋ช‡๊ฐœ์˜ ํด๋Ÿฌ์Šคํ„ฐ๋ฅผ ์ƒ˜ํ”Œ๋งํ•œ๋‹ค. ๊ทธ๋ฆฌ๊ณ  ์ƒ˜ํ”Œ๋ง๋œ ํด๋Ÿฌ์Šคํ„ฐ์—์„œ ๋ฐœ๊ฒฌ๋œ observation์— ๋Œ€ํ•ด ๋ฌด์ž‘์œ„๋กœ ์ƒ˜ํ”Œ๋งํ•œ๋‹ค.
In multistage sampling, adds another step to cluster sampling. Just like in cluster sampling, we divide the population into clustersrandomly sample a few clusters, and then we randomly sample observations from within these clusters.  For example, one might divide a city into geographic regions that are on average similar to each other, and then sample randomly a few of these regions, go to these randomly picked regions, and then, sample a few people from within these regions. This avoids the need to travel to all of the regions in the city.

 

  • Usually, we use cluster sampling and multistage sampling for economical reasons.

 

4. Experimental Design

we will discuss principles of experimental design and learn some experimental design terminology.

 

* Principles of experimental design

Control

Randomize

Replicate

Block

 compare the treatment of interest to a control group

randomly assign subjects to treatments

collect a sufficiently large sample, or replicate the entire study

block for variables known or suspected to affect the outcome

Lets discuss blocking a bit more, we would like to design and experiment to investigate if energy gels make you run faster. The treatment group gets the energy gel, the control group does not get any energy gel. It is suspected that energy gels might effect pro and amateur athletes differently therefore we block for pro status.

 

To do so, we divide our sample into pro and amateur athletes, and then, we randomly assign pro and amateur athletes to treatment and control groups, therefor, pro and amateur athletes are equally represented in the resulting treatment and control groups.

 

This way, if we do find a difference in running speed between the treatment and control groups we will be able to attribute it to the treatment, the energy gel, and can be assured that the difference isn't due to pro status since both pro and amateur athletes were equally represented in the treatment and control groups.

 

* blocking variable vs explanatory variable?

  • Explanatory variables also sometimes called factors, are conditions we can impose on our experimental units. 
  • Blocking variables are characteristics that the experimental units come with, that we would like to control for. 
    • Blocking is basically like stratifying,
    • expect used in experimental settings when randomly assigning as opposed to when sampling.

 

5. Random Sample Assignment

we will discuss random sampling and random assignment, two concepts that sound similar, but serve quite different purposes in study design. 

 

Random Sampling

  • Random sampling occurs when subjects are being selected for a study. 
  • If subjects are selected randomly from the population, then each subject in the population is equally likely to be selected, and the resulting sample is likely representative of the population. 
  • Therefore the study's results are generalizable to the population at large. 

Random assignment

  • Random assignment occurs only in experimental settings, where subjects are being assigned to various treatments. 
  • Taking a close look at our sample, we usually see that the subjects exhibit slightly different characteristics from one another. 
  • Through a random assignment, we ensure that these different characteristics are represented equally in the treatment and control groups. 
  • This allows us to attribute any observed difference between the treatment and control groups, to the treatment being observed on the subjects, since otherwise these groups are essentially the same. 
  • In other words, random assignment allows us to make causal conclusions based on the study. 

example,

Suppose you want to conduct a study, evaluating whether people read serif fonts or sans serif, or in other words, without serif fonts faster. Note that serifs are this small jacketed pieces at the ends of each character. 

Ideally, he would first randomly subjects for your study from your population. Then, you assigned the subjects in your sample to two treatment groups. One, where they read some text in serif font, and the other where they read the same text in sans serif font. Through random assignment, we ensure that other factors that may be contributing to reading speed indicated here with the different colors or the subjects. For example, fluency or how often the subject reads for leisure, are represented equally in the two groups. 

 

We call such variables confounders, or confounding variables. In this setting, if we observe any difference between the average reading speeds of the two groups, we can actually attribute it to the actual treatment, the font type, and know that it's likely not due to a confounding variable. 

So to recap, sampling happens first, and assignment happens second.

 

  • A study that employs random sampling and random assignment, can be used to make causal conclusions, and these conclusions can be generalized to the whole population. 
  • This would be an ideal experiment, but site studies are usually difficult to carry out,

 

  • especially if the experimental units are humans, since it may be difficult to randomly sample people from the population, and then impose treatments on them. This is why most experiments recruit volunteer subjects. You may have seen ads for these on a university campus, or in a newspaper.
  • Such human experiments that rely on volunteers employ random assignment, but not random sampling. 
  • These studies can be used to make causal conclusions, but the conclusions only apply to the sample, and the results cannot be generalized. 

 

  • A study that uses no random assignment, but does use random sampling, is your typical observational study. 
  • Results can only be used to make correlation statements, but they can be generalized to the population at large. 

 

  • A final type of study, one that doesn't use random assignment or random sampling, can only be used to make correlational statements, and these conclusions are not generalizable. This is an un-ideal observational study.

 

* Terminology

Placebo:

fake treatment, often used as the control group for medical studies.

Placebo effect: when the experimental unit show improvement simply because they believe they're receiving special treatment.

 

Blinding: when experimental units do not know whether they are in the control or the treatment groups.

Double-blind study: one where both the experimental units and the researchers do not know who is in the control and who is in the treatment group.

 

population - ๋ชจ์ง‘๋‹จ
sample - ํ‘œ๋ณธ
sampling - ํ‘œ๋ณธ ์ถ”์ถœ
observational study - ๊ด€์ฐฐ ์—ฐ๊ตฌ
experiment - ์‹คํ—˜
confounding variable - ๊ต๋ž€๋ณ€์ˆ˜
correlation - ์ƒ๊ด€๊ด€๊ณ„
causation - ์ธ๊ณผ๊ด€๊ณ„
bias - ํŽธํ–ฅ
census - ์ธ๊ตฌ ์กฐ์‚ฌ
simple random sample (SRS) - ๋‹จ์ˆœ์ž„์˜ํ‘œ๋ณธ
stratified sample - ์ธตํ™”ํ‘œ๋ณธ
explanatory variable - ์„ค๋ช…๋ณ€์ˆ˜
response varaible - ๋ฐ˜์‘๋ณ€์ˆ˜
blocking variable - ๊ตฌํš๋ณ€์ˆ˜
control group - ํ†ต์ œ์ง‘๋‹จ
numerical (quantitative) variable - ์ˆ˜์น˜ํ˜• (์–‘์ ) ๋ณ€์ˆ˜
categorical (qualitative) variable - ๋ฒ”์ฃผํ˜• (์งˆ์ ) ๋ณ€์ˆ˜
continuous - ์—ฐ์†ํ˜•
discrete - ์ด์‚ฐํ˜•
ordinal - ์ˆœ์„œํ˜•
scatterplot - ์‚ฐ์ ๋„
outlier - ์ด์ƒ์น˜
histogram - ๋„์ˆ˜๋ถ„ํฌํ‘œ
skewness - ์™œ๋„
kurtosis - ์ฒจ๋„
mean - ํ‰๊ท 
median - ์ค‘๊ฐ„๊ฐ’
mode - ์ตœ๋นˆ๊ฐ’
range - ๋ฒ”์œ„
variance - ๋ถ„์‚ฐ
standard deviation - ํ‘œ์ค€ํŽธ์ฐจ
quantile - ์‚ฌ๋ถ„์œ„์ˆ˜
robustness - ๊ฐ•๊ฑด์„ฑ
transformation - ๋ณ€ํ™˜